A Steered-Response Power (SRP) based Framework for Sound Source Localization using Microphone Arrays in Reverberant Rooms for Enhancement of Speech Intelligibility
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
APPLICATION OF DSP IN BIOMEDICAL ENGINEERING
APPLICATION OF DSP IN BIOMEDICAL ENGINEERING
APPLICATION OF DSP IN BIOMEDICAL ENGINEERING
APPLICATION OF DSP IN BIOMEDICAL ENGINEERING
APPLICATION OF DSP IN BIOMEDICAL ENGINEERING
FPGA Based Acoustic Source Localization ProjectShristi Pradhan
Our Bachelor's final year project for active localization of a sound source using FPGA, microphones and stepper motor. Simulation was also accomplished on Simulink.
APPLICATION OF DSP IN BIOMEDICAL ENGINEERING
APPLICATION OF DSP IN BIOMEDICAL ENGINEERING
APPLICATION OF DSP IN BIOMEDICAL ENGINEERING
APPLICATION OF DSP IN BIOMEDICAL ENGINEERING
APPLICATION OF DSP IN BIOMEDICAL ENGINEERING
FPGA Based Acoustic Source Localization ProjectShristi Pradhan
Our Bachelor's final year project for active localization of a sound source using FPGA, microphones and stepper motor. Simulation was also accomplished on Simulink.
Food Preservation for Farming Communities in Nepal: A Low Cost Engineering So...Shristi Pradhan
I presented our Mondialogo project to a small group of students in an informal setting in Vancouver, Canada. The proposed idea is a low cost solution to preserve potatoes in rural areas of Nepal.
Immersive audio rendering for interactive complex virtual architectural envir...Muhammad Imran
Development of coexistence virtual space to experience acoustically immersive environment during the augmented/virtual reality (AR/VR) telepresence system
Shoichi Koyama, "Source-Location-Informed Sound Field Recording and Reproduction: A Generalization to Arrays of Arbitrary Geometry"
Presented in 2016 AES International Conference on Sound Field Control (July 18-20 2016, Guildford, UK)
Shoichi Koyama, Naoki Murata, and Hiroshi Saruwatari. "Super-resolution in sound field recording and reproduction based on sparse representation"
presented at 5th Joint Meeting Acoustical Society of America and Acoustical Society of Japan (28 Nov. - 2 Dec. 2016, Honolulu, USA)
A novel high resolution doa estimation design algorithm of close sources sign...eSAT Journals
Abstract Underwater target finding in ocean environment has gain considerable interest in both military and civilian applications. In this paper the performance of directions finding techniques, subspace and the non-subspace methods are presented. In this paper, the Eigen analysis of high resolution and supreme resolution algorithms, comparisons and the performance, resolution analysis are done. The analysis is based on linear array elements and the calculation of the pseudo spectra function of the valuation algorithms. Traditional MUSIC algorithm decomposes the signal covariance matrix and then make the signals subspace obtained is orthogonal to the noise subspace, which decreases the effect of the noise. But when the signals intervals are very small, traditional improved MUSIC algorithm has been unable to distinguish the signals as the SNR decreases. A new improved algorithm is introduced using Singular value decomposition of the covariance matrix. An antenna of ULA configuration is taken for both the algorithms. Simulation results show that projected method gives better performance than MUSIC algorithm. In this newly Modified MUSIC algorithm, conditions required for under water environment are taken into account such as water density, permittivity of water, pressure, Signal to Noise Ratio, speed of sound wave in water. Keywords: Underwater Communication, Number of Snapshots, Antenna Noise, Uniform Linear Array (ULA) and Distance Between Array Elements.
Direction of Arrival Estimation Based on MUSIC Algorithm Using Uniform and No...IJERA Editor
In signal processing, the direction of arrival (DOA) estimation denotes the direction from which a propagating wave arrives at a point, where a set of antennas is located. Using the array antenna has an advantage over the single antenna in achieving an improved performance by applying Multiple Signal Classification (MUSIC) algorithm. This paper focuses on estimating the DOA using uniform linear array (ULA) and non-uniform linear array (NLA)of antennas to analyze the performance factors that affect the accuracy and resolution of the system based on MUSIC algorithm. The direction of arrival estimation is simulated on a MATLAB platform with a set of input parameters such as array elements, signal to noise ratio, number of snapshots and number of signal sources. An extensive simulation has been conducted and the results show that the NLA with DOA estimation for co-prime array can achieve an accurate and efficient DOA estimation
Room Transfer Function Estimation and Room Equalization in Noise EnvironmentsIJERA Editor
Audio quality in listening situation is degraded by indoor room reverberation. Room equalization can be used to
increase the audio quality by applying the inverse transfer function to the input audio signals. In noise
environments, however, it is hard to exactly measure the room transfer function. In this work, we developed the
techniques to measure the room transfer function in indoor noise environments and to enhance the audio quality
by room equalization. From the experimental results, we showed that the proposed techniques can be
successfully used in indoor noise environments
Spectrum Sensing Detection with Sequential Forward Search in Comparison to Kn...IJMTST Journal
FCC is currently working on the concept of white space users “borrowing” spectrum from free license
holders temporarily to improve the spectrum utilization.
This project provides a relation between a Pf and the SNR value of any spectrum detector to have a
certain performance. Previous spectrum sensing detection techniques are only suitable for Low SNR and
are based on signal information values. But these methods are purely narrow band spectrum applications
In order to overcome the above said drawbacks we propose a novel method of spectrum sensing method
and is suitable for low and high SNR values, the sensed spectrum applicable for wide band applications.
Our proposed method does not require signal information at the receiver and channel information, because
this flexibility sensing rate is very high compared to previous techniques.
Design and Implementation of Polyphase based Subband Adaptive Structure for N...Pratik Ghotkar
With the tremendous growth in the Digital Signal processing technology, there are many techniques available to remove noise from the speech signals which is used in the speech processing. Widely used LMS algorithm is modified with much advancement but still there are many limitations are introducing. This paper consist of a new approach i.e. subband adaptive processing for noise cancelation in the speech signals. Subband processing employs the multirate signal processing. The polyphase based subband adaptive implementation finds better results in term of MMSE , PSNR and processing time; also the synthesis filter bank is works on the lower data rate which reduces the computational Burdon as compare to the direct implementation of Subband adaptive filter. The normalized least mean squares (NLMS) algorithm is a class of adaptive filter used.
Speech Enhancement Using Spectral Flatness Measure Based Spectral SubtractionIOSRJVSP
This paper is aimed to reduce background noise introduced in speech signal during capture, storage, transmission and processing using Spectral Subtraction algorithm. To consider the fact that colored noise corrupts the speech signal non-uniformly over different frequency bands, Multi-Band Spectral Subtraction (MBSS) approach is exploited wherein amount of noise subtracted from noisy speech signal is decided by a weighting factor. Choice of optimal values of weights decides the performance of the speech enhancement system. In this paper weights are decided based on SFM (Spectral Flatness Measure) than conventional SNR (Signal to Noise Ratio) based rule. Since SFM is able to provide true distinction between speech signal and noise signal. Spectrogram, Mean Opinion Score show that speech enhanced from proposed SFM based MBSS possess better perceptual quality and improved intelligibility than existing SNR based MBSS
Bayesian modelling and computation for Raman spectroscopyMatt Moores
Raman spectroscopy can be used to identify molecules by the characteristic scattering of light from a laser. Each Raman-active dye label has a unique spectral signature, comprised by the locations and amplitudes of the peaks. The Raman spectrum is discretised into a multivariate observation that is highly collinear, hence it lends itself to a reduced-rank representation. We introduce a sequential Monte Carlo (SMC) algorithm to separate this signal into a series of peaks plus a smoothly-varying baseline, corrupted by additive white noise. By incorporating this representation into a Bayesian functional regression, we can quantify the relationship between dye concentration and peak intensity. We also estimate the model evidence using SMC to investigate long-range dependence between peaks. These methods have been implemented as an R package, using RcppEigen and OpenMP.
Slideshow of the presentation given at Eusiipco 2016
Due to its possible low-power implementation, Compressed Sensing (CS) is an attractive tool for physiological signal acquisition in emerging scenarios like Wireless Body Sensor Networks (WBSN) and telemonitoring applications. In this work we consider the continuous monitoring and analysis of the fetal ECG signal (fECG). We propose a modification of the low-complexity CS reconstruction SL0 algorithm, improving its robustness in the presence of noisy original signals and possibly ill-conditioned sensing/reconstruction procedures. We show that, while maintaining the same computational cost of the original algorithm, the proposed modification significantly improves the reconstruction quality, both for synthetic and real-world ECG signals. We also show that the proposed algorithm allows robust heart beat classification when sparse matrices, implementable with very low computational complexity, are used for compressed sensing of the ECG signal.
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...CSCJournals
In this paper, a voice activity detector is proposed on the basis of Gaussian modeling of noise in the spectro-temporal space. Spectro-temporal space is obtained from auditory cortical processing. The auditory model that offers a multi-dimensional picture of the sound includes two stages: the initial stage is a model of inner ear and the second stage is the auditory central cortical modeling in the brain. In this paper, the speech noise in this picture has been modeled by a 3-D mono Gaussian cluster. At the start of suggested VAD process, the noise is modeled by a Gaussian shaped cluster. The average noise behavior is obtained in different spectrotemporal space in various points for each frame. In the stage of separation of speech from noise, the criterion is the difference between the average noise behavior and the speech signal amplitude in spectrotemporal domain. This was measured for each frame and was used as the criterion of classification. Using Noisex92, this method is tested in different noise models such as White, exhibition, Street, Office and Train noises. The results are compared to both auditory model and multifeature method. It is observed that the performance of this method in low signal-to-noise ratios (SNRs) conditions is better than other current methods.
Cosmetic shop management system project report.pdfKamal Acharya
Buying new cosmetic products is difficult. It can even be scary for those who have sensitive skin and are prone to skin trouble. The information needed to alleviate this problem is on the back of each product, but it's thought to interpret those ingredient lists unless you have a background in chemistry.
Instead of buying and hoping for the best, we can use data science to help us predict which products may be good fits for us. It includes various function programs to do the above mentioned tasks.
Data file handling has been effectively used in the program.
The automated cosmetic shop management system should deal with the automation of general workflow and administration process of the shop. The main processes of the system focus on customer's request where the system is able to search the most appropriate products and deliver it to the customers. It should help the employees to quickly identify the list of cosmetic product that have reached the minimum quantity and also keep a track of expired date for each cosmetic product. It should help the employees to find the rack number in which the product is placed.It is also Faster and more efficient way.
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdffxintegritypublishin
Advancements in technology unveil a myriad of electrical and electronic breakthroughs geared towards efficiently harnessing limited resources to meet human energy demands. The optimization of hybrid solar PV panels and pumped hydro energy supply systems plays a pivotal role in utilizing natural resources effectively. This initiative not only benefits humanity but also fosters environmental sustainability. The study investigated the design optimization of these hybrid systems, focusing on understanding solar radiation patterns, identifying geographical influences on solar radiation, formulating a mathematical model for system optimization, and determining the optimal configuration of PV panels and pumped hydro storage. Through a comparative analysis approach and eight weeks of data collection, the study addressed key research questions related to solar radiation patterns and optimal system design. The findings highlighted regions with heightened solar radiation levels, showcasing substantial potential for power generation and emphasizing the system's efficiency. Optimizing system design significantly boosted power generation, promoted renewable energy utilization, and enhanced energy storage capacity. The study underscored the benefits of optimizing hybrid solar PV panels and pumped hydro energy supply systems for sustainable energy usage. Optimizing the design of solar PV panels and pumped hydro energy supply systems as examined across diverse climatic conditions in a developing country, not only enhances power generation but also improves the integration of renewable energy sources and boosts energy storage capacities, particularly beneficial for less economically prosperous regions. Additionally, the study provides valuable insights for advancing energy research in economically viable areas. Recommendations included conducting site-specific assessments, utilizing advanced modeling tools, implementing regular maintenance protocols, and enhancing communication among system components.
Final project report on grocery store management system..pdfKamal Acharya
In today’s fast-changing business environment, it’s extremely important to be able to respond to client needs in the most effective and timely manner. If your customers wish to see your business online and have instant access to your products or services.
Online Grocery Store is an e-commerce website, which retails various grocery products. This project allows viewing various products available enables registered users to purchase desired products instantly using Paytm, UPI payment processor (Instant Pay) and also can place order by using Cash on Delivery (Pay Later) option. This project provides an easy access to Administrators and Managers to view orders placed using Pay Later and Instant Pay options.
In order to develop an e-commerce website, a number of Technologies must be studied and understood. These include multi-tiered architecture, server and client-side scripting techniques, implementation technologies, programming language (such as PHP, HTML, CSS, JavaScript) and MySQL relational databases. This is a project with the objective to develop a basic website where a consumer is provided with a shopping cart website and also to know about the technologies used to develop such a website.
This document will discuss each of the underlying technologies to create and implement an e- commerce website.
Student information management system project report ii.pdfKamal Acharya
Our project explains about the student management. This project mainly explains the various actions related to student details. This project shows some ease in adding, editing and deleting the student details. It also provides a less time consuming process for viewing, adding, editing and deleting the marks of the students.
Saudi Arabia stands as a titan in the global energy landscape, renowned for its abundant oil and gas resources. It's the largest exporter of petroleum and holds some of the world's most significant reserves. Let's delve into the top 10 oil and gas projects shaping Saudi Arabia's energy future in 2024.
Hierarchical Digital Twin of a Naval Power SystemKerry Sado
A hierarchical digital twin of a Naval DC power system has been developed and experimentally verified. Similar to other state-of-the-art digital twins, this technology creates a digital replica of the physical system executed in real-time or faster, which can modify hardware controls. However, its advantage stems from distributing computational efforts by utilizing a hierarchical structure composed of lower-level digital twin blocks and a higher-level system digital twin. Each digital twin block is associated with a physical subsystem of the hardware and communicates with a singular system digital twin, which creates a system-level response. By extracting information from each level of the hierarchy, power system controls of the hardware were reconfigured autonomously. This hierarchical digital twin development offers several advantages over other digital twins, particularly in the field of naval power systems. The hierarchical structure allows for greater computational efficiency and scalability while the ability to autonomously reconfigure hardware controls offers increased flexibility and responsiveness. The hierarchical decomposition and models utilized were well aligned with the physical twin, as indicated by the maximum deviations between the developed digital twin hierarchy and the hardware.
Using recycled concrete aggregates (RCA) for pavements is crucial to achieving sustainability. Implementing RCA for new pavement can minimize carbon footprint, conserve natural resources, reduce harmful emissions, and lower life cycle costs. Compared to natural aggregate (NA), RCA pavement has fewer comprehensive studies and sustainability assessments.
Planning Of Procurement o different goods and services
Sound Source Localization
1. 1 November 2016
HANYANG UNIVERSITY
ARCHITECTURAL ACOUSTICS LAB
Office +82-2-2220-1795 | Fax +82-2-2220-4794
http://acoustics.hanyang.ac.kr
Muhammad Imran, Jin Yong Jeon
December 12, 2015
A Steered-Response Power (SRP) based Framework for
Sound Source Localization using Microphone Arrays in
Reverberant Rooms for Enhancement of Speech Intelligibility
42. Jahrestagung für Akustik, 14.-17. März 2016
2. 1 November 2016
HANYANG UNIVERSITY
ARCHITECTURAL ACOUSTICS LAB
2
o Introduction
o Background and Motivation
o Sound Source Localization
• Methodology
◦ VAD: Voice Activity Detection
◦ SRP (Beamforming) filters
◦ PHAT-weighting
• Real-time Framework and Implementation
• Optimization and Clustering
o Results
o Conclusion
Contents
3. 1 November 2016
HANYANG UNIVERSITY
ARCHITECTURAL ACOUSTICS LAB
3
o Sound Source Localization and Tracking using microphone arrays for
• Room Acoustics measurements
• Teleconference Systems
o Traditional Methods
• Time-delay estimation (TDOA) techniques between microphone pairs using Correlation
Function, ignoring
◦ Ambient noise
◦ Reflections from surrounding
◦ Reverberation in closed space
o Therefore
• Producing poor results in terms of Precision, Resolution and Robustness
• Require additional post-processing to track multiple sources in real time applications
• Limited bandwidth
Introduction (Background and Motivation)
4. 1 November 2016
HANYANG UNIVERSITY
ARCHITECTURAL ACOUSTICS LAB
4
o The Methods for sound source localization using microphone arrays
o Time difference of arrival estimation (TDOA)
o Generalized cross-correlation (GCC)
o Weighting function
o Optimum detection in the presence of reverberant environment
o Improved Signal to Noise Ratio (SNR)
o Steered Response Power (SRP)
o Weighting function as Beamformer
o Source localization and tracking
o Robust in Reverb Condition
Sound Source Localization
5. 1 November 2016
HANYANG UNIVERSITY
ARCHITECTURAL ACOUSTICS LAB
5
o The method is based on Steered-response power (SRP)
o The power at the beamformer output as a function of the look-up direction 𝑐 is
o Weighted Steered-response Power
o MVDR beamformer as Weight
o After simplifications
Methodology (1/3)
𝑃𝐵𝐹(𝑐) = 𝐷(𝑐) 𝐻 𝑆𝐷(𝑐)
𝑆 is cross-power matrix
𝐷(𝑐) is Array directivity
𝑤 =
𝐷 𝐻
(𝑐)𝑆−1
𝐷 𝐻(𝑐)𝑆−1 𝐷(𝑐)
𝑃 𝑀𝑉𝐷𝑅(𝑐) =
1
𝐷(𝑐) 𝐻 𝑆−1 𝐷(𝑐)
6. 1 November 2016
HANYANG UNIVERSITY
ARCHITECTURAL ACOUSTICS LAB
6
o Combining the Bins
o Combining the signals 𝑃 𝑀𝑉𝐷𝑅(𝑐, 𝑘) from different frequency bins
o Approach used is PHAT weighting
o Information of noise variance can be used and the final beamformer is improved
o Therefore,
Methodology (2/3)
𝑃𝑆𝑆𝐿(𝑐) =
1
𝐾
𝑘=1
𝐾
𝑀
𝑋 𝑘
𝐻
𝑋 𝑘
𝑃 𝑀𝑉𝐷𝑅(𝑐, 𝑘)
𝑋 𝑘 input vector with length M containing the input signals for this frequency bin from all microphones
𝑁𝑘 =
1
𝑀
𝑖
𝑁𝑖(𝑘)
𝑃𝑆𝑆𝐿(𝑐) =
1
𝐾
𝑘=1
𝐾
𝑀
𝑞𝑋 𝑘
𝐻
𝑋 𝑘 + (1 − 𝑞) 𝑁𝑘
𝑃 𝑀𝑉𝐷𝑅(𝑐, 𝑘)
7. 1 November 2016
HANYANG UNIVERSITY
ARCHITECTURAL ACOUSTICS LAB
7
o Post Processing (Simple Clustering)
o Algorithm used is so-called “Bucket Clustering”
o Step: 1; Grouping the Measurements
o Based on Single-frame Information of azimuth ′𝜑′, elevation ′𝜃′, standard deviations ′𝜎 𝜑
′
and ′𝜎 𝜃
′ microphone-array working volume is computed as (50% overlapping)
o Step: 2; Number of Cluster Candidates
o Applying threshold defined as the average confidence of sections with more than one measurement:
o Step: 3; Averaging the Measurements in Each Cluster Candidate
Methodology (3/3)
𝑀 = 4
𝜑 𝑚𝑎𝑥 − 𝜑 𝑚𝑖𝑛
6𝜎 𝜑
×
𝜃 𝑚𝑎𝑥 − 𝜃 𝑚𝑖𝑛
6𝜎 𝜃
𝐶 𝑇ℎ =
1
𝐿
𝑖=1
𝑀
𝑗=1
𝑁 𝑖
𝐶𝑖𝑗
𝐶𝑖𝑗 confidence of the jth measurement in the ith section
𝑁𝑖 number of measurements in the ith section, 𝑀 number of sections
𝐿 number of sections with number of measurements larger than 1
8. 1 November 2016
HANYANG UNIVERSITY
ARCHITECTURAL ACOUSTICS LAB
8
Real time framework
o Sound capturing by 3D microphone array (6-channel)
o Data is Subjected to Framing and windowing block
o Short time discrete Fourier transform (DFT)
o Voice Activity Detectors (VAD)
• VAD are used for detecting active signals
• Based on Energy and Spectral shifts for each frame
o Localization block (Source estimation)
• MVDR Weights for each frequency bin
• PHAT weightings for combining all frequency response
o Optimization (Improving the localization estimates by averaging several measurements)
• Simple Clustering
Framework
9. 1 November 2016
HANYANG UNIVERSITY
ARCHITECTURAL ACOUSTICS LAB
9
Measurement setup and Procedure
o Microphone Array:
o 6-channel orthogonal array
o Speech Sources:
o Three Speech Sources placed at 0o, 45o, -45o Azimuth
o Speech duration = 20 sec
o 1.5 m from array
o Pure Speech mixed with Pink Noise
o SNR is 15 dB
Evaluation
Source 02 (-45o)
Source 03 (+45o)
Source 01 (0o)
10. 1 November 2016
HANYANG UNIVERSITY
ARCHITECTURAL ACOUSTICS LAB
Results (1/3)
o Localization Results at 0o azimuth
• VAD Voiced Frames = 500
Results
Number of Frames 496
STD 2.03
Localization Error 0.027
Localization error =
1
𝑁
𝑖=1
𝑁
𝜑𝑖 − 𝜑𝑖
11. 1 November 2016
HANYANG UNIVERSITY
ARCHITECTURAL ACOUSTICS LAB
Results (2/3)
o Localization Results at 45o azimuth
• VAD Voiced Frames = 500
Results
Number of Frames 493
STD 1.9
Localization error 0.081
Localization error =
1
𝑁
𝑖=1
𝑁
𝜑𝑖 − 𝜑𝑖
12. 1 November 2016
HANYANG UNIVERSITY
ARCHITECTURAL ACOUSTICS LAB
Results (3/3)
o Localization Results at -45o azimuth
• VAD Voiced Frames = 500
Results
Number of Frames 494
STD 1.3
Localization error 0.058
Localization error =
1
𝑁
𝑖=1
𝑁
𝜑𝑖 − 𝜑𝑖
13. 1 November 2016
HANYANG UNIVERSITY
ARCHITECTURAL ACOUSTICS LAB
Conclusion
o Presented a framework for sound source localization
• Using six channel spherical microphone array based on
SRP-MVDR algorithms weighted with PHAT
VAD is used for extracting voiced frames of Speech
Optimization using clustering method for accurate localization
o Producing convincing results within the accuracy of ±2o works well for 15 dB SNR
o MVDR weighted SPR Localizer estimated the sound sources with STD value of ±1.3o in
DOA
Editor's Notes
Hello
I have been presenting a framework for sound source localization using SRP Method in order to get a clean speech signal in reverb environments form a particular speaker
Staring from background and motivation of the presented work, I would talk about the methodology of the sound localizer, including the usage of VAD as voiced frame detection in speech signals and PHAT as combining the frequency bins of the beamformer for different microphone pairs of arrays.
Secondly, a real-time frame would be introduced with optimization of the results of DOA
Finally, results will be shown and discussed
The common purpose of sound source localizer and tracker is for room acoustics measurements and teleconference systems where the speakers are detected and localized for reverb rooms
Traditional methods use TDOA between single pair of microphone using GCC. Some recent techniques have been introduced that uses different microphone combination for computing GCC function with certain weighting functions.
But these methods often ignores the ambience and reverb room condition, therefore offering poor results in terms of Precision, Resolution and Robustness. Additionally they require post-processing algorithm to track multiple sources in real time applications and have limited bandwidth
Generally the sound localization is categorized into two areas
Time difference of arrival estimation (TDOA), where Generalized cross-correlation are used for finding the delay time among different microphone pairs of arrays
Steered Response Power (SRP) methods are used for steering the source and commonly applied for tracking purposed
We have been discussing in this presentation a SRP based approach
These slides would present the methodology and necessary mathematical background for our proposed framework
The general formula for beamforming is show in first equation, where 𝑆 is cross-power matrix and 𝐷(𝑐) is Array directivity
We have bean using MVDR as weighting function for the beamformer and after simplification we get the final equation for the beamformer steering at direction c
This beamformer is for single frequency bin
After getting the beamformer functions for al frequencies of interest, we Combining the signals 𝑃 𝑀𝑉𝐷𝑅 (𝑐,𝑘) from different frequency bins.
The approach followed is by using PHAT weight as a combiner
Here, 𝑋 𝑘 input vector with length M containing the input signals for this frequency bin from all microphones
In addition, the noise variance information is used and the final beamformer is improved as shown in last equation
A post processing is performed using so-called “Bucket Clustering” Algorithm for improving the confidence level of measurements and detecting actual sound source among reflections
This slide describes the Real time framework of the method
Sound is captured and transformed into frequency domain
VAD is applied
And the data f voiced frames are subjected to localization block
Finally, Optimization is performed for Improving the localization estimates by averaging several measurements using Simple Clustering
For evaluation
6-ch microphone array is used
Three Speech Sources were placed at 0o, 45o, -45o Azimuth at a distance of 1.5 from array and speech was recorded for 20sec
The speech was mixed with pink noise and final speech has a 15dB SNR
Localization Results at 0o azimuth
Localization Results at 45o azimuth
Localization Results at -45o azimuth
Presented a framework for sound source localization
Using six channel spherical microphone array based on
SRP-MVDR algorithms weighted with PHAT
VAD is used for extracting voiced frames of Speech
Optimization using clustering method for accurate localization
Producing convincing results within the accuracy of ±2o works well for 15 dB SNR
MVDR weighted SPR Localizer estimated the sound sources with STD value of ±1.3o in DOA