Speech measurement using laser Doppler vibrometer


I'am Ajas

Technology

Guided by: Asif Ali, Lecturer in E.I
Presented by: Shamil. C, Roll no: 68, E.I

• Introduction
• Speech measurement with LDV
• Principle of LDV
• Measurement setup
• Problem formulation
• Speech enhancement algorithm
• Speckle-noise suppression
• LDV-based time-frequency VAD
• Spectral gain modification
• Experimental results
• Conclusion

• Achieving high speech intelligibility in noisy environments is one of the most challenging and important problems for existing speech-enhancement and speech-recognition systems.
• Recently, several approaches have been proposed that make use of auxiliary non-acoustic sensors, such as bone and throat microphones.
• A major drawback of most existing sensors is the requirement for physical contact between the sensor and the speaker.
• Here we present an alternative approach that enables remote measurement of speech, using an auxiliary laser Doppler vibrometer (LDV) sensor.

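• The Doppler frequency shift measured by the LDV is
$f_d(t) = 2\nu(t)\cos(\alpha)/\lambda$

where
$\nu(t)$ is the instantaneous throat-vibrational velocity,
$\alpha$ is the angle between the object beam and the velocity vector,
$\lambda$ is the laser wavelength.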

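• For a sinusoidal throat vibration $\nu(t) = A_\nu\cos(2\pi f_\nu t)$, the LDV output signal after an FM demodulator is
$Z(t) = f_b + [2A_\nu\cos(\alpha)/\lambda]\cos(2\pi f_\nu t)$, (1)
where $f_b$ is a constant frequency offset.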
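
As a rough numerical illustration of the Doppler relation and of Eq. (1), the sketch below evaluates both for made-up values. The function names, the 633 nm wavelength, the 1 mm/s peak velocity, the 200 Hz vibration frequency, and the 40 MHz offset are all assumptions chosen for the example, not values from the slides.

```python
import math

def doppler_shift(velocity, alpha, wavelength):
    """Doppler frequency shift f_d = 2 * v * cos(alpha) / lambda, in Hz."""
    return 2.0 * velocity * math.cos(alpha) / wavelength

def ldv_output(t, f_b, a_v, f_v, alpha, wavelength):
    """FM-demodulated LDV output Z(t) of Eq. (1) for a sinusoidal vibration."""
    velocity = a_v * math.cos(2.0 * math.pi * f_v * t)
    return f_b + doppler_shift(velocity, alpha, wavelength)

# Hypothetical setup: 633 nm laser, beam aligned with the motion (alpha = 0),
# 1 mm/s peak vibration velocity at 200 Hz, 40 MHz constant offset.
WAVELENGTH = 633e-9
peak_shift = doppler_shift(1e-3, 0.0, WAVELENGTH)
z0 = ldv_output(0.0, 40e6, 1e-3, 200.0, 0.0, WAVELENGTH)
print(f"peak Doppler shift: {peak_shift:.1f} Hz")
print(f"Z(0) - f_b: {z0 - 40e6:.1f} Hz")
```

Because the shift scales with cos(α), a beam perpendicular to the motion (α = 90°) would see almost no shift, which is why the alignment of the beam matters in the measurement setup.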

• Let $y(n) = x(n) + d(n)$, where
$y(n)$ is the signal observed by the acoustic sensor,
$x(n)$ is the speech signal,
$d(n)$ is an uncorrelated additive noise signal.
• In the STFT domain,
$Y_{lk} = X_{lk} + D_{lk}$
where $l = 0, 1, \ldots$ is the frame index and $k = 0, 1, \ldots, N-1$ is the frequency-bin index.

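The STFT uses overlapping frames of $N$ samples with a framing step of $M$ samples.
Let $H_{0lk}$ and $H_{1lk}$ indicate, respectively, the speech-absence and speech-presence hypotheses in the time-frequency bin $(l, k)$, i.e.,
$H_{0lk}$: $Y_{lk} = D_{lk}$
$H_{1lk}$: $Y_{lk} = X_{lk} + D_{lk}$.

The speech estimate is obtained by applying a spectral gain $G_{lk}$ to each noisy bin:
$\hat{X}_{lk} = G_{lk} Y_{lk}$.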
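
The framing and the gain step can be sketched as follows. This is a minimal illustration, assuming a rectangular analysis window and a naive DFT (a practical system would normally use a tapered window and an FFT); the function names and the toy signals are hypothetical.

```python
import cmath

def stft(signal, n, m):
    """Naive STFT: frames of n samples, step of m samples, rectangular window."""
    frames = []
    for start in range(0, len(signal) - n + 1, m):
        frame = signal[start:start + n]
        spectrum = [
            sum(frame[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)
        ]
        frames.append(spectrum)
    return frames  # frames[l][k] is the value in time-frequency bin (l, k)

def apply_gain(noisy_stft, gain):
    """Spectral estimate X_hat[l][k] = G[l][k] * Y[l][k]."""
    return [[g * y for g, y in zip(g_row, y_row)]
            for g_row, y_row in zip(gain, noisy_stft)]

# Toy signals (assumed for illustration only).
x = [0.0, 1.0, 0.0, -1.0] * 4            # "speech"
d = [0.1, -0.1] * 8                       # additive noise
y = [xi + di for xi, di in zip(x, d)]     # observed signal y(n) = x(n) + d(n)

N, M = 8, 4                               # frame length and framing step
X, D, Y = stft(x, N, M), stft(d, N, M), stft(y, N, M)

# The additive model carries over to every bin: Y_lk = X_lk + D_lk.
max_err = max(abs(Y[l][k] - X[l][k] - D[l][k])
              for l in range(len(Y)) for k in range(N))
print(len(Y), "frames, max |Y - (X + D)| =", max_err)

# A constant gain of 0.5 in every bin simply halves the noisy spectrum.
half = apply_gain(Y, [[0.5] * N for _ in Y])
```

With N = 8 and M = 4, consecutive frames overlap by half a frame, which is the kind of overlap the framing description refers to.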

The OM-LSA estimator minimizes the mean-square error of the log-spectral amplitude under signal presence uncertainty, resulting in
$G_{lk} = \{G_{H_1 lk}\}^{P_{lk}} \, G_{\min}^{1 - P_{lk}}$
where $G_{H_1 lk}$ is a conditional gain function given $H_{1lk}$, $G_{\min} \ll 1$ is a constant attenuation factor, and $P_{lk}$ is the conditional speech presence probability.

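• Denoting by $\xi_{lk}$ and $\gamma_{lk}$ the a priori SNR and the a posteriori SNR, respectively, the speech presence probability $P_{lk}$ is obtained as a function of $\xi_{lk}$, $\gamma_{lk}$, and the a priori probability for speech absence.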
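
The OM-LSA gain rule can be sketched numerically. The expression for the speech presence probability is not legible in this transcript, so the code assumes the form commonly given in the OM-LSA literature, P = 1 / (1 + q/(1 − q) · (1 + ξ) · exp(−υ)) with υ = γξ/(1 + ξ). The symbol q for the a priori speech absence probability, the function names, and the sample values are assumptions. The conditional gain G_H1 is passed in rather than computed.

```python
import math

def speech_presence_probability(xi, gamma, q):
    """Conditional speech presence probability (assumed standard OM-LSA form).

    xi: a priori SNR, gamma: a posteriori SNR,
    q: a priori probability of speech absence, with 0 < q < 1.
    """
    v = gamma * xi / (1.0 + xi)
    return 1.0 / (1.0 + (q / (1.0 - q)) * (1.0 + xi) * math.exp(-v))

def omlsa_gain(g_h1, p, g_min):
    """OM-LSA gain G = G_H1**P * G_min**(1 - P)."""
    return (g_h1 ** p) * (g_min ** (1.0 - p))

G_MIN = 0.1  # hypothetical constant attenuation factor, G_min << 1

# High-SNR bin: speech is almost certainly present, so G approaches G_H1.
p_high = speech_presence_probability(xi=10.0, gamma=12.0, q=0.5)
g_high = omlsa_gain(g_h1=0.9, p=p_high, g_min=G_MIN)

# Low-SNR bin: presence is uncertain, so G is pulled toward G_min.
p_low = speech_presence_probability(xi=0.01, gamma=1.0, q=0.5)
g_low = omlsa_gain(g_h1=0.9, p=p_low, g_min=G_MIN)

print(f"high SNR: P = {p_high:.4f}, G = {g_high:.4f}")
print(f"low SNR:  P = {p_low:.4f}, G = {g_low:.4f}")
```

The geometric weighting means the gain never drops below G_min and never exceeds G_H1, so G_min acts as a floor on the attenuation applied to noise-only bins.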

• Speckle-Noise Suppression

The output of the speckle-noise detector is
$W_l(n) = G_l Z_l(n)$
where $G_l = G_{s,\min} \ll 1$ for $I_l = 1$ (speckle noise is present), and $G_l = 1$ otherwise.
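
A minimal sketch of this frame-wise attenuation is given below. The conclusion of the talk states that the decision rule is kurtosis-based; the exact rule is not given in this transcript, so the simple threshold test, the non-overlapping frames, and the values of the threshold and of G_s,min are assumptions made for illustration.

```python
def kurtosis(frame):
    """Sample kurtosis m4 / m2**2 (equals 3 for a Gaussian signal)."""
    n = len(frame)
    mean = sum(frame) / n
    m2 = sum((s - mean) ** 2 for s in frame) / n
    if m2 == 0.0:
        return 0.0
    m4 = sum((s - mean) ** 4 for s in frame) / n
    return m4 / (m2 * m2)

def suppress_speckle(z, frame_len, threshold, g_s_min):
    """Return W_l(n) = G_l * Z_l(n) and the per-frame indicators I_l."""
    w, indicators = [], []
    for start in range(0, len(z), frame_len):
        frame = z[start:start + frame_len]
        i_l = 1 if kurtosis(frame) > threshold else 0   # assumed decision rule
        g_l = g_s_min if i_l else 1.0
        indicators.append(i_l)
        w.extend(g_l * s for s in frame)
    return w, indicators

# Toy LDV signal: one clean frame, then one frame with an impulsive spike.
clean = [1.0, -1.0] * 8
spiky = [0.1, -0.1] * 8
spiky[5] = 10.0
z = clean + spiky

w, indicators = suppress_speckle(z, frame_len=16, threshold=6.0, g_s_min=0.05)
print(indicators)
```

Impulsive speckle bursts make the amplitude distribution of a frame heavy-tailed, which is why kurtosis is a natural statistic for flagging them.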

- One term represents the noise-estimate bias.
- Another is a smoothed version of the power spectrum.

Then, we propose the following soft-decision VAD:

Speech in a given frame is defined by

We attenuate high-energy transient components to the level of the stationary background noise by updating the gain floor, based on:
- the stationary noise-spectrum estimate
- the smoothed noisy spectrum
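
The exact gain-floor update is not legible in this transcript. The sketch below shows one hypothetical rule that matches the description: in bins where the VAD reports no speech and the smoothed noisy spectrum exceeds the stationary noise estimate, the floor is scaled by the square root of their ratio. The function name, the rule itself, and the numbers are assumptions, not the authors' formula.

```python
import math

def modified_gain_floor(g_min, noise_psd, smoothed_psd, speech_present):
    """Hypothetical gain-floor update for transient suppression.

    g_min: original constant gain floor
    noise_psd: stationary noise-spectrum estimate in this bin
    smoothed_psd: smoothed noisy spectrum in this bin
    speech_present: VAD decision for this bin
    """
    if speech_present or smoothed_psd <= noise_psd:
        return g_min                      # keep the usual floor
    return g_min * math.sqrt(noise_psd / smoothed_psd)

G_MIN = 0.1
noise, transient = 1.0, 100.0             # transient is 20 dB above the noise

floor = modified_gain_floor(G_MIN, noise, transient, speech_present=False)
residual_power = floor ** 2 * transient   # power left after applying the floor

print(f"modified floor: {floor:.3f}")
print(f"residual power: {residual_power:.4f} (stationary residual: {G_MIN ** 2 * noise:.4f})")
```

Under this assumed rule, the residual power of the transient equals that of attenuated stationary noise, so the transient no longer stands out above the background.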

• Speckle noise was successfully attenuated from the LDV-measured signal using a kurtosis-based decision rule.
• A soft-decision VAD was derived in the time-frequency domain and the gain function of the OM-LSA algorithm was appropriately modified.
• The effectiveness of the proposed approach in suppressing highly non-stationary noise components was demonstrated.



