SlideShare a Scribd company logo
1
Detection of Acoustic Landmarks for
Speech Processing with High
Resolution
M.Tech Credit Seminar
Pushpa Gothwal (09307054)
Supervisor: Prof. P. C. Pandey
Electrical Engineering Department
November 2009
2
 Introduction
 Landmarks and their categorization
 Landmark detection methods
1. Manual labeling of landmarks
2. Detection of abrupt consonant and abrupt
landmarks
3. Stop consonant landmark detection method
 Summary and Future work
Outline
2
3
Introduction
Perception of speech under adverse listening
conditions is improved by processing of speech
Landmark detection is needed for processing
Aim : To study 3 different methods of landmark
detection and compare their temporal
resolution
3
4
 Introduction
Landmarks and their categorization
 Landmark detection methods
1. Manual labeling of landmarks
2. Detection of abrupt consonant and abrupt landmarks
3. Stop consonant landmark detection method
 Summary and Future work
4
5
.
 Landmarks is the region where the spectral
discontinuity in speech.
 They can be categorized as:
– Abrupt Consonantal :It is the closure and release of
constriction. Example- /able/
– Abrupt: It shows the change in sound due to glottal
activity. Example- /paint/
– Nonabrupt: It marks the transition between semivowel
to vowel and vice versa. Example-/away/
– Vocalic: It occurs when the vocal cord is extremely
open for a vowel. Example-/bat/
What is a Landmark?
6
An illustration of landmarks. AC = abrupt-consonantal, A = abrupt,
N = nonabrupt, V = vocalic (Lui 1996)
7
 Introduction
 Landmarks and their categorization
 Landmark detection methods
1. Manual labeling of landmarks
2. Detection of abrupt consonant and abrupt landmarks
3. Stop consonant landmark detection method
 Summary and Future work
7
8
Manual labeling of landmarks
Spectrogram of /aba/ (Prat)
9
 Introduction
 Landmarks and their categorization
 Landmark detection methods
1. Manual labeling of landmarks
2. Detection of abrupt consonant and abrupt landmarks
3. Stop consonant landmark detection method
 Summary and Future work
9
10
Detection of abrupt consonant and abrupt
landmarks
 It detects two landmarks
 Spectrum is divided into 6 bands
Band1. 0.0-0.4 Khz
2. 0.8-1.5
3. 1.2-2.0
4. 2.0-3.5
5. 3.5-5.0
6. 5.0-8.0
Band 1-Monitor glottal activity
Band 2-5-Monitor Closure and release of sonorant
Band 6-Monitor the stop
11Landmark detection algorithm (Lui 1996)
Detection of abrupt consonant and abrupt
landmarks (cont.)
12
Spectrogram of “the money is coming today". The middle figure shows
energy of band 1; and bottom figure shows ROR of band.(Lui,1996)
Detection of abrupt consonant and abrupt
landmarks (cont.)
13
 Introduction
 Landmarks and their categorization
 Landmark detection methods
1. Manual labeling of landmarks
2. Detection of abrupt consonant and abrupt landmarks
3. Stop consonant landmark detection method
 Summary and Future work
13
14
Pass I
Step 1 : Spectrum is divided into 5 bands
Band Frequency (kHz)
1 0.0-0.4 (Monitor glottal vibration)
2 0.4-1.2
3 1.2-2.0
4 2.0-3.5
5 3.5-5.0
(Consonant closure and
release)
Stop consonant landmark detection method
15
Short time spectral
analysis
Computation of energy peaks and
centroids
Computation of RORs energy and
centroid
Computation of spectral transition
index
Landmark localization
Wavelet decomposition around
landmarks
Computation of short time energy
and ZCR
Computation of energy and ZCR
RORs
Landmark localization
Landmark
(Pass 1)
Landmark
(Pass 2)
Pass 1 Pass2
Processing stage for landmark detection (Arjun et al., 2008)
speech
16
Step 2 - Computation of energy peaks and centroid in frequency bands
where k1 and k2 upper and lower frequency index for band b,n frame.
Centroid frequency is
k2 k2
fc(b,n)= ∑ k|Xn(k)|
2
/ ∑ |Xn(k)|2
fs/N (2)
k=k1 k=k1
Ep (b, n) = 10 log10 (max [|X n (k)|] 2
), k1 ≤ k ≤k2 (1)
Stop consonant landmark detection method
(cont.)
17
Step 3-Computation of energy and centroid RORs
E'p(b,n) = | Ep(b, n+K) − Ep(b,n−K)| (3)
f'c(b, n) = | fc(b, n+K) − fc(b,n−K) | (4)
Stop consonant landmark detection method
(cont.)
18
Step 4-Computation of transition index for energy and centroid
frequency
5
Tec(n) = 1/5∑E’pn(p, n)f’cn(b,n) (5)
b=1
Stop consonant landmark detection method
(cont.)
19
Waveform for /uka/ , ROR for band1(b), band2(c), band3(d) (Arjun et al.,2008)
Stop consonant landmark detection method
(cont.)
20
Processing results /uka/ of (Arjun et al., 2008)
Stop consonant landmark detection method
(cont.)
21
(a) Windowed segment used in second pass, (b) energy and ZCR ROR’s of level 1,
(c) ROR’s of level 2, and (d) transition index Tez computed from ROR’s in (b) and (c)
(Arjun et.al.2008)
Stop consonant landmark detection method
(cont.)
22
Pass2:
Step1-Compute the wavelet decomposition for segmenting the speech
Step2-Compute the energy and Zero Crossing Rate (ZCR)
Step3-Compute the ROR for energy and ZCR
Stop consonant landmark detection method
(cont.)
23
 Introduction
 Landmarks and their categorization
 Landmark detection methods
1. Manual labeling of landmarks
2. Detection of abrupt consonant and abrupt landmarks
3. Stop consonant landmark detection method
Summary and Future work
23
24
Summary
The first method of landmark detection is time
consuming and tedious. Moreover the resolution is
also very poor.
The second method is relatively faster but it also
gives poor temporal resolution.
The third method gives very high temporal resolution
at a faster pace.
24
25
Future Work
To focus on the algorithms for landmark
detection in speech and to improvise them to
implement in the phone-based recognition
system.
26
REFERENCES
[Lui 1996] S. A. Liu, “Landmark detection for distinctive feature based
speech recognition,” J. acoust. Soc. Am., vol. 100, no. 5, pp. 3417-
3430.
[Arjun et al., 2008] A.R.Jayan,P.C.Pandey and ,”Detection of Acoustic
Landmarks with high resolution for Speech Processing” Procc,14th
National conf.communication.
[Alani et al.,1999] A.Alani and M.Deriche, “A novel approach to speech
segmentation using the wavelet transform,” in proc.5th int.stmp.signal
Processing and Applications.(ISSSPA’99),127-129.
[OS 2001] D. O'shaughnesey, Speech Communications: Humans and
Machine, University Press (India).
[L.R., 2008] L. R. Rabiner, R. W. Schafer, Digital Processing of Speech
Signals, Pearson Education Inc. and Dorling Kindersley Publishing Inc., India.

More Related Content

Viewers also liked

Silent sound technology
Silent sound technologySilent sound technology
Silent sound technology
Kishan Vemula
 
Voice morphing
Voice morphingVoice morphing
Voice morphing
sukhbeer2314
 
Computer engineering and it seminar topics
Computer engineering and  it seminar topicsComputer engineering and  it seminar topics
Computer engineering and it seminar topics
Ace Beau
 
Silent sound-technology ppt final
Silent sound-technology ppt finalSilent sound-technology ppt final
Silent sound-technology ppt final
Lohit Dalal
 
Artificial intelligence Speech recognition system
Artificial intelligence Speech recognition systemArtificial intelligence Speech recognition system
Artificial intelligence Speech recognition system
REHMAT ULLAH
 
Solar roadways presented by RK
Solar roadways presented by RKSolar roadways presented by RK
Solar roadways presented by RK
Raam Krishna Prakash Jagali
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
Richie
 

Viewers also liked (7)

Silent sound technology
Silent sound technologySilent sound technology
Silent sound technology
 
Voice morphing
Voice morphingVoice morphing
Voice morphing
 
Computer engineering and it seminar topics
Computer engineering and  it seminar topicsComputer engineering and  it seminar topics
Computer engineering and it seminar topics
 
Silent sound-technology ppt final
Silent sound-technology ppt finalSilent sound-technology ppt final
Silent sound-technology ppt final
 
Artificial intelligence Speech recognition system
Artificial intelligence Speech recognition systemArtificial intelligence Speech recognition system
Artificial intelligence Speech recognition system
 
Solar roadways presented by RK
Solar roadways presented by RKSolar roadways presented by RK
Solar roadways presented by RK
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
 

Similar to Detection of acoustic landmark

Trend removal from raman spectra with local variance estimation and cubic spl...
Trend removal from raman spectra with local variance estimation and cubic spl...Trend removal from raman spectra with local variance estimation and cubic spl...
Trend removal from raman spectra with local variance estimation and cubic spl...
csijjournal
 
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
csijjournal
 
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
csijjournal
 
Trend Removal from Raman Spectra With Local Variance Estimation and Cubic Spl...
Trend Removal from Raman Spectra With Local Variance Estimation and Cubic Spl...Trend Removal from Raman Spectra With Local Variance Estimation and Cubic Spl...
Trend Removal from Raman Spectra With Local Variance Estimation and Cubic Spl...
csijjournal
 
Spectrum Sensing Detection with Sequential Forward Search in Comparison to Kn...
Spectrum Sensing Detection with Sequential Forward Search in Comparison to Kn...Spectrum Sensing Detection with Sequential Forward Search in Comparison to Kn...
Spectrum Sensing Detection with Sequential Forward Search in Comparison to Kn...
IJMTST Journal
 
Performance evaluation of different spectrum sensing techniques for realistic...
Performance evaluation of different spectrum sensing techniques for realistic...Performance evaluation of different spectrum sensing techniques for realistic...
Performance evaluation of different spectrum sensing techniques for realistic...
ijwmn
 
In tech recent-advances_in_synthetic_aperture_radar_enhancement_and_informati...
In tech recent-advances_in_synthetic_aperture_radar_enhancement_and_informati...In tech recent-advances_in_synthetic_aperture_radar_enhancement_and_informati...
In tech recent-advances_in_synthetic_aperture_radar_enhancement_and_informati...
Naivedya Mishra
 
Derivative spectroscopy
Derivative spectroscopyDerivative spectroscopy
Derivative spectroscopy
akkimi padama
 
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
CSCJournals
 
Ac35162165
Ac35162165Ac35162165
Ac35162165
IJERA Editor
 
Ac35162165
Ac35162165Ac35162165
Ac35162165
IJERA Editor
 
Ac35162165
Ac35162165Ac35162165
Ac35162165
IJERA Editor
 
6.5.1 Review of Beat Detection and Phase Alignment Techniques used in DMM
6.5.1 Review of Beat Detection and Phase Alignment Techniques used in DMM6.5.1 Review of Beat Detection and Phase Alignment Techniques used in DMM
6.5.1 Review of Beat Detection and Phase Alignment Techniques used in DMM
Nasir Ahmad
 
Design and implementation of different audio restoration techniques for audio...
Design and implementation of different audio restoration techniques for audio...Design and implementation of different audio restoration techniques for audio...
Design and implementation of different audio restoration techniques for audio...
eSAT Journals
 
Blind, Non-stationary Source Separation Using Variational Mode Decomposition ...
Blind, Non-stationary Source Separation Using Variational Mode Decomposition ...Blind, Non-stationary Source Separation Using Variational Mode Decomposition ...
Blind, Non-stationary Source Separation Using Variational Mode Decomposition ...
CSCJournals
 
Cognitive Radio
Cognitive RadioCognitive Radio
Cognitive Radio
Owaîs Járå
 
A Novel Method for Silence Removal in Sounds Produced by Percussive Instruments
A Novel Method for Silence Removal in Sounds Produced by Percussive InstrumentsA Novel Method for Silence Removal in Sounds Produced by Percussive Instruments
A Novel Method for Silence Removal in Sounds Produced by Percussive Instruments
IJMTST Journal
 
Modeling and Estimation of Stationary and Non-stationary Noises of Rubidium A...
Modeling and Estimation of Stationary and Non-stationary Noises of Rubidium A...Modeling and Estimation of Stationary and Non-stationary Noises of Rubidium A...
Modeling and Estimation of Stationary and Non-stationary Noises of Rubidium A...
IJERA Editor
 
IRJET - Essential Features Extraction from Aaroh and Avroh of Indian Clas...
IRJET -  	  Essential Features Extraction from Aaroh and Avroh of Indian Clas...IRJET -  	  Essential Features Extraction from Aaroh and Avroh of Indian Clas...
IRJET - Essential Features Extraction from Aaroh and Avroh of Indian Clas...
IRJET Journal
 
A New Underwater Acoustic Navigation Method Based on the Doppler Principle
A New Underwater Acoustic Navigation Method Based on the Doppler PrincipleA New Underwater Acoustic Navigation Method Based on the Doppler Principle
A New Underwater Acoustic Navigation Method Based on the Doppler Principle
Nooria Sukmaningtyas
 

Similar to Detection of acoustic landmark (20)

Trend removal from raman spectra with local variance estimation and cubic spl...
Trend removal from raman spectra with local variance estimation and cubic spl...Trend removal from raman spectra with local variance estimation and cubic spl...
Trend removal from raman spectra with local variance estimation and cubic spl...
 
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
 
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
 
Trend Removal from Raman Spectra With Local Variance Estimation and Cubic Spl...
Trend Removal from Raman Spectra With Local Variance Estimation and Cubic Spl...Trend Removal from Raman Spectra With Local Variance Estimation and Cubic Spl...
Trend Removal from Raman Spectra With Local Variance Estimation and Cubic Spl...
 
Spectrum Sensing Detection with Sequential Forward Search in Comparison to Kn...
Spectrum Sensing Detection with Sequential Forward Search in Comparison to Kn...Spectrum Sensing Detection with Sequential Forward Search in Comparison to Kn...
Spectrum Sensing Detection with Sequential Forward Search in Comparison to Kn...
 
Performance evaluation of different spectrum sensing techniques for realistic...
Performance evaluation of different spectrum sensing techniques for realistic...Performance evaluation of different spectrum sensing techniques for realistic...
Performance evaluation of different spectrum sensing techniques for realistic...
 
In tech recent-advances_in_synthetic_aperture_radar_enhancement_and_informati...
In tech recent-advances_in_synthetic_aperture_radar_enhancement_and_informati...In tech recent-advances_in_synthetic_aperture_radar_enhancement_and_informati...
In tech recent-advances_in_synthetic_aperture_radar_enhancement_and_informati...
 
Derivative spectroscopy
Derivative spectroscopyDerivative spectroscopy
Derivative spectroscopy
 
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
 
Ac35162165
Ac35162165Ac35162165
Ac35162165
 
Ac35162165
Ac35162165Ac35162165
Ac35162165
 
Ac35162165
Ac35162165Ac35162165
Ac35162165
 
6.5.1 Review of Beat Detection and Phase Alignment Techniques used in DMM
6.5.1 Review of Beat Detection and Phase Alignment Techniques used in DMM6.5.1 Review of Beat Detection and Phase Alignment Techniques used in DMM
6.5.1 Review of Beat Detection and Phase Alignment Techniques used in DMM
 
Design and implementation of different audio restoration techniques for audio...
Design and implementation of different audio restoration techniques for audio...Design and implementation of different audio restoration techniques for audio...
Design and implementation of different audio restoration techniques for audio...
 
Blind, Non-stationary Source Separation Using Variational Mode Decomposition ...
Blind, Non-stationary Source Separation Using Variational Mode Decomposition ...Blind, Non-stationary Source Separation Using Variational Mode Decomposition ...
Blind, Non-stationary Source Separation Using Variational Mode Decomposition ...
 
Cognitive Radio
Cognitive RadioCognitive Radio
Cognitive Radio
 
A Novel Method for Silence Removal in Sounds Produced by Percussive Instruments
A Novel Method for Silence Removal in Sounds Produced by Percussive InstrumentsA Novel Method for Silence Removal in Sounds Produced by Percussive Instruments
A Novel Method for Silence Removal in Sounds Produced by Percussive Instruments
 
Modeling and Estimation of Stationary and Non-stationary Noises of Rubidium A...
Modeling and Estimation of Stationary and Non-stationary Noises of Rubidium A...Modeling and Estimation of Stationary and Non-stationary Noises of Rubidium A...
Modeling and Estimation of Stationary and Non-stationary Noises of Rubidium A...
 
IRJET - Essential Features Extraction from Aaroh and Avroh of Indian Clas...
IRJET -  	  Essential Features Extraction from Aaroh and Avroh of Indian Clas...IRJET -  	  Essential Features Extraction from Aaroh and Avroh of Indian Clas...
IRJET - Essential Features Extraction from Aaroh and Avroh of Indian Clas...
 
A New Underwater Acoustic Navigation Method Based on the Doppler Principle
A New Underwater Acoustic Navigation Method Based on the Doppler PrincipleA New Underwater Acoustic Navigation Method Based on the Doppler Principle
A New Underwater Acoustic Navigation Method Based on the Doppler Principle
 

Recently uploaded

一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
ecqow
 
Computational Engineering IITH Presentation
Computational Engineering IITH PresentationComputational Engineering IITH Presentation
Computational Engineering IITH Presentation
co23btech11018
 
International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...
gerogepatton
 
The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.
sachin chaurasia
 
AI assisted telemedicine KIOSK for Rural India.pptx
AI assisted telemedicine KIOSK for Rural India.pptxAI assisted telemedicine KIOSK for Rural India.pptx
AI assisted telemedicine KIOSK for Rural India.pptx
architagupta876
 
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by AnantLLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
Anant Corporation
 
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECTCHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
jpsjournal1
 
Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...
IJECEIAES
 
Certificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi AhmedCertificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi Ahmed
Mahmoud Morsy
 
john krisinger-the science and history of the alcoholic beverage.pptx
john krisinger-the science and history of the alcoholic beverage.pptxjohn krisinger-the science and history of the alcoholic beverage.pptx
john krisinger-the science and history of the alcoholic beverage.pptx
Madan Karki
 
Welding Metallurgy Ferrous Materials.pdf
Welding Metallurgy Ferrous Materials.pdfWelding Metallurgy Ferrous Materials.pdf
Welding Metallurgy Ferrous Materials.pdf
AjmalKhan50578
 
Material for memory and display system h
Material for memory and display system hMaterial for memory and display system h
Material for memory and display system h
gowrishankartb2005
 
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student MemberIEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
VICTOR MAESTRE RAMIREZ
 
Introduction to AI Safety (public presentation).pptx
Introduction to AI Safety (public presentation).pptxIntroduction to AI Safety (public presentation).pptx
Introduction to AI Safety (public presentation).pptx
MiscAnnoy1
 
Curve Fitting in Numerical Methods Regression
Curve Fitting in Numerical Methods RegressionCurve Fitting in Numerical Methods Regression
Curve Fitting in Numerical Methods Regression
Nada Hikmah
 
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
171ticu
 
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
Gino153088
 
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
shadow0702a
 
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Sinan KOZAK
 
BRAIN TUMOR DETECTION for seminar ppt.pdf
BRAIN TUMOR DETECTION for seminar ppt.pdfBRAIN TUMOR DETECTION for seminar ppt.pdf
BRAIN TUMOR DETECTION for seminar ppt.pdf
LAXMAREDDY22
 

Recently uploaded (20)

一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
 
Computational Engineering IITH Presentation
Computational Engineering IITH PresentationComputational Engineering IITH Presentation
Computational Engineering IITH Presentation
 
International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...
 
The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.
 
AI assisted telemedicine KIOSK for Rural India.pptx
AI assisted telemedicine KIOSK for Rural India.pptxAI assisted telemedicine KIOSK for Rural India.pptx
AI assisted telemedicine KIOSK for Rural India.pptx
 
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by AnantLLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
 
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECTCHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
 
Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...
 
Certificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi AhmedCertificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi Ahmed
 
john krisinger-the science and history of the alcoholic beverage.pptx
john krisinger-the science and history of the alcoholic beverage.pptxjohn krisinger-the science and history of the alcoholic beverage.pptx
john krisinger-the science and history of the alcoholic beverage.pptx
 
Welding Metallurgy Ferrous Materials.pdf
Welding Metallurgy Ferrous Materials.pdfWelding Metallurgy Ferrous Materials.pdf
Welding Metallurgy Ferrous Materials.pdf
 
Material for memory and display system h
Material for memory and display system hMaterial for memory and display system h
Material for memory and display system h
 
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student MemberIEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
 
Introduction to AI Safety (public presentation).pptx
Introduction to AI Safety (public presentation).pptxIntroduction to AI Safety (public presentation).pptx
Introduction to AI Safety (public presentation).pptx
 
Curve Fitting in Numerical Methods Regression
Curve Fitting in Numerical Methods RegressionCurve Fitting in Numerical Methods Regression
Curve Fitting in Numerical Methods Regression
 
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
 
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
 
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
 
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
 
BRAIN TUMOR DETECTION for seminar ppt.pdf
BRAIN TUMOR DETECTION for seminar ppt.pdfBRAIN TUMOR DETECTION for seminar ppt.pdf
BRAIN TUMOR DETECTION for seminar ppt.pdf
 

Detection of acoustic landmark

  • 1. 1 Detection of Acoustic Landmarks for Speech Processing with High Resolution M.Tech Credit Seminar Pushpa Gothwal (09307054) Supervisor: Prof. P. C. Pandey Electrical Engineering Department November 2009
  • 2. 2  Introduction  Landmarks and their categorization  Landmark detection methods 1. Manual labeling of landmarks 2. Detection of abrupt consonant and abrupt landmarks 3. Stop consonant landmark detection method  Summary and Future work Outline 2
  • 3. 3 Introduction Perception of speech under adverse listening conditions is improved by processing of speech Landmark detection is needed for processing Aim : To study 3 different methods of landmark detection and compare their temporal resolution 3
  • 4. 4  Introduction Landmarks and their categorization  Landmark detection methods 1. Manual labeling of landmarks 2. Detection of abrupt consonant and abrupt landmarks 3. Stop consonant landmark detection method  Summary and Future work 4
  • 5. 5 .  Landmarks is the region where the spectral discontinuity in speech.  They can be categorized as: – Abrupt Consonantal :It is the closure and release of constriction. Example- /able/ – Abrupt: It shows the change in sound due to glottal activity. Example- /paint/ – Nonabrupt: It marks the transition between semivowel to vowel and vice versa. Example-/away/ – Vocalic: It occurs when the vocal cord is extremely open for a vowel. Example-/bat/ What is a Landmark?
  • 6. 6 An illustration of landmarks. AC = abrupt-consonantal, A = abrupt, N = nonabrupt, V = vocalic (Lui 1996)
  • 7. 7  Introduction  Landmarks and their categorization  Landmark detection methods 1. Manual labeling of landmarks 2. Detection of abrupt consonant and abrupt landmarks 3. Stop consonant landmark detection method  Summary and Future work 7
  • 8. 8 Manual labeling of landmarks Spectrogram of /aba/ (Prat)
  • 9. 9  Introduction  Landmarks and their categorization  Landmark detection methods 1. Manual labeling of landmarks 2. Detection of abrupt consonant and abrupt landmarks 3. Stop consonant landmark detection method  Summary and Future work 9
  • 10. 10 Detection of abrupt consonant and abrupt landmarks  It detects two landmarks  Spectrum is divided into 6 bands Band1. 0.0-0.4 Khz 2. 0.8-1.5 3. 1.2-2.0 4. 2.0-3.5 5. 3.5-5.0 6. 5.0-8.0 Band 1-Monitor glottal activity Band 2-5-Monitor Closure and release of sonorant Band 6-Monitor the stop
  • 11. 11Landmark detection algorithm (Lui 1996) Detection of abrupt consonant and abrupt landmarks (cont.)
  • 12. 12 Spectrogram of “the money is coming today". The middle figure shows energy of band 1; and bottom figure shows ROR of band.(Lui,1996) Detection of abrupt consonant and abrupt landmarks (cont.)
  • 13. 13  Introduction  Landmarks and their categorization  Landmark detection methods 1. Manual labeling of landmarks 2. Detection of abrupt consonant and abrupt landmarks 3. Stop consonant landmark detection method  Summary and Future work 13
  • 14. 14 Pass I Step 1 : Spectrum is divided into 5 bands Band Frequency (kHz) 1 0.0-0.4 (Monitor glottal vibration) 2 0.4-1.2 3 1.2-2.0 4 2.0-3.5 5 3.5-5.0 (Consonant closure and release) Stop consonant landmark detection method
  • 15. 15 Short time spectral analysis Computation of energy peaks and centroids Computation of RORs energy and centroid Computation of spectral transition index Landmark localization Wavelet decomposition around landmarks Computation of short time energy and ZCR Computation of energy and ZCR RORs Landmark localization Landmark (Pass 1) Landmark (Pass 2) Pass 1 Pass2 Processing stage for landmark detection (Arjun et al., 2008) speech
  • 16. 16 Step 2 - Computation of energy peaks and centroid in frequency bands where k1 and k2 upper and lower frequency index for band b,n frame. Centroid frequency is k2 k2 fc(b,n)= ∑ k|Xn(k)| 2 / ∑ |Xn(k)|2 fs/N (2) k=k1 k=k1 Ep (b, n) = 10 log10 (max [|X n (k)|] 2 ), k1 ≤ k ≤k2 (1) Stop consonant landmark detection method (cont.)
  • 17. 17 Step 3-Computation of energy and centroid RORs E'p(b,n) = | Ep(b, n+K) − Ep(b,n−K)| (3) f'c(b, n) = | fc(b, n+K) − fc(b,n−K) | (4) Stop consonant landmark detection method (cont.)
  • 18. 18 Step 4-Computation of transition index for energy and centroid frequency 5 Tec(n) = 1/5∑E’pn(p, n)f’cn(b,n) (5) b=1 Stop consonant landmark detection method (cont.)
  • 19. 19 Waveform for /uka/ , ROR for band1(b), band2(c), band3(d) (Arjun et al.,2008) Stop consonant landmark detection method (cont.)
  • 20. 20 Processing results /uka/ of (Arjun et al., 2008) Stop consonant landmark detection method (cont.)
  • 21. 21 (a) Windowed segment used in second pass, (b) energy and ZCR ROR’s of level 1, (c) ROR’s of level 2, and (d) transition index Tez computed from ROR’s in (b) and (c) (Arjun et.al.2008) Stop consonant landmark detection method (cont.)
  • 22. 22 Pass2: Step1-Compute the wavelet decomposition for segmenting the speech Step2-Compute the energy and Zero Crossing Rate (ZCR) Step3-Compute the ROR for energy and ZCR Stop consonant landmark detection method (cont.)
  • 23. 23  Introduction  Landmarks and their categorization  Landmark detection methods 1. Manual labeling of landmarks 2. Detection of abrupt consonant and abrupt landmarks 3. Stop consonant landmark detection method Summary and Future work 23
  • 24. 24 Summary The first method of landmark detection is time consuming and tedious. Moreover the resolution is also very poor. The second method is relatively faster but it also gives poor temporal resolution. The third method gives very high temporal resolution at a faster pace. 24
  • 25. 25 Future Work To focus on the algorithms for landmark detection in speech and to improvise them to implement in the phone-based recognition system.
  • 26. 26 REFERENCES [Lui 1996] S. A. Liu, “Landmark detection for distinctive feature based speech recognition,” J. acoust. Soc. Am., vol. 100, no. 5, pp. 3417- 3430. [Arjun et al., 2008] A.R.Jayan,P.C.Pandey and ,”Detection of Acoustic Landmarks with high resolution for Speech Processing” Procc,14th National conf.communication. [Alani et al.,1999] A.Alani and M.Deriche, “A novel approach to speech segmentation using the wavelet transform,” in proc.5th int.stmp.signal Processing and Applications.(ISSSPA’99),127-129. [OS 2001] D. O'shaughnesey, Speech Communications: Humans and Machine, University Press (India). [L.R., 2008] L. R. Rabiner, R. W. Schafer, Digital Processing of Speech Signals, Pearson Education Inc. and Dorling Kindersley Publishing Inc., India.

Editor's Notes

  1. MODon