SlideShare a Scribd company logo
CSE, Indian Institute of Technology Bombay
Pitch Detection of Singing Voice in Tabla
Accompaniment
Ashutosh Bapat
(03305010)
CSE, Indian Institute of Technology Bombay
Outline
 Motivation
 Music transcription
 Pitch & pitch detection
 Signal characteristics
 Two-way mismatch procedure
 Post processing
 DP based smoothing
 Pitch correction
 Experimental evaluation
 Conclusion
CSE, Indian Institute of Technology Bombay
Automatic Music Transcription (AMT) system
 Converts acoustic musical signal to symbolic
representation
 Documents musical attributes
 Pitch
 Timbre
 Rhythm
 Pitch is the most salient
 Melody = pitch contour
CSE, Indian Institute of Technology Bombay
AMT for Indian classical music
 Components of Indian classical and semi-classical
music
 Melody line sung by a single main voice
o Gamakas: are described by detailed pitch contour
 Accompaniment of tabla and tanpura
 Musicological and pedagogical applications
 Rich archives of audio recordings => need for a
reliable PDA for singing voice pitch tracking in tabla
accompaniment
CSE, Indian Institute of Technology Bombay
Pitch
 Many definitions exist
 Pitch of a signal is defined as the fundamental
frequency of an approximately harmonic pattern in
spectral representation of signal.
 Pitch period is defined as average length of several
periods of the signal.
0
0
1
T
F =
CSE, Indian Institute of Technology Bombay
Pitch detection
• Input: musical signal
• Output: pitch contour
Song waveform Pitch contour
CSE, Indian Institute of Technology Bombay
Pitch Detection Algorithm
 Preprocessor: Data reduction and enhancement
 Commonly used method: Filtering
 Basic Extractor: Estimates single/multiple pitch
candidates per frame
 Commonly used method: ACF
 Post processing:
 Measure of reliability of each candidate
 Smoothness of pitch contour
o Commonly used method: Dynamic programming
CSE, Indian Institute of Technology Bombay
Singing voice
• Pitch evolves
continuously
• Shows inflexions
like bents, stresses,
oscillations etc.
CSE, Indian Institute of Technology Bombay
Classification of tabla strokes
 Number of drums
 Simple strokes: Na, Ge, Ke, Tit, Tun etc.
 Complex strokes: Dha, Dhin, Dhun
 Harmonicity
 Harmonic: Na, Tin, Tun, Ge
 Inharmonic: Ke, Tit
 Rate of Decay:
 Slowly decaying: Na, Ge, Tin, Tun
 Fast decaying: Ke, Tit
CSE, Indian Institute of Technology Bombay
Harmonic interference: Na
CSE, Indian Institute of Technology Bombay
Single partial interference: Ge
CSE, Indian Institute of Technology Bombay
Noisy interference: Ke
CSE, Indian Institute of Technology Bombay
Pitch detection of mixed song
Waveform of song mixed with
stroke Na
Pitch contour by ACF
CSE, Indian Institute of Technology Bombay
Two-way mismatch procedure
TWM error, F0 = 300 Hz
CSE, Indian Institute of Technology Bombay
TWM and ACF: harmonic interference
• Complex tone of 450 Hz + signal
simulating Na
• In TWM error we can see
minimum at correct pitch
• In ACF all peaks are at lags
corresponding to 790 Hz
TWM error ACF
Magnitude plot
CSE, Indian Institute of Technology Bombay
TWM and ACF: single partial interference
• Complex tone of 300 Hz mixed with a single partial with
amplitude varied from 0 to 100.
• TWM is more robust than ACF
400 Hz 450 Hz
CSE, Indian Institute of Technology Bombay
TWM and ACF correlograms
• Correlograms of complex tone of 300 Hz mixed with stroke
Na
• Notice horizontal line at 300 Hz in TWM
• No clue to lag 73 (corresponds to 300 Hz)
ACF correlogram TWM error correlogram
CSE, Indian Institute of Technology Bombay
TWM pitch contour
• Pitch contour of song mixed with stroke Na
• Notice large pitch artifacts during strokes
CSE, Indian Institute of Technology Bombay
Post processing
CSE, Indian Institute of Technology Bombay
DP based smoothing
 Smoothing based on
 Measure of reliability of pitch candidates
 Smoothness of pitch contour
 Measurement cost:
 Smoothness cost:
 Local transition cost:
 Global transition cost:
∑=
−=
N
j
jjpjpTNpjppS
1
)),1(),(())(,),(,),1(( 
))(),1(()),(()),1(),(( jpjpWjjpEjjpjpT −+=−
),( jpE
)',( ppW
CSE, Indian Institute of Technology Bombay
Smoothness cost
• The width of bell varies
proportional to pitch
• Pitch variation at high
pitches is expected to be
more than that at low
pitches
• Saturates at high values
pc
s
pp
ppW e
*
)'(
1)',(
2
2
=
−
−=
−
σ
σ
CSE, Indian Institute of Technology Bombay
Pitch contour after applying DP
• Smoothened pitch contour
• Suppresses fast pitch variations
• May introduce errors where tabla is absent
CSE, Indian Institute of Technology Bombay
Pitch correction
• Searches for deepest local minimum in 6% range near pitch
estimated by DP
• Corrects most of the fine errors
CSE, Indian Institute of Technology Bombay
Experimental evaluation
 Test samples
 Samples produced by digitally adding tabla strokes Na,
Ge, Ke to pure song waveforms sung with syllable /la/
and /aa/
 Algorithms
 TWM:
 TDP: TWM + DP
 TDC: TWM + DP + PC
 Errors
 Fine error: error magnitude between 3% to 6%
 Gross error: error magnitue above 6%
CSE, Indian Institute of Technology Bombay
Results
• DP has decreased number of gross errors increasing number
of fine errors
• PC has decreased number of fine errors
• Better performance in case of songs with slowly varying pitch
contours
TWM TDP TDC
F G F G F G
Na 0.0 49.
3
4.6 13.
4
2.1 14.
8
Ge 0.0 20.
9
3.9 2.1 3.4 2.5
Ke 0.0 25.
7
4.9 5.1 0.2 5.1
TWM TDP TDC
F G F G F G
Na 4.7 14.
7
11.
6
1.6 5.3 2.1
Ge 0.0 22.
5
8.1 4.4 1.5 4.1
Ke 0.0 17.
9
7.2 2.5 0.0 2.5
Song with many fast variations of pitch Song with slowly varying pitch contour
Error rates in percentage
CSE, Indian Institute of Technology Bombay
Errors after application of DP + PC
• Errors remaining after application of DP and pitch correction
are found in regions with fast variations in pitch
CSE, Indian Institute of Technology Bombay
Conclusion
 Importance of music transcription
 Characteristics of tabla strokes
 Two-way mismatch PDA
 Results showing improvements by application of DP
smoothing and pitch correction
 Applications in building pitch detector for Indian
classical and semi classical music
CSE, Indian Institute of Technology Bombay
Future work
 Combination of ACF and TWM to take advantage of
 Lesser computational complexity of ACF
 ACF’s robustness to noise, thus better results in Ke
 Classification of frames by presence/ absence of
tabla strokes
 Use pitch estimated by DP and pitch correction only in
frames containing tabla stroke
 Application of advanced techniques:
 adaptive windowing, peak selection, selective search
 Pitch tracking in case of complex strokes like Dha
and words like TiReKiTa
CSE, Indian Institute of Technology Bombay
Thank you
CSE, Indian Institute of Technology Bombay
State space formulation of DP

More Related Content

What's hot

Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using Vocoder
IJTET Journal
 
Linear Predictive Coding
Linear Predictive CodingLinear Predictive Coding
Linear Predictive Coding
Srishti Kakade
 
Digital speech processing lecture1
Digital speech processing lecture1Digital speech processing lecture1
Digital speech processing lecture1Samiul Parag
 
Speech Signal Processing
Speech Signal ProcessingSpeech Signal Processing
Speech Signal Processing
Murtadha Alsabbagh
 
Voice morphing document
Voice morphing documentVoice morphing document
Voice morphing documenthimadrigupta
 
Voice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from LaryngectomyVoice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from Laryngectomy
International Journal of Science and Research (IJSR)
 
Rain Technology
Rain TechnologyRain Technology
Rain Technology
amal390
 
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications IDSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
Amr E. Mohamed
 
Voicemorphingppt 110328163403-phpapp01
Voicemorphingppt 110328163403-phpapp01Voicemorphingppt 110328163403-phpapp01
Voicemorphingppt 110328163403-phpapp01Madhu Babu
 

What's hot (11)

Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using Vocoder
 
Linear Predictive Coding
Linear Predictive CodingLinear Predictive Coding
Linear Predictive Coding
 
Digital speech processing lecture1
Digital speech processing lecture1Digital speech processing lecture1
Digital speech processing lecture1
 
Speech Signal Processing
Speech Signal ProcessingSpeech Signal Processing
Speech Signal Processing
 
Voice morphing document
Voice morphing documentVoice morphing document
Voice morphing document
 
Speech processing
Speech processingSpeech processing
Speech processing
 
Speech coding std
Speech coding stdSpeech coding std
Speech coding std
 
Voice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from LaryngectomyVoice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from Laryngectomy
 
Rain Technology
Rain TechnologyRain Technology
Rain Technology
 
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications IDSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
 
Voicemorphingppt 110328163403-phpapp01
Voicemorphingppt 110328163403-phpapp01Voicemorphingppt 110328163403-phpapp01
Voicemorphingppt 110328163403-phpapp01
 

Similar to Pitch detection in Tabla accompaniment

Amplitude spectrogram prediction from mel-frequency cepstrum coefficients and...
Amplitude spectrogram prediction from mel-frequency cepstrum coefficients and...Amplitude spectrogram prediction from mel-frequency cepstrum coefficients and...
Amplitude spectrogram prediction from mel-frequency cepstrum coefficients and...
Kitamura Laboratory
 
Sounding the Keys: A Comprehensive FFT-Based Approach for Piano Note Detection
Sounding the Keys: A Comprehensive FFT-Based Approach for Piano Note DetectionSounding the Keys: A Comprehensive FFT-Based Approach for Piano Note Detection
Sounding the Keys: A Comprehensive FFT-Based Approach for Piano Note Detection
IRJET Journal
 
Analog Communication fast mode material.pdf
Analog Communication fast mode material.pdfAnalog Communication fast mode material.pdf
Analog Communication fast mode material.pdf
Engineering Funda
 
Transferring Singing Expressions from One Voice to Another for a Given Song
Transferring Singing Expressions from One Voice to Another for a Given SongTransferring Singing Expressions from One Voice to Another for a Given Song
Transferring Singing Expressions from One Voice to Another for a Given Song
NAVER Engineering
 
Chord recognition mac lab presentation
Chord recognition mac lab presentationChord recognition mac lab presentation
Chord recognition mac lab presentation
牧衡 楊
 
Trends in Answer Set-Programming - Focus Musik - Presentation
Trends in Answer Set-Programming - Focus Musik - PresentationTrends in Answer Set-Programming - Focus Musik - Presentation
Trends in Answer Set-Programming - Focus Musik - Presentation
Erhard Dinhobl
 
Sound Engineering, What Does It Involve?.pptx
Sound Engineering, What Does It Involve?.pptxSound Engineering, What Does It Involve?.pptx
Sound Engineering, What Does It Involve?.pptx
Derek896886
 
example based audio editing
example based audio editingexample based audio editing
example based audio editing
Ramin Anushiravani
 
[Tutorial] Computational Approaches to Melodic Analysis of Indian Art Music
[Tutorial] Computational Approaches to Melodic Analysis of Indian Art Music[Tutorial] Computational Approaches to Melodic Analysis of Indian Art Music
[Tutorial] Computational Approaches to Melodic Analysis of Indian Art Music
Sankalp Gulati
 
Audio Morphing for Percussive Sound Generation
Audio Morphing for Percussive Sound GenerationAudio Morphing for Percussive Sound Generation
Audio Morphing for Percussive Sound Generation
a3labdsp
 
NTSC Software Decoding Presentation
NTSC Software Decoding PresentationNTSC Software Decoding Presentation
NTSC Software Decoding Presentation
Prateek Dayal
 
44th AES conference (2011)
44th AES conference (2011)44th AES conference (2011)
44th AES conference (2011)
Alvaro Barbosa
 
It's so quiet. Let's make music.
It's so quiet. Let's make music.It's so quiet. Let's make music.
It's so quiet. Let's make music.
Loren Segal
 
Bz33462466
Bz33462466Bz33462466
Bz33462466
IJERA Editor
 

Similar to Pitch detection in Tabla accompaniment (16)

Amplitude spectrogram prediction from mel-frequency cepstrum coefficients and...
Amplitude spectrogram prediction from mel-frequency cepstrum coefficients and...Amplitude spectrogram prediction from mel-frequency cepstrum coefficients and...
Amplitude spectrogram prediction from mel-frequency cepstrum coefficients and...
 
Sounding the Keys: A Comprehensive FFT-Based Approach for Piano Note Detection
Sounding the Keys: A Comprehensive FFT-Based Approach for Piano Note DetectionSounding the Keys: A Comprehensive FFT-Based Approach for Piano Note Detection
Sounding the Keys: A Comprehensive FFT-Based Approach for Piano Note Detection
 
Analog Communication fast mode material.pdf
Analog Communication fast mode material.pdfAnalog Communication fast mode material.pdf
Analog Communication fast mode material.pdf
 
Transferring Singing Expressions from One Voice to Another for a Given Song
Transferring Singing Expressions from One Voice to Another for a Given SongTransferring Singing Expressions from One Voice to Another for a Given Song
Transferring Singing Expressions from One Voice to Another for a Given Song
 
Chord recognition mac lab presentation
Chord recognition mac lab presentationChord recognition mac lab presentation
Chord recognition mac lab presentation
 
Trends in Answer Set-Programming - Focus Musik - Presentation
Trends in Answer Set-Programming - Focus Musik - PresentationTrends in Answer Set-Programming - Focus Musik - Presentation
Trends in Answer Set-Programming - Focus Musik - Presentation
 
Sound Engineering, What Does It Involve?.pptx
Sound Engineering, What Does It Involve?.pptxSound Engineering, What Does It Involve?.pptx
Sound Engineering, What Does It Involve?.pptx
 
Time Based Effects
Time Based EffectsTime Based Effects
Time Based Effects
 
example based audio editing
example based audio editingexample based audio editing
example based audio editing
 
[Tutorial] Computational Approaches to Melodic Analysis of Indian Art Music
[Tutorial] Computational Approaches to Melodic Analysis of Indian Art Music[Tutorial] Computational Approaches to Melodic Analysis of Indian Art Music
[Tutorial] Computational Approaches to Melodic Analysis of Indian Art Music
 
Audio Morphing for Percussive Sound Generation
Audio Morphing for Percussive Sound GenerationAudio Morphing for Percussive Sound Generation
Audio Morphing for Percussive Sound Generation
 
NTSC Software Decoding Presentation
NTSC Software Decoding PresentationNTSC Software Decoding Presentation
NTSC Software Decoding Presentation
 
44th AES conference (2011)
44th AES conference (2011)44th AES conference (2011)
44th AES conference (2011)
 
It's so quiet. Let's make music.
It's so quiet. Let's make music.It's so quiet. Let's make music.
It's so quiet. Let's make music.
 
Bz33462466
Bz33462466Bz33462466
Bz33462466
 
Bz33462466
Bz33462466Bz33462466
Bz33462466
 

More from Ashutosh Bapat

Query optimization techniques for partitioned tables.
Query optimization techniques for partitioned tables.Query optimization techniques for partitioned tables.
Query optimization techniques for partitioned tables.
Ashutosh Bapat
 
Partition and conquer large data in PostgreSQL 10
Partition and conquer large data in PostgreSQL 10Partition and conquer large data in PostgreSQL 10
Partition and conquer large data in PostgreSQL 10
Ashutosh Bapat
 
Atomicity for transactions involving foreign server in PostgreSQL
Atomicity for transactions involving foreign server in PostgreSQLAtomicity for transactions involving foreign server in PostgreSQL
Atomicity for transactions involving foreign server in PostgreSQL
Ashutosh Bapat
 
Materialized views in PostgreSQL
Materialized views in PostgreSQLMaterialized views in PostgreSQL
Materialized views in PostgreSQL
Ashutosh Bapat
 
Pgxc scalability pg_open2012
Pgxc scalability pg_open2012Pgxc scalability pg_open2012
Pgxc scalability pg_open2012
Ashutosh Bapat
 
Introduction to Postrges-XC
Introduction to Postrges-XCIntroduction to Postrges-XC
Introduction to Postrges-XC
Ashutosh Bapat
 

More from Ashutosh Bapat (6)

Query optimization techniques for partitioned tables.
Query optimization techniques for partitioned tables.Query optimization techniques for partitioned tables.
Query optimization techniques for partitioned tables.
 
Partition and conquer large data in PostgreSQL 10
Partition and conquer large data in PostgreSQL 10Partition and conquer large data in PostgreSQL 10
Partition and conquer large data in PostgreSQL 10
 
Atomicity for transactions involving foreign server in PostgreSQL
Atomicity for transactions involving foreign server in PostgreSQLAtomicity for transactions involving foreign server in PostgreSQL
Atomicity for transactions involving foreign server in PostgreSQL
 
Materialized views in PostgreSQL
Materialized views in PostgreSQLMaterialized views in PostgreSQL
Materialized views in PostgreSQL
 
Pgxc scalability pg_open2012
Pgxc scalability pg_open2012Pgxc scalability pg_open2012
Pgxc scalability pg_open2012
 
Introduction to Postrges-XC
Introduction to Postrges-XCIntroduction to Postrges-XC
Introduction to Postrges-XC
 

Recently uploaded

FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.
ViralQR
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
RinaMondal9
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
nkrafacyberclub
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 

Recently uploaded (20)

FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 

Pitch detection in Tabla accompaniment

  • 1. CSE, Indian Institute of Technology Bombay Pitch Detection of Singing Voice in Tabla Accompaniment Ashutosh Bapat (03305010)
  • 2. CSE, Indian Institute of Technology Bombay Outline  Motivation  Music transcription  Pitch & pitch detection  Signal characteristics  Two-way mismatch procedure  Post processing  DP based smoothing  Pitch correction  Experimental evaluation  Conclusion
  • 3. CSE, Indian Institute of Technology Bombay Automatic Music Transcription (AMT) system  Converts acoustic musical signal to symbolic representation  Documents musical attributes  Pitch  Timbre  Rhythm  Pitch is the most salient  Melody = pitch contour
  • 4. CSE, Indian Institute of Technology Bombay AMT for Indian classical music  Components of Indian classical and semi-classical music  Melody line sung by a single main voice o Gamakas: are described by detailed pitch contour  Accompaniment of tabla and tanpura  Musicological and pedagogical applications  Rich archives of audio recordings => need for a reliable PDA for singing voice pitch tracking in tabla accompaniment
  • 5. CSE, Indian Institute of Technology Bombay Pitch  Many definitions exist  Pitch of a signal is defined as the fundamental frequency of an approximately harmonic pattern in spectral representation of signal.  Pitch period is defined as average length of several periods of the signal. 0 0 1 T F =
  • 6. CSE, Indian Institute of Technology Bombay Pitch detection • Input: musical signal • Output: pitch contour Song waveform Pitch contour
  • 7. CSE, Indian Institute of Technology Bombay Pitch Detection Algorithm  Preprocessor: Data reduction and enhancement  Commonly used method: Filtering  Basic Extractor: Estimates single/multiple pitch candidates per frame  Commonly used method: ACF  Post processing:  Measure of reliability of each candidate  Smoothness of pitch contour o Commonly used method: Dynamic programming
  • 8. CSE, Indian Institute of Technology Bombay Singing voice • Pitch evolves continuously • Shows inflexions like bents, stresses, oscillations etc.
  • 9. CSE, Indian Institute of Technology Bombay Classification of tabla strokes  Number of drums  Simple strokes: Na, Ge, Ke, Tit, Tun etc.  Complex strokes: Dha, Dhin, Dhun  Harmonicity  Harmonic: Na, Tin, Tun, Ge  Inharmonic: Ke, Tit  Rate of Decay:  Slowly decaying: Na, Ge, Tin, Tun  Fast decaying: Ke, Tit
  • 10. CSE, Indian Institute of Technology Bombay Harmonic interference: Na
  • 11. CSE, Indian Institute of Technology Bombay Single partial interference: Ge
  • 12. CSE, Indian Institute of Technology Bombay Noisy interference: Ke
  • 13. CSE, Indian Institute of Technology Bombay Pitch detection of mixed song Waveform of song mixed with stroke Na Pitch contour by ACF
  • 14. CSE, Indian Institute of Technology Bombay Two-way mismatch procedure TWM error, F0 = 300 Hz
  • 15. CSE, Indian Institute of Technology Bombay TWM and ACF: harmonic interference • Complex tone of 450 Hz + signal simulating Na • In TWM error we can see minimum at correct pitch • In ACF all peaks are at lags corresponding to 790 Hz TWM error ACF Magnitude plot
  • 16. CSE, Indian Institute of Technology Bombay TWM and ACF: single partial interference • Complex tone of 300 Hz mixed with a single partial with amplitude varied from 0 to 100. • TWM is more robust than ACF 400 Hz 450 Hz
  • 17. CSE, Indian Institute of Technology Bombay TWM and ACF correlograms • Correlograms of complex tone of 300 Hz mixed with stroke Na • Notice horizontal line at 300 Hz in TWM • No clue to lag 73 (corresponds to 300 Hz) ACF correlogram TWM error correlogram
  • 18. CSE, Indian Institute of Technology Bombay TWM pitch contour • Pitch contour of song mixed with stroke Na • Notice large pitch artifacts during strokes
  • 19. CSE, Indian Institute of Technology Bombay Post processing
  • 20. CSE, Indian Institute of Technology Bombay DP based smoothing  Smoothing based on  Measure of reliability of pitch candidates  Smoothness of pitch contour  Measurement cost:  Smoothness cost:  Local transition cost:  Global transition cost: ∑= −= N j jjpjpTNpjppS 1 )),1(),(())(,),(,),1((  ))(),1(()),(()),1(),(( jpjpWjjpEjjpjpT −+=− ),( jpE )',( ppW
  • 21. CSE, Indian Institute of Technology Bombay Smoothness cost • The width of bell varies proportional to pitch • Pitch variation at high pitches is expected to be more than that at low pitches • Saturates at high values pc s pp ppW e * )'( 1)',( 2 2 = − −= − σ σ
  • 22. CSE, Indian Institute of Technology Bombay Pitch contour after applying DP • Smoothened pitch contour • Suppresses fast pitch variations • May introduce errors where tabla is absent
  • 23. CSE, Indian Institute of Technology Bombay Pitch correction • Searches for deepest local minimum in 6% range near pitch estimated by DP • Corrects most of the fine errors
  • 24. CSE, Indian Institute of Technology Bombay Experimental evaluation  Test samples  Samples produced by digitally adding tabla strokes Na, Ge, Ke to pure song waveforms sung with syllable /la/ and /aa/  Algorithms  TWM:  TDP: TWM + DP  TDC: TWM + DP + PC  Errors  Fine error: error magnitude between 3% to 6%  Gross error: error magnitue above 6%
  • 25. CSE, Indian Institute of Technology Bombay Results • DP has decreased number of gross errors increasing number of fine errors • PC has decreased number of fine errors • Better performance in case of songs with slowly varying pitch contours TWM TDP TDC F G F G F G Na 0.0 49. 3 4.6 13. 4 2.1 14. 8 Ge 0.0 20. 9 3.9 2.1 3.4 2.5 Ke 0.0 25. 7 4.9 5.1 0.2 5.1 TWM TDP TDC F G F G F G Na 4.7 14. 7 11. 6 1.6 5.3 2.1 Ge 0.0 22. 5 8.1 4.4 1.5 4.1 Ke 0.0 17. 9 7.2 2.5 0.0 2.5 Song with many fast variations of pitch Song with slowly varying pitch contour Error rates in percentage
  • 26. CSE, Indian Institute of Technology Bombay Errors after application of DP + PC • Errors remaining after application of DP and pitch correction are found in regions with fast variations in pitch
  • 27. CSE, Indian Institute of Technology Bombay Conclusion  Importance of music transcription  Characteristics of tabla strokes  Two-way mismatch PDA  Results showing improvements by application of DP smoothing and pitch correction  Applications in building pitch detector for Indian classical and semi classical music
  • 28. CSE, Indian Institute of Technology Bombay Future work  Combination of ACF and TWM to take advantage of  Lesser computational complexity of ACF  ACF’s robustness to noise, thus better results in Ke  Classification of frames by presence/ absence of tabla strokes  Use pitch estimated by DP and pitch correction only in frames containing tabla stroke  Application of advanced techniques:  adaptive windowing, peak selection, selective search  Pitch tracking in case of complex strokes like Dha and words like TiReKiTa
  • 29. CSE, Indian Institute of Technology Bombay Thank you
  • 30. CSE, Indian Institute of Technology Bombay State space formulation of DP

Editor's Notes

  1. Be consistent with use of caps!
  2. Pitch is most salient. Melody = pitch contour (evolution of pitch over time)
  3. Add a piece of real song with tabla and tanpura Mention tanpura is not considered after point 1.2 Pedagogical – visualisation and reproduction
  4. Since musical signal is dynamic the signal is divided into small frames and for each frame pitch is estimated according to definition seen No one system solves all problems, so each block is chosen and adapted according to the application
  5. One of the partial is stronger than the others
  6. Explain ACF Just after onset Na Add vertical lines at 450 Hz and 263 Hz as well as lags 28, 49, 56
  7. Why not to discuss now noisy interference
  8. Notes are equispaced on logarithmic scale than on linear scale
  9. Half semitone 2^(1/12), octave is divided into 12 notes Pitch estimates obtained from pure songs by applying TWM were assumed to be correct