SlideShare a Scribd company logo
1 of 19
Intelligent error detection andadvisory systemfor practitioners of
musicusing audio signal processing and computing
Authored And Presented by –
Ojas Bhargave [Dept. of E&Tc, MIT College of Engineering, Pune.]
Paper ID: 040
First International Conference on Advanced Technologies in Intelligent
Control, Environment, Computing and Communication Engineering
(ICATIECE-2019)
INTRODUCTION
 We find plenty of music enthusiasts learning music in our society.
 Every such person needs a basic ‘musical maturity’ which he lacks in his initial
stage.
 MUSICAL MATURITY ???
 So he is practically unable to spot his own mistakes and correct them and the wrong
practice may lead to the permanent habit of it.
 He needs a knowledgeable trainer {A Guru} for that matter.
 Here is the proposed closed loop and complete system
 It assists music enthusiasts and learners in their learning process and aims at
eliminating the need of that trainer.
MOTIVATION
 Indian classical music is really a very deep topic to be expert in.
Naturally -> scarcity of expert trainers.
 The process of learning this art is very analogous to the process of ‘Reinforcement
Learning’ concept in machine learning.
 A REWARD BASED system which needs the CONSTANT SUPERVISION and inputs to
accelerate the growth which needs a Guru.
 There can also be some other physical limitations which leads to the lack of the proper
guidance.
 None of these reasons should be significant in the learning process of a learner was the
main motive behind this system.
SWAR (A NOTE)
 12 Swaras in one octave.
 Every Swar has its own frequency.
 This formula to find frequencies in an octave:
Freq. of nth Swar = (Freq. of ‘Sa’) * 2^(n/12) …n = 1, 2, 3....11
 Same Swar from different octaves feel similar though they are not at same
frequency.
 If plotted, musical frequencies(Notes) show exponential behavior.
Scale (Patti)
 A reference Note.
 C# = 277.183Hz, A = 440 Hz …But Sa = ? , Ma = ?
 Notations in Indian Classical Music: Kaali 4, Safed 2
Western Music: C#(C sharp), Bb(B flat)
 Scale is decided before the session starts.
 Scale = base note (‘Sa’ in Indian classical music),
And other frequencies are calculated according to that scale.
 When multiple instruments are being used, it would be a chaos if they are not
tuned and synced with each other. For this tuning, a predetermined scale is
needed.
Raag
 A well designed set of multiple swaras(notes)
 Every Raag has its own rules defined in the theory of Indian Classical music.
 A learner has to perform inside the boundaries of those rules and regulations of
the Raag he is practicing.
 For Eg. Malkauns:
Aaroha : सा, ग॒ म, ध॒ न ॒ साां
Avaroha : साां न ॒ ध॒ म ग॒ म ग॒ सा
Thaat : Bhairavi
Vaadi – Samvadi : म (M ) - सा (S)
FLOW OF THE CONCEPT:
USER INPUTS
 Manual inputs: 1. Scale
2. Raag (to be practiced)
 These two inputs are directly accepted from the Raag Authentication block.
 These inputs are used to calculate the allowed frequencies and invalid frequencies.
 And the main Audio Input:
It is the live practice of the learner taken in the system using microphone.
DATA SAMPLING AND A/D CONVERSION
 Microphone converts the sound vibrations in to the analog electrical data.
 Analog data: Difficult and non-practical to store and process.
 Hence sampling is needed.
 Then in order to process the data, it is converted into Digital format.
 Digital data : Easier for processing and further computation.
DATA CONDITIONING
 Main issue: Noise removal and Smoothening.
 A raw audio signal contains a lot of unwanted frequencies.
 Technique used: Gaussian noise removal
 Removal of the extremes in the data.
which can hamper the results of the next block ‘Pitch Extraction’.
 Smoothening using Savitzky-Golay smoothing filter.
 Then we break this signal into VERY SMALL MULTIPLE DATA FRAMES OR UNITS.
PITCH EXTRACTION
 A backbone of a system
 We ultimately need the frequencies in that audio signal to judge the practicing
individual.
 So we find the pitch(frequencies) of each data unit received from the previous block.
 Technique used: Autocorrelation
 Used frequently in signal processing for finding repeating patterns in the signal.
 Briefly, Multiplication of the signal with itself in the frequency domain.
 The algorithm is applied over each data unit and the fetched frequencies are stored in
an array for further computation.
 However, the problem faced in this was the multiple numeric frequency outputs and
that can confuse the next block.
RAAG AUTHENTICATION
 This block has all the rules and regulations coded according to the theory of Indian
Classical Music.
 The pitch array received from pitch detection block is fed to this block.
 As mentioned, It has the manual inputs: Scale and Raag
 Using this data and the coded rules, this block calculates the valid and invalid
frequencies which can be played.
 After receiving the frequencies played by the learner,
This block Authenticates it according to the raag being played and the scale being used.
 Here, we also need to spot and neglect the unwanted frequencies detected in the
previous block.
ERROR ANALYZER
 The user can commit error in 2 ways: 1. A wrong note played altogether
2. A wrong note played according to the Raag
 This block firstly checks for the correctness of the note played using the input from the
previous block.
 Then, it is checked with respect to the Raag.
 Every frequency in the array is checked for the correctness.
 Another responsibility of this block is to draw predictions and conclusions for the user.
 Our output screen shows the errors, number of errors and the point where he has
committed it with some handy predictions. [Hence closed loop]
RESULTS NOTED WHILE TESTING
FUTURE SCOPE
 A handy app can be made to make it accessible for everyone.
 Current scope does not support highly complex music and exceptions which comes
when the senior musician is performing.
 Integration of AI and ML : Can learn these complex patterns which will extend the
scope of the system to the higher level.
 Currently, the system supports just the Indian Classical Music.
 Can be extended for other forms of music. {Generic Algorithm}
 Can be extended towards a smart auto tuner. [Extensively used in the music Industry]
CONCLUSION
 With the repetitive use of this smart and interactive method, the
beginner will be able to understand the patterns of his own errors and
will be able to correct them in a smart way.
 The rate and speed of learning will be very high.
THANK YOU…
- Ojas Bhargave
Email id: ojasbhargave@gmail.com

More Related Content

Similar to Intelligent error detection and advisory system for practitioners of music using audio signal processing and computing

EENG 1920 Final Report
EENG 1920 Final ReportEENG 1920 Final Report
EENG 1920 Final Report
Ore Afolayan
 
final.doc
final.docfinal.doc
final.doc
butest
 
final.doc
final.docfinal.doc
final.doc
butest
 

Similar to Intelligent error detection and advisory system for practitioners of music using audio signal processing and computing (20)

Sounding the Keys: A Comprehensive FFT-Based Approach for Piano Note Detection
Sounding the Keys: A Comprehensive FFT-Based Approach for Piano Note DetectionSounding the Keys: A Comprehensive FFT-Based Approach for Piano Note Detection
Sounding the Keys: A Comprehensive FFT-Based Approach for Piano Note Detection
 
Audio Classification using Artificial Neural Network with Denoising Algorithm...
Audio Classification using Artificial Neural Network with Denoising Algorithm...Audio Classification using Artificial Neural Network with Denoising Algorithm...
Audio Classification using Artificial Neural Network with Denoising Algorithm...
 
IRJET - EMO-MUSIC(Emotion based Music Player)
IRJET - EMO-MUSIC(Emotion based Music Player)IRJET - EMO-MUSIC(Emotion based Music Player)
IRJET - EMO-MUSIC(Emotion based Music Player)
 
IRJET- Machine Learning and Noise Reduction Techniques for Music Genre Classi...
IRJET- Machine Learning and Noise Reduction Techniques for Music Genre Classi...IRJET- Machine Learning and Noise Reduction Techniques for Music Genre Classi...
IRJET- Machine Learning and Noise Reduction Techniques for Music Genre Classi...
 
Extraction and Conversion of Vocals
Extraction and Conversion of VocalsExtraction and Conversion of Vocals
Extraction and Conversion of Vocals
 
IRJET - Music Generation using Deep Learning
IRJET -  	  Music Generation using Deep LearningIRJET -  	  Music Generation using Deep Learning
IRJET - Music Generation using Deep Learning
 
IRJET- A Survey on Sound Recognition
IRJET- A Survey on Sound RecognitionIRJET- A Survey on Sound Recognition
IRJET- A Survey on Sound Recognition
 
A Review on Speech Enhancement System using FIR Filter
A Review on Speech Enhancement System using FIR FilterA Review on Speech Enhancement System using FIR Filter
A Review on Speech Enhancement System using FIR Filter
 
IRJET- Musical Instrument Recognition using CNN and SVM
IRJET-  	  Musical Instrument Recognition using CNN and SVMIRJET-  	  Musical Instrument Recognition using CNN and SVM
IRJET- Musical Instrument Recognition using CNN and SVM
 
IRJET- The Complete Music Player
IRJET- The Complete Music PlayerIRJET- The Complete Music Player
IRJET- The Complete Music Player
 
Optimized audio classification and segmentation algorithm by using ensemble m...
Optimized audio classification and segmentation algorithm by using ensemble m...Optimized audio classification and segmentation algorithm by using ensemble m...
Optimized audio classification and segmentation algorithm by using ensemble m...
 
EENG 1920 Final Report
EENG 1920 Final ReportEENG 1920 Final Report
EENG 1920 Final Report
 
IRJET- Voice based Gender Recognition
IRJET- Voice based Gender RecognitionIRJET- Voice based Gender Recognition
IRJET- Voice based Gender Recognition
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...
 
Application of Recurrent Neural Networks paired with LSTM - Music Generation
Application of Recurrent Neural Networks paired with LSTM - Music GenerationApplication of Recurrent Neural Networks paired with LSTM - Music Generation
Application of Recurrent Neural Networks paired with LSTM - Music Generation
 
Nithin Xavier research_proposal
Nithin Xavier research_proposalNithin Xavier research_proposal
Nithin Xavier research_proposal
 
Tonic Identification System for Hindustani and Carnatic Music
Tonic Identification System for Hindustani and Carnatic MusicTonic Identification System for Hindustani and Carnatic Music
Tonic Identification System for Hindustani and Carnatic Music
 
final.doc
final.docfinal.doc
final.doc
 
final.doc
final.docfinal.doc
final.doc
 

Recently uploaded

Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
drjose256
 
electrical installation and maintenance.
electrical installation and maintenance.electrical installation and maintenance.
electrical installation and maintenance.
benjamincojr
 

Recently uploaded (20)

Circuit Breakers for Engineering Students
Circuit Breakers for Engineering StudentsCircuit Breakers for Engineering Students
Circuit Breakers for Engineering Students
 
Dynamo Scripts for Task IDs and Space Naming.pptx
Dynamo Scripts for Task IDs and Space Naming.pptxDynamo Scripts for Task IDs and Space Naming.pptx
Dynamo Scripts for Task IDs and Space Naming.pptx
 
Instruct Nirmaana 24-Smart and Lean Construction Through Technology.pdf
Instruct Nirmaana 24-Smart and Lean Construction Through Technology.pdfInstruct Nirmaana 24-Smart and Lean Construction Through Technology.pdf
Instruct Nirmaana 24-Smart and Lean Construction Through Technology.pdf
 
Insurance management system project report.pdf
Insurance management system project report.pdfInsurance management system project report.pdf
Insurance management system project report.pdf
 
Artificial Intelligence in due diligence
Artificial Intelligence in due diligenceArtificial Intelligence in due diligence
Artificial Intelligence in due diligence
 
Theory of Time 2024 (Universal Theory for Everything)
Theory of Time 2024 (Universal Theory for Everything)Theory of Time 2024 (Universal Theory for Everything)
Theory of Time 2024 (Universal Theory for Everything)
 
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
 
Fuzzy logic method-based stress detector with blood pressure and body tempera...
Fuzzy logic method-based stress detector with blood pressure and body tempera...Fuzzy logic method-based stress detector with blood pressure and body tempera...
Fuzzy logic method-based stress detector with blood pressure and body tempera...
 
8th International Conference on Soft Computing, Mathematics and Control (SMC ...
8th International Conference on Soft Computing, Mathematics and Control (SMC ...8th International Conference on Soft Computing, Mathematics and Control (SMC ...
8th International Conference on Soft Computing, Mathematics and Control (SMC ...
 
Passive Air Cooling System and Solar Water Heater.ppt
Passive Air Cooling System and Solar Water Heater.pptPassive Air Cooling System and Solar Water Heater.ppt
Passive Air Cooling System and Solar Water Heater.ppt
 
The Entity-Relationship Model(ER Diagram).pptx
The Entity-Relationship Model(ER Diagram).pptxThe Entity-Relationship Model(ER Diagram).pptx
The Entity-Relationship Model(ER Diagram).pptx
 
Worksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptxWorksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptx
 
NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024
NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024
NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024
 
21scheme vtu syllabus of visveraya technological university
21scheme vtu syllabus of visveraya technological university21scheme vtu syllabus of visveraya technological university
21scheme vtu syllabus of visveraya technological university
 
Working Principle of Echo Sounder and Doppler Effect.pdf
Working Principle of Echo Sounder and Doppler Effect.pdfWorking Principle of Echo Sounder and Doppler Effect.pdf
Working Principle of Echo Sounder and Doppler Effect.pdf
 
Augmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptxAugmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptx
 
Seismic Hazard Assessment Software in Python by Prof. Dr. Costas Sachpazis
Seismic Hazard Assessment Software in Python by Prof. Dr. Costas SachpazisSeismic Hazard Assessment Software in Python by Prof. Dr. Costas Sachpazis
Seismic Hazard Assessment Software in Python by Prof. Dr. Costas Sachpazis
 
electrical installation and maintenance.
electrical installation and maintenance.electrical installation and maintenance.
electrical installation and maintenance.
 
Basics of Relay for Engineering Students
Basics of Relay for Engineering StudentsBasics of Relay for Engineering Students
Basics of Relay for Engineering Students
 
Intro to Design (for Engineers) at Sydney Uni
Intro to Design (for Engineers) at Sydney UniIntro to Design (for Engineers) at Sydney Uni
Intro to Design (for Engineers) at Sydney Uni
 

Intelligent error detection and advisory system for practitioners of music using audio signal processing and computing

  • 1. Intelligent error detection andadvisory systemfor practitioners of musicusing audio signal processing and computing Authored And Presented by – Ojas Bhargave [Dept. of E&Tc, MIT College of Engineering, Pune.] Paper ID: 040 First International Conference on Advanced Technologies in Intelligent Control, Environment, Computing and Communication Engineering (ICATIECE-2019)
  • 2. INTRODUCTION  We find plenty of music enthusiasts learning music in our society.  Every such person needs a basic ‘musical maturity’ which he lacks in his initial stage.  MUSICAL MATURITY ???  So he is practically unable to spot his own mistakes and correct them and the wrong practice may lead to the permanent habit of it.  He needs a knowledgeable trainer {A Guru} for that matter.  Here is the proposed closed loop and complete system  It assists music enthusiasts and learners in their learning process and aims at eliminating the need of that trainer.
  • 3. MOTIVATION  Indian classical music is really a very deep topic to be expert in. Naturally -> scarcity of expert trainers.  The process of learning this art is very analogous to the process of ‘Reinforcement Learning’ concept in machine learning.  A REWARD BASED system which needs the CONSTANT SUPERVISION and inputs to accelerate the growth which needs a Guru.  There can also be some other physical limitations which leads to the lack of the proper guidance.  None of these reasons should be significant in the learning process of a learner was the main motive behind this system.
  • 4. SWAR (A NOTE)  12 Swaras in one octave.  Every Swar has its own frequency.  This formula to find frequencies in an octave: Freq. of nth Swar = (Freq. of ‘Sa’) * 2^(n/12) …n = 1, 2, 3....11  Same Swar from different octaves feel similar though they are not at same frequency.  If plotted, musical frequencies(Notes) show exponential behavior.
  • 5.
  • 6. Scale (Patti)  A reference Note.  C# = 277.183Hz, A = 440 Hz …But Sa = ? , Ma = ?  Notations in Indian Classical Music: Kaali 4, Safed 2 Western Music: C#(C sharp), Bb(B flat)  Scale is decided before the session starts.  Scale = base note (‘Sa’ in Indian classical music), And other frequencies are calculated according to that scale.  When multiple instruments are being used, it would be a chaos if they are not tuned and synced with each other. For this tuning, a predetermined scale is needed.
  • 7. Raag  A well designed set of multiple swaras(notes)  Every Raag has its own rules defined in the theory of Indian Classical music.  A learner has to perform inside the boundaries of those rules and regulations of the Raag he is practicing.  For Eg. Malkauns: Aaroha : सा, ग॒ म, ध॒ न ॒ साां Avaroha : साां न ॒ ध॒ म ग॒ म ग॒ सा Thaat : Bhairavi Vaadi – Samvadi : म (M ) - सा (S)
  • 8. FLOW OF THE CONCEPT:
  • 9. USER INPUTS  Manual inputs: 1. Scale 2. Raag (to be practiced)  These two inputs are directly accepted from the Raag Authentication block.  These inputs are used to calculate the allowed frequencies and invalid frequencies.  And the main Audio Input: It is the live practice of the learner taken in the system using microphone.
  • 10. DATA SAMPLING AND A/D CONVERSION  Microphone converts the sound vibrations in to the analog electrical data.  Analog data: Difficult and non-practical to store and process.  Hence sampling is needed.  Then in order to process the data, it is converted into Digital format.  Digital data : Easier for processing and further computation.
  • 11. DATA CONDITIONING  Main issue: Noise removal and Smoothening.  A raw audio signal contains a lot of unwanted frequencies.  Technique used: Gaussian noise removal  Removal of the extremes in the data. which can hamper the results of the next block ‘Pitch Extraction’.  Smoothening using Savitzky-Golay smoothing filter.  Then we break this signal into VERY SMALL MULTIPLE DATA FRAMES OR UNITS.
  • 12. PITCH EXTRACTION  A backbone of a system  We ultimately need the frequencies in that audio signal to judge the practicing individual.  So we find the pitch(frequencies) of each data unit received from the previous block.  Technique used: Autocorrelation  Used frequently in signal processing for finding repeating patterns in the signal.  Briefly, Multiplication of the signal with itself in the frequency domain.  The algorithm is applied over each data unit and the fetched frequencies are stored in an array for further computation.  However, the problem faced in this was the multiple numeric frequency outputs and that can confuse the next block.
  • 13. RAAG AUTHENTICATION  This block has all the rules and regulations coded according to the theory of Indian Classical Music.  The pitch array received from pitch detection block is fed to this block.  As mentioned, It has the manual inputs: Scale and Raag  Using this data and the coded rules, this block calculates the valid and invalid frequencies which can be played.  After receiving the frequencies played by the learner, This block Authenticates it according to the raag being played and the scale being used.  Here, we also need to spot and neglect the unwanted frequencies detected in the previous block.
  • 14. ERROR ANALYZER  The user can commit error in 2 ways: 1. A wrong note played altogether 2. A wrong note played according to the Raag  This block firstly checks for the correctness of the note played using the input from the previous block.  Then, it is checked with respect to the Raag.  Every frequency in the array is checked for the correctness.  Another responsibility of this block is to draw predictions and conclusions for the user.  Our output screen shows the errors, number of errors and the point where he has committed it with some handy predictions. [Hence closed loop]
  • 15.
  • 17. FUTURE SCOPE  A handy app can be made to make it accessible for everyone.  Current scope does not support highly complex music and exceptions which comes when the senior musician is performing.  Integration of AI and ML : Can learn these complex patterns which will extend the scope of the system to the higher level.  Currently, the system supports just the Indian Classical Music.  Can be extended for other forms of music. {Generic Algorithm}  Can be extended towards a smart auto tuner. [Extensively used in the music Industry]
  • 18. CONCLUSION  With the repetitive use of this smart and interactive method, the beginner will be able to understand the patterns of his own errors and will be able to correct them in a smart way.  The rate and speed of learning will be very high.
  • 19. THANK YOU… - Ojas Bhargave Email id: ojasbhargave@gmail.com