Evaluation of log spectrogram and Mel spectrogram in acoustic analysis
Program: MCS
Section: 4rd
Student Name: Tahlil Younous
Registration No: COSC201301015
Supervisor: Sir Shahab Alam Hashmi
Department of Computer Science,
Khawaja Fareed UEIT,
Rahim Yar Khan.
2
Agenda
• Introduction
• Problem statement
• Functional requirements
• Domain/tools
• Feature Extraction
• Algorithm
• Project Scope
• Datasets
• Existing System
• Methodology Diagram
3
Introduction
• Machine learning and AI are used extensively in the domain
of acoustic analysis.
• Examples include speech recognition, voice-over, industrial anomaly
detection, and music generation.
4
Problem Statement
For real-time acoustic analysis with deep learning, we use
spectrograms for feature extraction. These spectrograms can
be of different types, e.g. Mel spectrograms and log
spectrograms. The aim of this study is to find which type of
spectrogram performs better than the other.
5
Functional Requirements
• Evaluation of the performance and accuracy of log
spectrograms and Mel spectrograms in acoustic
analysis using deep learning.
• Deep learning models such as CNN 1D, CNN 2D and
LSTM will be used.
• Feature extraction will be done in real time during
execution/model training.
6
Tools/Domain
• Machine learning, deep learning.
• Python libraries: librosa, kapre.
7
Feature Extraction
• I used spectrograms for feature extraction.
• Mel Spectrogram
• Log Spectrogram
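To make the two feature types concrete, here is a minimal, dependency-free sketch of a log spectrogram: the signal is cut into overlapping frames, each frame is windowed and transformed, and the power in each frequency bin is put on a decibel scale. A Mel spectrogram starts from the same power values but pools the linear frequency bins into bands spaced on the Mel scale. The function names, the frame length, the hop size and the decibel floor are illustrative assumptions, not the project's actual settings; in the project itself a library such as librosa or kapre would do this work.

```python
import cmath
import math


def frame_signal(samples, frame_len, hop):
    """Split samples into overlapping frames; a trailing partial frame is dropped."""
    return [samples[start:start + frame_len]
            for start in range(0, len(samples) - frame_len + 1, hop)]


def power_spectrum(frame):
    """Apply a Hann window to one frame and return power for bins 0..N/2.

    Uses a naive O(N^2) DFT to stay readable; real code would use an FFT.
    """
    n = len(frame)
    windowed = [x * (0.5 - 0.5 * math.cos(2 * math.pi * i / n))
                for i, x in enumerate(frame)]
    powers = []
    for k in range(n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(windowed))
        powers.append(abs(coeff) ** 2)
    return powers


def log_spectrogram(samples, frame_len=64, hop=32, floor=1e-10):
    """Return one list of decibel values per frame (time x frequency).

    frame_len, hop and floor are assumed example values.
    """
    return [[10.0 * math.log10(max(p, floor)) for p in power_spectrum(frame)]
            for frame in frame_signal(samples, frame_len, hop)]


if __name__ == "__main__":
    # A pure tone that falls exactly on DFT bin 8 of a 64-sample frame.
    tone = [math.sin(2 * math.pi * 8 * i / 64) for i in range(256)]
    spec = log_spectrogram(tone)
    print(len(spec), "frames x", len(spec[0]), "frequency bins")
```

Each row of the result is one time step and each column one frequency bin, which is the 2D layout a CNN 2D model would take as input.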
8
Algorithms
• The following deep learning models will be
utilized in the study.
• Convolutional neural network
1. CNN 1D
2. CNN 2D
• Recurrent neural network
1. LSTM
9
Project Scope
• The scope of this project is automated and intelligent systems that
make decisions on the basis of sounds/acoustics.
10
Datasets
• Cat and dog sounds
• Emergency vehicle siren sounds
• Music
• Urban 8k
11
Existing system
In 1937, Stevens, Volkmann, and Newman proposed a unit of
pitch such that equal distances in pitch sounded equally
distant to the listener. This is called the Mel scale. We perform
a mathematical operation on frequencies to convert them to
the Mel scale.
https://medium.com/analytics-vidhya/understanding-the-mel-spectrogram-fca2afa2ce53
https://towardsdatascience.com/getting-to-know-the-mel-spectrogram-31bca3e2d9d0
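The "mathematical operation" is not spelled out on the slide. As a hedged illustration, the sketch below uses one commonly quoted form, mel = 2595 * log10(1 + f / 700); other variants exist, and this may not be the one the project uses. The function names and the band-centre helper are hypothetical, added only to show how equal steps on the Mel scale turn into widening steps in Hz.

```python
import math


def hz_to_mel(freq_hz: float) -> float:
    """Convert a frequency in Hz to mels (assumed 2595 * log10(1 + f / 700) variant)."""
    return 2595.0 * math.log10(1.0 + freq_hz / 700.0)


def mel_to_hz(mel: float) -> float:
    """Inverse of hz_to_mel for the same assumed formula."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_band_centers(n_bands: int, f_min: float, f_max: float) -> list:
    """Centre frequencies in Hz of n_bands bands spaced evenly on the Mel scale."""
    mel_min, mel_max = hz_to_mel(f_min), hz_to_mel(f_max)
    step = (mel_max - mel_min) / (n_bands + 1)
    return [mel_to_hz(mel_min + step * (i + 1)) for i in range(n_bands)]


if __name__ == "__main__":
    # Equal Mel spacing gives centres that are dense at low frequencies
    # and increasingly far apart at high frequencies.
    for centre in mel_band_centers(5, 0.0, 8000.0):
        print(round(centre, 1), "Hz")
```

With this variant, 1000 Hz maps to approximately 1000 mels, which is the usual anchor point of the scale.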
12
Methodology Diagram
13
