SlideShare a Scribd company logo
Juan G. Colonna
Afonso D. Ribas
Eduardo F. Nakamura
Eulanda M. dos Santos




                 Feature Subset Selection for
                 Automatically Classifying Anuran
                 Calls Using Sensor Networks



Institute of Computing (IComp)
Federal University of Amazon (UFAM)
Introduction - Environmental Motivation
The study of environmental conditions allow:

 maintain the quality of life, and
 to preserve the species.

The loss of species is an irreversible process!


 The variation of species populations
 enables to:

  identify environmental problems in the
   early stages, and

  establish strategies for the
   conservation of biological diversity.
Introduction - Environmental Motivation

  Variations in amphibian populations are related to pollution,
   deforestation, urbanization, etc.

  Frogs can be used as indicators for detecting environmental stress.




 Figure: Percentage of threatened
 species in the red list. Figure adapted
 from [Stuart et al., 2004].
Introduction – Objectives


   Classify frog species of tropical forests based on the vocalizations
   using wireless sensor networks and machine learning technique. *




                    * Consideration: Restrictions on the hardware.
                                                                     4
Introduction - Challenges
  Develop a method that does not need human intervention.

  Characterize the spectral frequency of frog.

  Extract and select the optimal set of features.

  Define the classification technique.

  Get the minimum set of features using genetic algorithm.

  Obtain the cost of processing characteristics.
                                                              WSN and
  Correlate the processing cost and success rate.            Machine Learning


  Maximize the benefit cost rate.




 5
Related Work
        Author              Animal         Features     Classifier        Results     WSN

   Taylor et al. [1996]   Bufo marinus   Spectrograma     C4.5              60%       No

    Hu et al. [2005]      Bufo marinus   Spectrograma     C4.5              60%       Yes

   Yen & Fu [2002]*          4 frog        Wavelet        MLP               71%       No
                                           Fisher’s
    Clemins [2005]          elephant       MFCCs         HMM                69%       No
                                            PLP          DTW                73%
    Cai et al. [2007]       14 bird        MFCCs          ANN            81% - 86%    Yes

  Huang et al. [2009]*       5 frog       S - B - ZC      k-NN           83% - 100%   No
                                                          SVM            82% - 100%
    Vaca-Castaño &          10 bird        MFCCs          k-NN              86%       Yes
   Rodriguez [2010]*        20 frog         PCA                             91%
   Han et al. [2011]*        9 frog       S - Hs - Hr     k-NN           83% - 100%   No

    * Work implemented and used in the comparisons.



                                                                     6
Our approach




  Figure: Anuran classification stages.       Figure: Pre-processing steps.




                                          Figure: Parametrization of vocalizations.




                                                               7
Features




Figure: Mel-Fourier Cepstral Coefficients   Figure: Wavelet Transform with Lifting Scheme.
(MFCCs).




                                                                        8
Obtain the features




                 Figure: Feature extraction.




                                               9
Spectrogram




         Figure: Audio sample (wave form and spectrogram) for
         the Adenomera andreae..




                                                                10
Features

            Feature    Complexity order   Computational cost

             Pitch          O(L)                3L − 1
              B           O(Nlog(N))      2M + 2M + Nlog(N)
           12 MFCC’s      O(Nlog(N))       Nlog(N) + N + mR
              S           O(Nlog(N))        2M + Nlog(N)
              H1            O(L)                L+i
              H2            O(L)                L+i
              ZC            O(L)                  L
              E             O(L)                  L
              Pw            O(L)                  L




                                                      11
Comparison between MFCCs and Wavelet

                Features                                         k-NN

                                              0.4                 0.5                 0.6

            Wavelet Features               96.35%(3)           97.86%(1)           98.22%(1)
          Daubechies Transform
             Wavelet Features              96.70%(1)           97.90%(1)           98.38%(1)
             Haar Transform
                 MFCCs                     99.19%(9)           99.36%(2)           99.19%(1)

     Table: Success rate in relation to alpha, using cross-validation fold = 10.



Applying the Wilcoxon test, with 95% significance level (α = 0.5), we conclude that the
MFCCs have better performance.




                                                                             12
Comparison between MFCCs and Wavelet
  Objective: To determine the optimal subset of features by applying GA.




                                                         13
Comparison between MFCCs and Wavelet


   Features    Classification   Crossover 50%      Success rate    Crossover 60%       Success rate
                before GA       Mutation 40%                       Mutation 20%

  9 features   97.86%(1)            1,2,3,5         93.73%         1,2,3,4,5,6,8,9      96.83%
   with Db
  9 features   97.90%(1)*        2,3,4,5,6,8,9      96.47%        1,2,3,4,5,6,7,8,9     97.90%*
  with Haar
  12 MFCCs     99.36%(2)*       1,2,3,4,5,6,7,11    99.08%        1,2,3,4,5,6,7,8,91    99.33%*
                                                                         1,12




                                                                             14
Case of Study




        fs = 44.1kHz                fs = 11kHz




                       fs =5.5kHz
Conclusions
We indicated how best set of features to choose the 12 MFCCs.

You can optimize costs by using 8 MFCCs, although the method loses
generality.

The MFFCs have:

✔
     Better success rate;
✔
     Constant cost, regardless of hardware, and
✔
     Immunity to environmental and quantization noise.




                                                   16
Questions?




             Thanks




                      17

More Related Content

Similar to Feature Subset Selection for Automatically Classifying Anuran Calls Using Sensor Networks

An Information-Theoretic Approach for Clonal Selection Algorithms
An Information-Theoretic Approach for Clonal Selection AlgorithmsAn Information-Theoretic Approach for Clonal Selection Algorithms
An Information-Theoretic Approach for Clonal Selection Algorithms
Mario Pavone
 
Dierk Raabe Darmstadt T U Celebration Colloquium Mechanics Of Crystals
Dierk  Raabe  Darmstadt  T U  Celebration  Colloquium  Mechanics Of  CrystalsDierk  Raabe  Darmstadt  T U  Celebration  Colloquium  Mechanics Of  Crystals
Dierk Raabe Darmstadt T U Celebration Colloquium Mechanics Of Crystals
Dierk Raabe
 
An Autocorrelation Analysis Approach to Detecting Land Cover Change using Hyp...
An Autocorrelation Analysis Approach to Detecting Land Cover Change using Hyp...An Autocorrelation Analysis Approach to Detecting Land Cover Change using Hyp...
An Autocorrelation Analysis Approach to Detecting Land Cover Change using Hyp...
grssieee
 

Similar to Feature Subset Selection for Automatically Classifying Anuran Calls Using Sensor Networks (20)

ISSCS2011
ISSCS2011ISSCS2011
ISSCS2011
 
Universal approximators for Direct Policy Search in multi-purpose water reser...
Universal approximators for Direct Policy Search in multi-purpose water reser...Universal approximators for Direct Policy Search in multi-purpose water reser...
Universal approximators for Direct Policy Search in multi-purpose water reser...
 
Integrated RF and Shim coils for MRI
 Integrated RF and Shim coils for MRI Integrated RF and Shim coils for MRI
Integrated RF and Shim coils for MRI
 
MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...
MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...
MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...
 
MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...
MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...
MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...
 
Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Dee...
Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Dee...Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Dee...
Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Dee...
 
Smart Systems for Urban Water Demand Management
Smart Systems for Urban Water Demand ManagementSmart Systems for Urban Water Demand Management
Smart Systems for Urban Water Demand Management
 
Audio Visual Emotion Recognition Using Cross Correlation and Wavelet Packet D...
Audio Visual Emotion Recognition Using Cross Correlation and Wavelet Packet D...Audio Visual Emotion Recognition Using Cross Correlation and Wavelet Packet D...
Audio Visual Emotion Recognition Using Cross Correlation and Wavelet Packet D...
 
Resolution
ResolutionResolution
Resolution
 
An Information-Theoretic Approach for Clonal Selection Algorithms
An Information-Theoretic Approach for Clonal Selection AlgorithmsAn Information-Theoretic Approach for Clonal Selection Algorithms
An Information-Theoretic Approach for Clonal Selection Algorithms
 
Dierk Raabe Darmstadt T U Celebration Colloquium Mechanics Of Crystals
Dierk  Raabe  Darmstadt  T U  Celebration  Colloquium  Mechanics Of  CrystalsDierk  Raabe  Darmstadt  T U  Celebration  Colloquium  Mechanics Of  Crystals
Dierk Raabe Darmstadt T U Celebration Colloquium Mechanics Of Crystals
 
Segmentation Based Multilevel Wide Band Compression for SAR Images Using Coif...
Segmentation Based Multilevel Wide Band Compression for SAR Images Using Coif...Segmentation Based Multilevel Wide Band Compression for SAR Images Using Coif...
Segmentation Based Multilevel Wide Band Compression for SAR Images Using Coif...
 
An Autocorrelation Analysis Approach to Detecting Land Cover Change using Hyp...
An Autocorrelation Analysis Approach to Detecting Land Cover Change using Hyp...An Autocorrelation Analysis Approach to Detecting Land Cover Change using Hyp...
An Autocorrelation Analysis Approach to Detecting Land Cover Change using Hyp...
 
Comparative Analysis of Different Wavelet Functions using Modified Adaptive F...
Comparative Analysis of Different Wavelet Functions using Modified Adaptive F...Comparative Analysis of Different Wavelet Functions using Modified Adaptive F...
Comparative Analysis of Different Wavelet Functions using Modified Adaptive F...
 
Presentation CIE619
Presentation CIE619 Presentation CIE619
Presentation CIE619
 
Hvordan maler og dokumenterer du?
Hvordan maler og dokumenterer du?Hvordan maler og dokumenterer du?
Hvordan maler og dokumenterer du?
 
[IJET-V1I5P4] Authors :Murugan, Avudaiappan, Balasubramanian
[IJET-V1I5P4] Authors :Murugan, Avudaiappan, Balasubramanian [IJET-V1I5P4] Authors :Murugan, Avudaiappan, Balasubramanian
[IJET-V1I5P4] Authors :Murugan, Avudaiappan, Balasubramanian
 
LINAC COMMSSN.ppt
LINAC COMMSSN.pptLINAC COMMSSN.ppt
LINAC COMMSSN.ppt
 
Bayesian modelling and computation for Raman spectroscopy
Bayesian modelling and computation for Raman spectroscopyBayesian modelling and computation for Raman spectroscopy
Bayesian modelling and computation for Raman spectroscopy
 
LTE Physical Layer Transmission Mode Selection Over MIMO Scattering Channels
LTE Physical Layer Transmission Mode Selection Over MIMO Scattering ChannelsLTE Physical Layer Transmission Mode Selection Over MIMO Scattering Channels
LTE Physical Layer Transmission Mode Selection Over MIMO Scattering Channels
 

Recently uploaded

Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 

Recently uploaded (20)

Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
 
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
 
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxUnpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John Staveley
 
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeFree and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
 
UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2
 
IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Agentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdfAgentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdf
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptxWSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone KomSalesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
 

Feature Subset Selection for Automatically Classifying Anuran Calls Using Sensor Networks

  • 1. Juan G. Colonna Afonso D. Ribas Eduardo F. Nakamura Eulanda M. dos Santos Feature Subset Selection for Automatically Classifying Anuran Calls Using Sensor Networks Institute of Computing (IComp) Federal University of Amazon (UFAM)
  • 2. Introduction - Environmental Motivation The study of environmental conditions allow:  maintain the quality of life, and  to preserve the species. The loss of species is an irreversible process! The variation of species populations enables to:  identify environmental problems in the early stages, and  establish strategies for the conservation of biological diversity.
  • 3. Introduction - Environmental Motivation  Variations in amphibian populations are related to pollution, deforestation, urbanization, etc.  Frogs can be used as indicators for detecting environmental stress. Figure: Percentage of threatened species in the red list. Figure adapted from [Stuart et al., 2004].
  • 4. Introduction – Objectives Classify frog species of tropical forests based on the vocalizations using wireless sensor networks and machine learning technique. * * Consideration: Restrictions on the hardware. 4
  • 5. Introduction - Challenges  Develop a method that does not need human intervention.  Characterize the spectral frequency of frog.  Extract and select the optimal set of features.  Define the classification technique.  Get the minimum set of features using genetic algorithm.  Obtain the cost of processing characteristics. WSN and  Correlate the processing cost and success rate. Machine Learning  Maximize the benefit cost rate. 5
  • 6. Related Work Author Animal Features Classifier Results WSN Taylor et al. [1996] Bufo marinus Spectrograma C4.5 60% No Hu et al. [2005] Bufo marinus Spectrograma C4.5 60% Yes Yen & Fu [2002]* 4 frog Wavelet MLP 71% No Fisher’s Clemins [2005] elephant MFCCs HMM 69% No PLP DTW 73% Cai et al. [2007] 14 bird MFCCs ANN 81% - 86% Yes Huang et al. [2009]* 5 frog S - B - ZC k-NN 83% - 100% No SVM 82% - 100% Vaca-Castaño & 10 bird MFCCs k-NN 86% Yes Rodriguez [2010]* 20 frog PCA 91% Han et al. [2011]* 9 frog S - Hs - Hr k-NN 83% - 100% No * Work implemented and used in the comparisons. 6
  • 7. Our approach Figure: Anuran classification stages. Figure: Pre-processing steps. Figure: Parametrization of vocalizations. 7
  • 8. Features Figure: Mel-Fourier Cepstral Coefficients Figure: Wavelet Transform with Lifting Scheme. (MFCCs). 8
  • 9. Obtain the features Figure: Feature extraction. 9
  • 10. Spectrogram Figure: Audio sample (wave form and spectrogram) for the Adenomera andreae.. 10
  • 11. Features Feature Complexity order Computational cost Pitch O(L) 3L − 1 B O(Nlog(N)) 2M + 2M + Nlog(N) 12 MFCC’s O(Nlog(N)) Nlog(N) + N + mR S O(Nlog(N)) 2M + Nlog(N) H1 O(L) L+i H2 O(L) L+i ZC O(L) L E O(L) L Pw O(L) L 11
  • 12. Comparison between MFCCs and Wavelet Features k-NN 0.4 0.5 0.6 Wavelet Features 96.35%(3) 97.86%(1) 98.22%(1) Daubechies Transform Wavelet Features 96.70%(1) 97.90%(1) 98.38%(1) Haar Transform MFCCs 99.19%(9) 99.36%(2) 99.19%(1) Table: Success rate in relation to alpha, using cross-validation fold = 10. Applying the Wilcoxon test, with 95% significance level (α = 0.5), we conclude that the MFCCs have better performance. 12
  • 13. Comparison between MFCCs and Wavelet Objective: To determine the optimal subset of features by applying GA. 13
  • 14. Comparison between MFCCs and Wavelet Features Classification Crossover 50% Success rate Crossover 60% Success rate before GA Mutation 40% Mutation 20% 9 features 97.86%(1) 1,2,3,5 93.73% 1,2,3,4,5,6,8,9 96.83% with Db 9 features 97.90%(1)* 2,3,4,5,6,8,9 96.47% 1,2,3,4,5,6,7,8,9 97.90%* with Haar 12 MFCCs 99.36%(2)* 1,2,3,4,5,6,7,11 99.08% 1,2,3,4,5,6,7,8,91 99.33%* 1,12 14
  • 15. Case of Study fs = 44.1kHz fs = 11kHz fs =5.5kHz
  • 16. Conclusions We indicated how best set of features to choose the 12 MFCCs. You can optimize costs by using 8 MFCCs, although the method loses generality. The MFFCs have: ✔ Better success rate; ✔ Constant cost, regardless of hardware, and ✔ Immunity to environmental and quantization noise. 16
  • 17. Questions? Thanks 17