SlideShare a Scribd company logo
1 of 27
Machine learning
implementations on different
datasets using python
Submitted by: Oanh Doan
Tess Mundadan
Ramandeep
Kaur Bagri
Problem formulation
● Electricity data set
● This is classification
problem of electrical grid
stability into two classes;
● 1.Stable
● 2.Unstable
● Olivetti Dataset
● Recognizing face with highest
accuracy using machine learning
algorithms
● Comparison of their accuracies to
select the most fitted face
recognition algorithm.
About Electricity Data Set
•Electrical Grid data set in which we have different attributes for examine the stability of
the system.
• We will examine the response of the system stability depends on 10,000 observations
and 13 attributes, 1 classes attribute (stabf). Attributes are given in dataset as;
● tau[x] : Reaction time of participant (real from the range [0.5,10]s).
● p[x] : Nominal power consumed(negative)/produced(positive)(real).
● g[x] : Coefficient (gamma) proportional to price elasticity (real from the range
[0.05,1]s^-1).
● stab: The maximal real part of the characteristic equation root (if positive - the
system is linearly unstable)(real)
● stabf: The stability label of the system (stable/unstable)
Head of the data set
Preprocessing
Head of the data set
PCA transformation
Classification using PCA
Method 90:10 80:20 75:25 70:30 Confusion Matrix
Naive bayes 0.706 0.6985 0.6932 0.6997
KNN (N=12) 0.686 0.6735 0.666 0.6723
SVM 0.694 0.689 0.6848 0.6897
Decision Tree 0.618 0.622 0.6352 0.6216
Random Forest 0.661 0.6655 0.6612 0.6703
Classification using without PCA,use standardization
Method 90:10 80:20 75:25 70:30 Confusion
matrix
Naive Bayes 0.831 0.832 0.833 0.83466
KNN (N=12) 0.906 0.9102 0.9108 0.909
SVM 0.81 0.812 0.8108 0.8123
Decision Tree 0.879 0.8715 0.869 0.85766
Random Forest
(n_estimators=6
0)
0.923 0.9215 0.916 0.9186
Classification by ratio 95:5
Method Accuracy Confusion Matrix
Naive Bayes 0.832
KNN 0.92
SVM 0.92
Decision Tree 0.878
Random Forest 0.826
Selecting different features and results
Naive Bayes:
Accuracy - 0.982
KNN:
Accuracy - 0.964
Decision Tree
Accuracy - 1.0
Random Forest:
Accuracy - 1.0
SVM
Accuracy - 99.2
Test: 0.05
Best algorithm fit for electricity simulated data Set for
classification
Random Forest( Using n_estimators = 60 ):
● Test dataset: 90:10
● Accuracy = 0.923
● Confusion Matrix:
● Although the KNN and SVM has given good accuracy at ratio of 95:5, but it is only on
test dataset – need further evaluation on training dataset.
● By selecting different features the accuracy observed is 1, but the model is
overfitted – Need prepossing and proper selection of features to train the models to
improve accuracy.
Step 1: Load and explore the data
Step 2: Visualize the data Olivetti Data set
Python Packages Used
Reshape data
Accuracy Metrics Comparison without PCA
Method 90:10 80:20 70:30
Naive_Bayes
92.5 87.50 73.33
KNN
90.0 91.25 90.00
Random Forest
90.0 93.75 93.33
SVM
95.0 96.25 96.67
Logistic Regression
95.0 97.50 97.50
Dimensionality Reduction using PCA
Mean Face of the Samples
Eigen Faces
Metrics with different Principle Components
Method PC = 90 PC =103 PC = 200
Naive Bayes 76.67 77.50 69.17
KNN 90.83 90.83 90.00
Random Forest 93.33 92.50 91.67
Logistic Regression 96.67 98.33 98.33
SVM 96.67 96.67 96.67
Accuracy Metrics Comparison with PCA
METHOD 90:10 80:20 70:30
Without PCA With PCA Without PCA With PCA Without PCA With PCA
Naive Bayes 92.5 92.5 87.50 90.0 73.33 77.50
KNN 90.0 90.0 91.25 92.5 90.00 90.83
Random Forest 90.0 100.0 93.75 95.0 93.33 92.50
Logistic
Regression
95.0 97.5 96.25 97.5 97.50 98.33
SVM 95.0 95.0 97.5 97.5 96.67 96.67
Best fit for Olivetti data set
Random Forest : PCA = 103
Test dataset = 90:10
Accuracy = 100
Conclusion
Use of PCA improved the accuracy metrics.
Project Enhancement Idea : Identify smile and no Smile face/
Identify male and female
Reference
● https://www.kaggle.com/serkanpeldek/face-recognition-on-olivetti-dataset
● https://colab.research.google.com/notebooks/welcome.ipynb
● https://www.youtube.com/watch?v=PitcORQSjNM
● https://www.youtube.com/watch?v=_VTtrSDHPwU&t=86s
● https://www.kaggle.com/elikplim/eergy-efficiency-dataset/kernels
● Class notes: Introduction to data mining and machine learning
Thank you for your attention

More Related Content

What's hot (20)

Machine Learning for Disease Prediction
Machine Learning for Disease PredictionMachine Learning for Disease Prediction
Machine Learning for Disease Prediction
 
Naive bayes
Naive bayesNaive bayes
Naive bayes
 
K Nearest Neighbors
K Nearest NeighborsK Nearest Neighbors
K Nearest Neighbors
 
supervised learning
supervised learningsupervised learning
supervised learning
 
Machine learning
Machine learning Machine learning
Machine learning
 
Machine learning
Machine learningMachine learning
Machine learning
 
Deep neural networks
Deep neural networksDeep neural networks
Deep neural networks
 
Random forest
Random forestRandom forest
Random forest
 
Back propagation
Back propagationBack propagation
Back propagation
 
Machine Learning and its Applications
Machine Learning and its ApplicationsMachine Learning and its Applications
Machine Learning and its Applications
 
Presentation on K-Means Clustering
Presentation on K-Means ClusteringPresentation on K-Means Clustering
Presentation on K-Means Clustering
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Introduction to Deep Learning
Introduction to Deep LearningIntroduction to Deep Learning
Introduction to Deep Learning
 
Machine Learning Final presentation
Machine Learning Final presentation Machine Learning Final presentation
Machine Learning Final presentation
 
Ensemble learning
Ensemble learningEnsemble learning
Ensemble learning
 
Lecture1 introduction to machine learning
Lecture1 introduction to machine learningLecture1 introduction to machine learning
Lecture1 introduction to machine learning
 
Machine Learning with Decision trees
Machine Learning with Decision treesMachine Learning with Decision trees
Machine Learning with Decision trees
 
Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?
 
Perceptron (neural network)
Perceptron (neural network)Perceptron (neural network)
Perceptron (neural network)
 
Multilayer perceptron
Multilayer perceptronMultilayer perceptron
Multilayer perceptron
 

Similar to Machine Learning project presentation

Electrical Grid Stability Simulated Data Set analysis using R
Electrical Grid Stability Simulated Data Set analysis using RElectrical Grid Stability Simulated Data Set analysis using R
Electrical Grid Stability Simulated Data Set analysis using RRamandeep Kaur Bagri
 
House Sale Price Prediction
House Sale Price PredictionHouse Sale Price Prediction
House Sale Price Predictionsriram30691
 
Techniques in Deep Learning
Techniques in Deep LearningTechniques in Deep Learning
Techniques in Deep LearningSourya Dey
 
Andres hernandez ai_machine_learning_london_nov2017
Andres hernandez ai_machine_learning_london_nov2017Andres hernandez ai_machine_learning_london_nov2017
Andres hernandez ai_machine_learning_london_nov2017Andres Hernandez
 
Poster_Reseau_Neurones_Journees_2013
Poster_Reseau_Neurones_Journees_2013Poster_Reseau_Neurones_Journees_2013
Poster_Reseau_Neurones_Journees_2013Pedro Lopes
 
A temporal classifier system using spiking neural networks
A temporal classifier system using spiking neural networksA temporal classifier system using spiking neural networks
A temporal classifier system using spiking neural networksDaniele Loiacono
 
Predicting rainfall using ensemble of ensembles
Predicting rainfall using ensemble of ensemblesPredicting rainfall using ensemble of ensembles
Predicting rainfall using ensemble of ensemblesVarad Meru
 
Test different neural networks models for forecasting of wind,solar and energ...
Test different neural networks models for forecasting of wind,solar and energ...Test different neural networks models for forecasting of wind,solar and energ...
Test different neural networks models for forecasting of wind,solar and energ...Tonmoy Ibne Arif
 
Thesis Presentation on Energy Efficiency Improvement in Data Centers
Thesis Presentation on Energy Efficiency Improvement in Data CentersThesis Presentation on Energy Efficiency Improvement in Data Centers
Thesis Presentation on Energy Efficiency Improvement in Data CentersMonica Vitali
 
Huong dan cu the svm
Huong dan cu the svmHuong dan cu the svm
Huong dan cu the svmtaikhoan262
 
A NEW TOOL FOR LARGE SCALE POWER SYSTEM TRANSIENT SECURITY ASSESSMENT
A NEW TOOL FOR LARGE SCALE POWER SYSTEM TRANSIENT SECURITY ASSESSMENTA NEW TOOL FOR LARGE SCALE POWER SYSTEM TRANSIENT SECURITY ASSESSMENT
A NEW TOOL FOR LARGE SCALE POWER SYSTEM TRANSIENT SECURITY ASSESSMENTPower System Operation
 
AI Class Topic 6: Easy Way to Learn Deep Learning AI Technologies
AI Class Topic 6: Easy Way to Learn Deep Learning AI TechnologiesAI Class Topic 6: Easy Way to Learn Deep Learning AI Technologies
AI Class Topic 6: Easy Way to Learn Deep Learning AI TechnologiesValue Amplify Consulting
 
Performance assessment of an optimization strategy proposed for power systems
Performance assessment of an optimization strategy proposed for power systemsPerformance assessment of an optimization strategy proposed for power systems
Performance assessment of an optimization strategy proposed for power systemsTELKOMNIKA JOURNAL
 
Quantum Machine Learning for IBM AI
Quantum Machine Learning for IBM AIQuantum Machine Learning for IBM AI
Quantum Machine Learning for IBM AISasha Lazarevic
 
Advance mathematics mid term presentation rev01
Advance mathematics mid term presentation rev01Advance mathematics mid term presentation rev01
Advance mathematics mid term presentation rev01Nirmal Joshi
 
Bioactivity Predictive ModelingMay2016
Bioactivity Predictive ModelingMay2016Bioactivity Predictive ModelingMay2016
Bioactivity Predictive ModelingMay2016Matthew Clark
 
Principal component analysis and lda
Principal component analysis and ldaPrincipal component analysis and lda
Principal component analysis and ldaSuresh Pokharel
 
Detection focal loss 딥러닝 논문읽기 모임 발표자료
Detection focal loss 딥러닝 논문읽기 모임 발표자료Detection focal loss 딥러닝 논문읽기 모임 발표자료
Detection focal loss 딥러닝 논문읽기 모임 발표자료taeseon ryu
 

Similar to Machine Learning project presentation (20)

Electrical Grid Stability Simulated Data Set analysis using R
Electrical Grid Stability Simulated Data Set analysis using RElectrical Grid Stability Simulated Data Set analysis using R
Electrical Grid Stability Simulated Data Set analysis using R
 
House Sale Price Prediction
House Sale Price PredictionHouse Sale Price Prediction
House Sale Price Prediction
 
Techniques in Deep Learning
Techniques in Deep LearningTechniques in Deep Learning
Techniques in Deep Learning
 
Andres hernandez ai_machine_learning_london_nov2017
Andres hernandez ai_machine_learning_london_nov2017Andres hernandez ai_machine_learning_london_nov2017
Andres hernandez ai_machine_learning_london_nov2017
 
Poster_Reseau_Neurones_Journees_2013
Poster_Reseau_Neurones_Journees_2013Poster_Reseau_Neurones_Journees_2013
Poster_Reseau_Neurones_Journees_2013
 
A temporal classifier system using spiking neural networks
A temporal classifier system using spiking neural networksA temporal classifier system using spiking neural networks
A temporal classifier system using spiking neural networks
 
Predicting rainfall using ensemble of ensembles
Predicting rainfall using ensemble of ensemblesPredicting rainfall using ensemble of ensembles
Predicting rainfall using ensemble of ensembles
 
Test different neural networks models for forecasting of wind,solar and energ...
Test different neural networks models for forecasting of wind,solar and energ...Test different neural networks models for forecasting of wind,solar and energ...
Test different neural networks models for forecasting of wind,solar and energ...
 
Thesis Presentation on Energy Efficiency Improvement in Data Centers
Thesis Presentation on Energy Efficiency Improvement in Data CentersThesis Presentation on Energy Efficiency Improvement in Data Centers
Thesis Presentation on Energy Efficiency Improvement in Data Centers
 
Guide
GuideGuide
Guide
 
Huong dan cu the svm
Huong dan cu the svmHuong dan cu the svm
Huong dan cu the svm
 
A NEW TOOL FOR LARGE SCALE POWER SYSTEM TRANSIENT SECURITY ASSESSMENT
A NEW TOOL FOR LARGE SCALE POWER SYSTEM TRANSIENT SECURITY ASSESSMENTA NEW TOOL FOR LARGE SCALE POWER SYSTEM TRANSIENT SECURITY ASSESSMENT
A NEW TOOL FOR LARGE SCALE POWER SYSTEM TRANSIENT SECURITY ASSESSMENT
 
AI Class Topic 6: Easy Way to Learn Deep Learning AI Technologies
AI Class Topic 6: Easy Way to Learn Deep Learning AI TechnologiesAI Class Topic 6: Easy Way to Learn Deep Learning AI Technologies
AI Class Topic 6: Easy Way to Learn Deep Learning AI Technologies
 
Guide
GuideGuide
Guide
 
Performance assessment of an optimization strategy proposed for power systems
Performance assessment of an optimization strategy proposed for power systemsPerformance assessment of an optimization strategy proposed for power systems
Performance assessment of an optimization strategy proposed for power systems
 
Quantum Machine Learning for IBM AI
Quantum Machine Learning for IBM AIQuantum Machine Learning for IBM AI
Quantum Machine Learning for IBM AI
 
Advance mathematics mid term presentation rev01
Advance mathematics mid term presentation rev01Advance mathematics mid term presentation rev01
Advance mathematics mid term presentation rev01
 
Bioactivity Predictive ModelingMay2016
Bioactivity Predictive ModelingMay2016Bioactivity Predictive ModelingMay2016
Bioactivity Predictive ModelingMay2016
 
Principal component analysis and lda
Principal component analysis and ldaPrincipal component analysis and lda
Principal component analysis and lda
 
Detection focal loss 딥러닝 논문읽기 모임 발표자료
Detection focal loss 딥러닝 논문읽기 모임 발표자료Detection focal loss 딥러닝 논문읽기 모임 발표자료
Detection focal loss 딥러닝 논문읽기 모임 발표자료
 

Recently uploaded

Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxFurkanTasci3
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Call Girls In Noida City Center Metro 24/7✡️9711147426✡️ Escorts Service
Call Girls In Noida City Center Metro 24/7✡️9711147426✡️ Escorts ServiceCall Girls In Noida City Center Metro 24/7✡️9711147426✡️ Escorts Service
Call Girls In Noida City Center Metro 24/7✡️9711147426✡️ Escorts Servicejennyeacort
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 

Recently uploaded (20)

E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptx
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Call Girls In Noida City Center Metro 24/7✡️9711147426✡️ Escorts Service
Call Girls In Noida City Center Metro 24/7✡️9711147426✡️ Escorts ServiceCall Girls In Noida City Center Metro 24/7✡️9711147426✡️ Escorts Service
Call Girls In Noida City Center Metro 24/7✡️9711147426✡️ Escorts Service
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 

Machine Learning project presentation

  • 1. Machine learning implementations on different datasets using python Submitted by: Oanh Doan Tess Mundadan Ramandeep Kaur Bagri
  • 2. Problem formulation ● Electricity data set ● This is classification problem of electrical grid stability into two classes; ● 1.Stable ● 2.Unstable ● Olivetti Dataset ● Recognizing face with highest accuracy using machine learning algorithms ● Comparison of their accuracies to select the most fitted face recognition algorithm.
  • 3. About Electricity Data Set •Electrical Grid data set in which we have different attributes for examine the stability of the system. • We will examine the response of the system stability depends on 10,000 observations and 13 attributes, 1 classes attribute (stabf). Attributes are given in dataset as; ● tau[x] : Reaction time of participant (real from the range [0.5,10]s). ● p[x] : Nominal power consumed(negative)/produced(positive)(real). ● g[x] : Coefficient (gamma) proportional to price elasticity (real from the range [0.05,1]s^-1). ● stab: The maximal real part of the characteristic equation root (if positive - the system is linearly unstable)(real) ● stabf: The stability label of the system (stable/unstable)
  • 4. Head of the data set
  • 6. Head of the data set
  • 8. Classification using PCA Method 90:10 80:20 75:25 70:30 Confusion Matrix Naive bayes 0.706 0.6985 0.6932 0.6997 KNN (N=12) 0.686 0.6735 0.666 0.6723 SVM 0.694 0.689 0.6848 0.6897 Decision Tree 0.618 0.622 0.6352 0.6216 Random Forest 0.661 0.6655 0.6612 0.6703
  • 9. Classification using without PCA,use standardization Method 90:10 80:20 75:25 70:30 Confusion matrix Naive Bayes 0.831 0.832 0.833 0.83466 KNN (N=12) 0.906 0.9102 0.9108 0.909 SVM 0.81 0.812 0.8108 0.8123 Decision Tree 0.879 0.8715 0.869 0.85766 Random Forest (n_estimators=6 0) 0.923 0.9215 0.916 0.9186
  • 10. Classification by ratio 95:5 Method Accuracy Confusion Matrix Naive Bayes 0.832 KNN 0.92 SVM 0.92 Decision Tree 0.878 Random Forest 0.826
  • 11. Selecting different features and results Naive Bayes: Accuracy - 0.982 KNN: Accuracy - 0.964 Decision Tree Accuracy - 1.0 Random Forest: Accuracy - 1.0 SVM Accuracy - 99.2 Test: 0.05
  • 12. Best algorithm fit for electricity simulated data Set for classification Random Forest( Using n_estimators = 60 ): ● Test dataset: 90:10 ● Accuracy = 0.923 ● Confusion Matrix: ● Although the KNN and SVM has given good accuracy at ratio of 95:5, but it is only on test dataset – need further evaluation on training dataset. ● By selecting different features the accuracy observed is 1, but the model is overfitted – Need prepossing and proper selection of features to train the models to improve accuracy.
  • 13. Step 1: Load and explore the data
  • 14. Step 2: Visualize the data Olivetti Data set
  • 17.
  • 18. Accuracy Metrics Comparison without PCA Method 90:10 80:20 70:30 Naive_Bayes 92.5 87.50 73.33 KNN 90.0 91.25 90.00 Random Forest 90.0 93.75 93.33 SVM 95.0 96.25 96.67 Logistic Regression 95.0 97.50 97.50
  • 20. Mean Face of the Samples
  • 22. Metrics with different Principle Components Method PC = 90 PC =103 PC = 200 Naive Bayes 76.67 77.50 69.17 KNN 90.83 90.83 90.00 Random Forest 93.33 92.50 91.67 Logistic Regression 96.67 98.33 98.33 SVM 96.67 96.67 96.67
  • 23. Accuracy Metrics Comparison with PCA METHOD 90:10 80:20 70:30 Without PCA With PCA Without PCA With PCA Without PCA With PCA Naive Bayes 92.5 92.5 87.50 90.0 73.33 77.50 KNN 90.0 90.0 91.25 92.5 90.00 90.83 Random Forest 90.0 100.0 93.75 95.0 93.33 92.50 Logistic Regression 95.0 97.5 96.25 97.5 97.50 98.33 SVM 95.0 95.0 97.5 97.5 96.67 96.67
  • 24. Best fit for Olivetti data set Random Forest : PCA = 103 Test dataset = 90:10 Accuracy = 100
  • 25. Conclusion Use of PCA improved the accuracy metrics. Project Enhancement Idea : Identify smile and no Smile face/ Identify male and female
  • 26. Reference ● https://www.kaggle.com/serkanpeldek/face-recognition-on-olivetti-dataset ● https://colab.research.google.com/notebooks/welcome.ipynb ● https://www.youtube.com/watch?v=PitcORQSjNM ● https://www.youtube.com/watch?v=_VTtrSDHPwU&t=86s ● https://www.kaggle.com/elikplim/eergy-efficiency-dataset/kernels ● Class notes: Introduction to data mining and machine learning
  • 27. Thank you for your attention

Editor's Notes

  1. Data is collected by taking the pictures of 40 different human personalities in each column of the dataset. Each row is the sample of one human personality with different face emojis to train the dataset. Dimensions: 10:400
  2. No N/As Machine learning models can work on vectors. Since the image data is in the matrix form, it must be converted to a vector. Different testing datasets are being used: 0.1,0.2,0.3.
  3. SVM is not affected by PCA 100 is due to small and clean data set Increasing the percentage of training set improves accuracy