SlideShare a Scribd company logo
1 of 11
EMAIL SPAM DETECTION USING MACHINE LEARNING
ALGORITMS
CONTENTS
•ABSTRACT
•PROBLEMDEFINITION
•OBJECTIVE
•INTRODUCTION
•LITERATURESURVEY
•CONTENTDIAGRAM
•CONCLUSION
ABSTRACT
Email is an information stored on a computer that is exchanged between two usersover telecommunication.
Spam is one of the major threats posed to email users. Spam refers to the electronicmessaging system to send out unrequested or unwanted messages in bulk.
The privacy and security of large amount of sensitive data arethreadbymalicious spam.
Data mining has many approaches and alogorithms for email filtering. A classifier is asupervised function where the learned attribute is categorical.
Context based methods analyze the control of the email to determine if the email is spamor not.
Problem definition
1.Dataset:Obtainalabeleddatasetconsistingofbothspamandnon-spam(ham)emails.Thedatasetshouldberepresentativeofthe typesofemailstypicallyencounteredin
thetargetenvironment.
2.FeatureSelection:Identifyrelevantfeatureswithintheemailcontent,metadata,andstructuralelementsthatcanbeusedto distinguishbetweenspamandnon-spam
emails.Thesemayincludewordfrequency,presenceofcertainkeywords,senderinformation,emailheaderanalysis,etc.
3.ModelSelection:Chooseappropriatemachinelearningorstatisticalmodelsforclassificationtasks,suchasNaiveBayes,SupportVectorMachines(SVM),RandomForests,
ordeeplearningmodelslikeRecurrentNeuralNetworks(RNNs)orConvolutionalNeuralNetworks(CNNs).
4.Preprocessing:Cleanandpreprocesstheemaildata,includingtaskssuchastokenization,stemming,removalofstopwords,andnormalizationoffeaturestoimprove
modelperformance.
5.TrainingandEvaluation:Traintheselectedmodel(s)onthelabeleddatasetandevaluatetheirperformanceusingmetricssuch asaccuracy,precision,recall,F1-score,
andareaundertheROCcurve(AUC).
6.HyperparameterTuning:Fine-tunemodelhyperparameterstooptimizeperformanceandgeneralizationonunseendata.Techniquessuchascross-validationorgrid
searchcanbeemployedforthispurpose.
7.Deployment:Implementthetrainedmodelintoaproductionenvironmentwhereitcanautomaticallyclassifyincomingemailsas spamornon-spaminreal-time.
8.MonitoringandMaintenance:Regularlymonitortheperformanceofthedeployedsystemandupdatethemodelasneededtoadapt tochangingpatternsinemailspam.
ALGORITHMS:
• Algorithms used for the title for the Research in order to get the enhanced version to predict email
spam detection
•Logistic Regression Algorithm
•KNN Algorithm
•Random Forest Algorithm
•Naïve Baye’s Algorithm
•SVM Algorithm
• The main algorithm used in the title is “Random forest” and the remaining algorithms such as
“KNN Algorithm, Random Forest Algorithm,Naïve Baye’s Algorithm,SVM Algorithm” will be used
as the comparing algorithms with the main algorithm
STEPS USED IN RESEARCH TO GET ACCURACY
1.Data collection.
2.Data Preprocessing.
3.Splitting the dataset into Training set and Testing set.
4.Model Selection according to the Algorithm that to be
performed.
5.Model Training with the splitted Training set data’s.
6.Model Evaluation on Performance of appropriate
Accuracies.
7.Optimization of Parameters in Training set with the Model
.
8.Error Analysis of Selected Model .
9.Testing the Model with splitted Testing data set.
10.Getting Final Accuracy of the Tested dataset for
CAMPARISION OF LOGISTIC REGRESSION ALGORITHM AND KNN ALGORITHM
• The main algorithm “Logistic regression” will be compared with the “KNN Algorithm”.
• As the Result Logistic regression is getting Mean Accuracy of 98.00% percent , whereas
KNN algorithm is getting Mean Accuracy of 85.2% percent.
CAMPARISION OF LOGISTIC REGRESSION ALGORITHM AND NAÏVE BAYES
ALGORITHM
• The main algorithm “Logistic regression” will be compared with the “Naïve Baye’s Algorithm”.
• As the Result Logistic regression is getting Mean Accuracy of 98.00% percent , whereas Naïve
baye’s algorithm is getting Mean Accuracy of 97.1% percent.
CAMPARISION OF LOGISTIC REGRESSION ALGORITHM AND RANDOM
FOREST ALGORITHM
• The main algorithm “Logistic regression” will be compared with the “Random forest Algorithm”.
• As the Result Logistic regression is getting Mean Accuracy of 98.80% percent , whereas Random Forest algorithm is getting
Mean Accuracy of 98.1% percent.
CAMPARISION OF LOGISTIC REGRESSION ALGORITHM AND SVM
ALGORITHM
• The main algorithm “Logistic regression” will be compared with the “Random forest Algorithm”.
• As the Result Logistic regression is getting Mean Accuracy of 98.80% percent , whereas SVM algorithm is getting
Mean Accuracy of 95.00% percent
CONCLUSION
• In conclusion, this research concentrated on prediction of malware employing a classifier model
incorporating Random Forest, contrasting it with the Comparing algorithms. Findings unveiled a
significant accuracy advantage for Random Forest (97.2%) over the Algorithms with Accuracies such as :
Logistic Regression
KNN algorithm
Naïve Baye’s algorithm
SVM algorithm
Random Forest algorithm

More Related Content

Similar to machine learning project with advanced technology and am uploading it for my ppt for .pptx

EMAIL SPAM CLASSIFICATION USING HYBRID APPROACH OF RBF NEURAL NETWORK AND PAR...
EMAIL SPAM CLASSIFICATION USING HYBRID APPROACH OF RBF NEURAL NETWORK AND PAR...EMAIL SPAM CLASSIFICATION USING HYBRID APPROACH OF RBF NEURAL NETWORK AND PAR...
EMAIL SPAM CLASSIFICATION USING HYBRID APPROACH OF RBF NEURAL NETWORK AND PAR...IJNSA Journal
 
EMAIL SPAM CLASSIFICATION USING HYBRID APPROACH OF RBF NEURAL NETWORK AND PAR...
EMAIL SPAM CLASSIFICATION USING HYBRID APPROACH OF RBF NEURAL NETWORK AND PAR...EMAIL SPAM CLASSIFICATION USING HYBRID APPROACH OF RBF NEURAL NETWORK AND PAR...
EMAIL SPAM CLASSIFICATION USING HYBRID APPROACH OF RBF NEURAL NETWORK AND PAR...IJNSA Journal
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningLior Rokach
 
Characterizing Faults, Errors and Failures in Extreme-Scale Computing Systems
Characterizing Faults, Errors and Failures in Extreme-Scale Computing SystemsCharacterizing Faults, Errors and Failures in Extreme-Scale Computing Systems
Characterizing Faults, Errors and Failures in Extreme-Scale Computing Systemsinside-BigData.com
 
PyData London 2018 talk on feature selection
PyData London 2018 talk on feature selectionPyData London 2018 talk on feature selection
PyData London 2018 talk on feature selectionThomas Huijskens
 
NLP in Healthcare to Predict Adverse Events with Amazon SageMaker (AIM346) - ...
NLP in Healthcare to Predict Adverse Events with Amazon SageMaker (AIM346) - ...NLP in Healthcare to Predict Adverse Events with Amazon SageMaker (AIM346) - ...
NLP in Healthcare to Predict Adverse Events with Amazon SageMaker (AIM346) - ...Amazon Web Services
 
Case Studies IN Machine Learning
Case Studies IN Machine Learning Case Studies IN Machine Learning
Case Studies IN Machine Learning HIMADRI BANERJI
 
Sample_Subjective_Questions_Answers (1).pdf
Sample_Subjective_Questions_Answers (1).pdfSample_Subjective_Questions_Answers (1).pdf
Sample_Subjective_Questions_Answers (1).pdfAaryanArora10
 
Machine Learning techniques used in AI.
Machine Learning  techniques used in AI.Machine Learning  techniques used in AI.
Machine Learning techniques used in AI.ArchanaT32
 
Farid Ali Presentation_Final.pptx
Farid Ali Presentation_Final.pptxFarid Ali Presentation_Final.pptx
Farid Ali Presentation_Final.pptxFaridAliMousa1
 
Machine learning and linear regression programming
Machine learning and linear regression programmingMachine learning and linear regression programming
Machine learning and linear regression programmingSoumya Mukherjee
 
Deep belief networks for spam filtering
Deep belief networks for spam filteringDeep belief networks for spam filtering
Deep belief networks for spam filteringSOYEON KIM
 
WORKLOAD CHARACTERIZATION OF SPAM EMAIL FILTERING SYSTEMS
WORKLOAD CHARACTERIZATION OF SPAM EMAIL FILTERING SYSTEMSWORKLOAD CHARACTERIZATION OF SPAM EMAIL FILTERING SYSTEMS
WORKLOAD CHARACTERIZATION OF SPAM EMAIL FILTERING SYSTEMSIJNSA Journal
 
Performance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning AlgorithmsPerformance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning AlgorithmsDinusha Dilanka
 
Convolutional neural network
Convolutional neural networkConvolutional neural network
Convolutional neural networkItachi SK
 
Trinity of AI: data, algorithms and cloud
Trinity of AI: data, algorithms and cloudTrinity of AI: data, algorithms and cloud
Trinity of AI: data, algorithms and cloudAnima Anandkumar
 
Improving Spam Mail Filtering Using Classification Algorithms With Partition ...
Improving Spam Mail Filtering Using Classification Algorithms With Partition ...Improving Spam Mail Filtering Using Classification Algorithms With Partition ...
Improving Spam Mail Filtering Using Classification Algorithms With Partition ...IRJET Journal
 
671gdhfhfghhfhfghfghfghfgh163663-Project-2-PPT.pptx
671gdhfhfghhfhfghfghfghfgh163663-Project-2-PPT.pptx671gdhfhfghhfhfghfghfghfgh163663-Project-2-PPT.pptx
671gdhfhfghhfhfghfghfghfgh163663-Project-2-PPT.pptx0901CS211114SOURAVDI
 
Caim discretization algorithm
Caim discretization algorithmCaim discretization algorithm
Caim discretization algorithmenok7
 

Similar to machine learning project with advanced technology and am uploading it for my ppt for .pptx (20)

EMAIL SPAM CLASSIFICATION USING HYBRID APPROACH OF RBF NEURAL NETWORK AND PAR...
EMAIL SPAM CLASSIFICATION USING HYBRID APPROACH OF RBF NEURAL NETWORK AND PAR...EMAIL SPAM CLASSIFICATION USING HYBRID APPROACH OF RBF NEURAL NETWORK AND PAR...
EMAIL SPAM CLASSIFICATION USING HYBRID APPROACH OF RBF NEURAL NETWORK AND PAR...
 
EMAIL SPAM CLASSIFICATION USING HYBRID APPROACH OF RBF NEURAL NETWORK AND PAR...
EMAIL SPAM CLASSIFICATION USING HYBRID APPROACH OF RBF NEURAL NETWORK AND PAR...EMAIL SPAM CLASSIFICATION USING HYBRID APPROACH OF RBF NEURAL NETWORK AND PAR...
EMAIL SPAM CLASSIFICATION USING HYBRID APPROACH OF RBF NEURAL NETWORK AND PAR...
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Characterizing Faults, Errors and Failures in Extreme-Scale Computing Systems
Characterizing Faults, Errors and Failures in Extreme-Scale Computing SystemsCharacterizing Faults, Errors and Failures in Extreme-Scale Computing Systems
Characterizing Faults, Errors and Failures in Extreme-Scale Computing Systems
 
PyData London 2018 talk on feature selection
PyData London 2018 talk on feature selectionPyData London 2018 talk on feature selection
PyData London 2018 talk on feature selection
 
NLP in Healthcare to Predict Adverse Events with Amazon SageMaker (AIM346) - ...
NLP in Healthcare to Predict Adverse Events with Amazon SageMaker (AIM346) - ...NLP in Healthcare to Predict Adverse Events with Amazon SageMaker (AIM346) - ...
NLP in Healthcare to Predict Adverse Events with Amazon SageMaker (AIM346) - ...
 
Case Studies IN Machine Learning
Case Studies IN Machine Learning Case Studies IN Machine Learning
Case Studies IN Machine Learning
 
Sample_Subjective_Questions_Answers (1).pdf
Sample_Subjective_Questions_Answers (1).pdfSample_Subjective_Questions_Answers (1).pdf
Sample_Subjective_Questions_Answers (1).pdf
 
Machine Learning techniques used in AI.
Machine Learning  techniques used in AI.Machine Learning  techniques used in AI.
Machine Learning techniques used in AI.
 
Farid Ali Presentation_Final.pptx
Farid Ali Presentation_Final.pptxFarid Ali Presentation_Final.pptx
Farid Ali Presentation_Final.pptx
 
Machine learning and linear regression programming
Machine learning and linear regression programmingMachine learning and linear regression programming
Machine learning and linear regression programming
 
Deep belief networks for spam filtering
Deep belief networks for spam filteringDeep belief networks for spam filtering
Deep belief networks for spam filtering
 
WORKLOAD CHARACTERIZATION OF SPAM EMAIL FILTERING SYSTEMS
WORKLOAD CHARACTERIZATION OF SPAM EMAIL FILTERING SYSTEMSWORKLOAD CHARACTERIZATION OF SPAM EMAIL FILTERING SYSTEMS
WORKLOAD CHARACTERIZATION OF SPAM EMAIL FILTERING SYSTEMS
 
Performance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning AlgorithmsPerformance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning Algorithms
 
Convolutional neural network
Convolutional neural networkConvolutional neural network
Convolutional neural network
 
Trinity of AI: data, algorithms and cloud
Trinity of AI: data, algorithms and cloudTrinity of AI: data, algorithms and cloud
Trinity of AI: data, algorithms and cloud
 
Improving Spam Mail Filtering Using Classification Algorithms With Partition ...
Improving Spam Mail Filtering Using Classification Algorithms With Partition ...Improving Spam Mail Filtering Using Classification Algorithms With Partition ...
Improving Spam Mail Filtering Using Classification Algorithms With Partition ...
 
671gdhfhfghhfhfghfghfghfgh163663-Project-2-PPT.pptx
671gdhfhfghhfhfghfghfghfgh163663-Project-2-PPT.pptx671gdhfhfghhfhfghfghfghfgh163663-Project-2-PPT.pptx
671gdhfhfghhfhfghfghfghfgh163663-Project-2-PPT.pptx
 
Caim discretization algorithm
Caim discretization algorithmCaim discretization algorithm
Caim discretization algorithm
 
CREDIT_CARD.ppt
CREDIT_CARD.pptCREDIT_CARD.ppt
CREDIT_CARD.ppt
 

Recently uploaded

RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxFurkanTasci3
 
Spark3's new memory model/management
Spark3's new memory model/managementSpark3's new memory model/management
Spark3's new memory model/managementakshesh doshi
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 

Recently uploaded (20)

Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptx
 
Spark3's new memory model/management
Spark3's new memory model/managementSpark3's new memory model/management
Spark3's new memory model/management
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 

machine learning project with advanced technology and am uploading it for my ppt for .pptx

  • 1. EMAIL SPAM DETECTION USING MACHINE LEARNING ALGORITMS
  • 3. ABSTRACT Email is an information stored on a computer that is exchanged between two usersover telecommunication. Spam is one of the major threats posed to email users. Spam refers to the electronicmessaging system to send out unrequested or unwanted messages in bulk. The privacy and security of large amount of sensitive data arethreadbymalicious spam. Data mining has many approaches and alogorithms for email filtering. A classifier is asupervised function where the learned attribute is categorical. Context based methods analyze the control of the email to determine if the email is spamor not.
  • 4. Problem definition 1.Dataset:Obtainalabeleddatasetconsistingofbothspamandnon-spam(ham)emails.Thedatasetshouldberepresentativeofthe typesofemailstypicallyencounteredin thetargetenvironment. 2.FeatureSelection:Identifyrelevantfeatureswithintheemailcontent,metadata,andstructuralelementsthatcanbeusedto distinguishbetweenspamandnon-spam emails.Thesemayincludewordfrequency,presenceofcertainkeywords,senderinformation,emailheaderanalysis,etc. 3.ModelSelection:Chooseappropriatemachinelearningorstatisticalmodelsforclassificationtasks,suchasNaiveBayes,SupportVectorMachines(SVM),RandomForests, ordeeplearningmodelslikeRecurrentNeuralNetworks(RNNs)orConvolutionalNeuralNetworks(CNNs). 4.Preprocessing:Cleanandpreprocesstheemaildata,includingtaskssuchastokenization,stemming,removalofstopwords,andnormalizationoffeaturestoimprove modelperformance. 5.TrainingandEvaluation:Traintheselectedmodel(s)onthelabeleddatasetandevaluatetheirperformanceusingmetricssuch asaccuracy,precision,recall,F1-score, andareaundertheROCcurve(AUC). 6.HyperparameterTuning:Fine-tunemodelhyperparameterstooptimizeperformanceandgeneralizationonunseendata.Techniquessuchascross-validationorgrid searchcanbeemployedforthispurpose. 7.Deployment:Implementthetrainedmodelintoaproductionenvironmentwhereitcanautomaticallyclassifyincomingemailsas spamornon-spaminreal-time. 8.MonitoringandMaintenance:Regularlymonitortheperformanceofthedeployedsystemandupdatethemodelasneededtoadapt tochangingpatternsinemailspam.
  • 5. ALGORITHMS: • Algorithms used for the title for the Research in order to get the enhanced version to predict email spam detection •Logistic Regression Algorithm •KNN Algorithm •Random Forest Algorithm •Naïve Baye’s Algorithm •SVM Algorithm • The main algorithm used in the title is “Random forest” and the remaining algorithms such as “KNN Algorithm, Random Forest Algorithm,Naïve Baye’s Algorithm,SVM Algorithm” will be used as the comparing algorithms with the main algorithm
  • 6. STEPS USED IN RESEARCH TO GET ACCURACY 1.Data collection. 2.Data Preprocessing. 3.Splitting the dataset into Training set and Testing set. 4.Model Selection according to the Algorithm that to be performed. 5.Model Training with the splitted Training set data’s. 6.Model Evaluation on Performance of appropriate Accuracies. 7.Optimization of Parameters in Training set with the Model . 8.Error Analysis of Selected Model . 9.Testing the Model with splitted Testing data set. 10.Getting Final Accuracy of the Tested dataset for
  • 7. CAMPARISION OF LOGISTIC REGRESSION ALGORITHM AND KNN ALGORITHM • The main algorithm “Logistic regression” will be compared with the “KNN Algorithm”. • As the Result Logistic regression is getting Mean Accuracy of 98.00% percent , whereas KNN algorithm is getting Mean Accuracy of 85.2% percent.
  • 8. CAMPARISION OF LOGISTIC REGRESSION ALGORITHM AND NAÏVE BAYES ALGORITHM • The main algorithm “Logistic regression” will be compared with the “Naïve Baye’s Algorithm”. • As the Result Logistic regression is getting Mean Accuracy of 98.00% percent , whereas Naïve baye’s algorithm is getting Mean Accuracy of 97.1% percent.
  • 9. CAMPARISION OF LOGISTIC REGRESSION ALGORITHM AND RANDOM FOREST ALGORITHM • The main algorithm “Logistic regression” will be compared with the “Random forest Algorithm”. • As the Result Logistic regression is getting Mean Accuracy of 98.80% percent , whereas Random Forest algorithm is getting Mean Accuracy of 98.1% percent.
  • 10. CAMPARISION OF LOGISTIC REGRESSION ALGORITHM AND SVM ALGORITHM • The main algorithm “Logistic regression” will be compared with the “Random forest Algorithm”. • As the Result Logistic regression is getting Mean Accuracy of 98.80% percent , whereas SVM algorithm is getting Mean Accuracy of 95.00% percent
  • 11. CONCLUSION • In conclusion, this research concentrated on prediction of malware employing a classifier model incorporating Random Forest, contrasting it with the Comparing algorithms. Findings unveiled a significant accuracy advantage for Random Forest (97.2%) over the Algorithms with Accuracies such as : Logistic Regression KNN algorithm Naïve Baye’s algorithm SVM algorithm Random Forest algorithm