SlideShare a Scribd company logo
FOUNDATIONS OF DATA SCIENCE
Course Based Project
CUSTOMER SEGMENTATION USING K-MEANS CLUSTERING
NAME: Yelubandi Aravind
Course: M.Tech (Embedded Systems)
Roll No: UCC22ECES11
1
National Institute of Electronics and
Information Technology
Ministry of Electronics and Information Technology, Government of India
Calicut, Kerala-673601
2
Background of the project
 Nowadays, success in business relies on generating innovative ideas, especially because there are
many potential customers who are unsure about what products or services to choose.
 This is where machine learning comes into play, by applying various algorithms, we can uncover
hidden patterns in data, enabling better decision making.
 To achieve this, we can use a clustering technique called the K-means and DBSCAN algorithms, which
divides customers into clusters based on similarities.
 To determine the optimal number of clusters, we can utilize the elbow method, which helps us find
the right balance between capturing enough distinct segments while avoiding excessive complexity.
 we doing Silhouette Score validation for checking number of clusters determined through elbow
method.
Literature Survey
3
S.No CITATION KEY IDEA/APPROACH LIMITATIONS/REMARKS
1
R. Gupta, A. Verma and H. O. Topal,
"Customer Segmentation of Indian
restaurants on the basis of geographical
locations using Machine Learning," 2021
International Conference on Technological
Advancements and Innovations (ICTAI),
Tashkent, Uzbekistan, 2021, pp. 382-387,
doi: 10.1109/ICTAI53825.2021.9673153.
In this paper, They have
implemented the K-Means
clustering algorithm on the
dataset of all the restaurants
present in Bangalore based
in Python Language.
2
Tushar Kansal, Suraj Bahuguna, Vishal
Singh, Tanupriya Choudhury (June 2018).
Customer Segmentation using K-means
Clustering.
3
Mr. M. Sathyanarayana, S. Dhanish, P.
Shiva Kumar, A. Niranjan Reddy
(December 2022). Mall Customer
Segmentation Using Clustering Algorithm.
4
S.No CITATION KEY IDEA/APPROACH LIMITATIONS/REMARKS
4
Hemashree Kilari, Sailesh
Edara, Guna Ratna Sai
Yarra, Dileep Varma
Gadhiraju (March 2022).
Customer Segmentation
using K-Means Clustering.
5
Pavithra M, Ayushman
Prashar, Abirami (July
2022). Maximizing Strategy
in Customer Segmentation
Using Different Clustering
Techniques.
6
Mathesh T, Sumathy G,
Maheshwari A ( May 2023).
A Machine Learning
approach to segment the
customers of online sales
data for better and efficient
marketing purposes.
5
Problem Statement
 In Business the owners wants to know the data like which type of customers are coming for
purchasing. So, to get this type of data using dataset that they have is complex.
 The dataset that the Business units having is unsupervised data so it is complex to derive
required data from that dataset.
 To overcome this, In machine learning there is method called clustering which will divide
customers into different groups based on their purchasing history.
 The project aims to create a simple machine-learning model in Python for customers
segmentation using K-means and DBSCAN algorithms in clustering.
 The model will clustering algorithms based on important features present in the data set.
 It will handle complex data and capture relationships effectively.
6
Objectives of the project
 Identify Meaningful Customer Segments.
 Understand Customer Preferences and Behavior.
 Personalize Marketing and Offerings.
 Improve Customer Retention and Loyalty.
 Optimize Marketing Resource Allocation.
 Enhance Customer Experience.
 Gain Competitive Advantage.
 Measure and Evaluate Cluster Quality.
 Provide Actionable Insights and Recommendations.
 Monitor and adapt.
7
Scope of the Project
 Data Collection.
 Data Preprocessing.
 Feature Selection.
 Algorithm Selection.
 Clustering Implementation.
 Evaluation and Validation.
 Interpretation and Insights.
 Visualization and Reporting.
 Recommendations and Actionable Strategies.
8
Tools & Resources
 Google Colab, platform was used for coding and model development.
 Python Programming language was utilized within the Google Colab environment.
 Pandas, NumPy, and Scikit-learn libraries were employed for data processing,
manipulation, and machine-learning tasks.
 Matplotlib and Seaborn libraries were used for data visualization within the Google
Colab Notebook.
Methodology
9
Result
10
11
12
13
14
15
16
Conclusion
customer segmentation using K-means and DBSCAN algorithms, along with the validation of the
segmentation through Silhouette score, provides businesses with a powerful framework for
understanding their customer base, identifying distinct customer groups, and making data-driven
decisions to optimize their marketing efforts.
 High-income, high-spending customers: Target them as they have more money to spend.
 High-income, low-spending customers: Engage with them by seeking feedback and improving
advertising to increase their spending.
 Average-income, average-spending customers: May or may not be beneficial to mall owners. It
depends on individual circumstances.
 Low-income, high-spending customers: Target them by offering affordable payment options like low-
cost EMI plans.
 Low-income, low-spending customers: Avoid targeting them as they have limited income and spend
less.
17
Future Scope
 Feature Engineering: Explore and incorporate additional customer features or variables that can
provide richer insights for segmentation.
 Algorithm Enhancement: Assess their performance in customer segmentation and compare them with
the existing methods.
 Dynamic Segmentation: This could involve developing an automated system that continuously analyses
and updates clusters based on new data.
 Predictive Analytics: Apply predictive modelling techniques to forecast future customer behavior
within each segment.
 Multi-channel Analysis: Extend the segmentation analysis to incorporate multiple channels, such as
offline and online customer interactions.
 Segmentation Visualization: Develop visualizations and dashboards to effectively communicate the
segmented customer groups and their characteristics to stakeholders within the organization. This can
aid in decision-making, resource allocation, and marketing strategy formulation.
18
References
1. V. Vijilesh, A. Harini, M. Hari Dharshini, R. Priyadharshini (May 2021). Customer Segmentation Using Machine
Learning. https://www.irjet.net/archives/V8/i5/IRJET-V8I5163.pdf
2. Tushar Kansal, Suraj Bahuguna, Vishal Singh, Tanupriya Choudhury (June 2018). Customer Segmentation
using K-means Clustering. https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8769171&tag=1
3. Mr. M. Sathyanarayana, S. Dhanish, P. Shiva Kumar, A. Niranjan Reddy (December 2022). Mall Customer
Segmentation Using Clustering Algorithm. https://www.ijraset.com/research-paper/mall-customer-
segmentation-usingclustering-algorithm
4. Hemashree Kilari, Sailesh Edara, Guna Ratna Sai Yarra, Dileep Varma Gadhiraju (March 2022). Customer
Segmentation using K-Means Clustering. https://www.ijert.org/research/customer-segmentation-using-k-
meansclustering-IJERTV11IS030152.pdf
5. Pavithra M, Ayushman Prashar, Abirami (July 2022). Maximizing Strategy in Customer Segmentation Using
Different Clustering Techniques. https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9774200
6. Mathesh T, Sumathy G, Maheshwari A ( May 2023). A Machine Learning approach to segment the customers
of online sales data for better and efficient marketing purposes.
https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10084339
19

More Related Content

Similar to YELUBANDI ARAVIND-PPT-FODS (1).pptx

YouGov Plan and Track.pptx
YouGov Plan and Track.pptxYouGov Plan and Track.pptx
YouGov Plan and Track.pptx
CaitlinB3
 
Break through the Analytics Barrier
Break through the Analytics BarrierBreak through the Analytics Barrier
Break through the Analytics Barrier
Cognizant
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?
Seerat Malik
 
Big Data Analytics for Predicting Consumer Behaviour
Big Data Analytics for Predicting Consumer BehaviourBig Data Analytics for Predicting Consumer Behaviour
Big Data Analytics for Predicting Consumer Behaviour
IRJET Journal
 
Data Mining Concepts with Customer Relationship Management
Data Mining Concepts with Customer Relationship ManagementData Mining Concepts with Customer Relationship Management
Data Mining Concepts with Customer Relationship Management
IJERA Editor
 
AHP Based Data Mining for Customer Segmentation Based on Customer Lifetime Value
AHP Based Data Mining for Customer Segmentation Based on Customer Lifetime ValueAHP Based Data Mining for Customer Segmentation Based on Customer Lifetime Value
AHP Based Data Mining for Customer Segmentation Based on Customer Lifetime Value
IIRindia
 
IRJET-User Profile based Behavior Identificaton using Data Mining Technique
IRJET-User Profile based Behavior Identificaton using Data Mining TechniqueIRJET-User Profile based Behavior Identificaton using Data Mining Technique
IRJET-User Profile based Behavior Identificaton using Data Mining Technique
IRJET Journal
 
Ankita's resume
Ankita's resume Ankita's resume
Ankita's resume
Ankita Ghosh
 
Driving Customer Loyalty with Azure Machine Learning
Driving Customer Loyalty with Azure Machine LearningDriving Customer Loyalty with Azure Machine Learning
Driving Customer Loyalty with Azure Machine Learning
CCG
 
How to Create a Data Analytics Roadmap
How to Create a Data Analytics RoadmapHow to Create a Data Analytics Roadmap
How to Create a Data Analytics Roadmap
CCG
 
Improved Customer Churn Behaviour by using SVM
Improved Customer Churn Behaviour by using SVMImproved Customer Churn Behaviour by using SVM
Improved Customer Churn Behaviour by using SVM
IRJET Journal
 
Customer_Setntiment_Analysis_LAST_YEAR_PROJECT.pptx
Customer_Setntiment_Analysis_LAST_YEAR_PROJECT.pptxCustomer_Setntiment_Analysis_LAST_YEAR_PROJECT.pptx
Customer_Setntiment_Analysis_LAST_YEAR_PROJECT.pptx
vipushmangacseds2020
 
Clustering customer data dr sankar rajagopal
Clustering customer data   dr sankar rajagopalClustering customer data   dr sankar rajagopal
Clustering customer data dr sankar rajagopal
Dr.Sankar Rajagopal
 
Most Demanding Freelancing Skills in 2024 - freelancingtools.com-.pdf
Most Demanding Freelancing Skills in 2024 - freelancingtools.com-.pdfMost Demanding Freelancing Skills in 2024 - freelancingtools.com-.pdf
Most Demanding Freelancing Skills in 2024 - freelancingtools.com-.pdf
abdulldr86
 
Smart Data Module 4 d drive_business models
Smart Data Module 4 d drive_business modelsSmart Data Module 4 d drive_business models
Smart Data Module 4 d drive_business models
caniceconsulting
 
Final pm ppt.pptx
Final pm ppt.pptxFinal pm ppt.pptx
Final pm ppt.pptx
RitikaUppal4
 
Leveraging Artificial Intelligence in Marketing and Advertising: Unleashing t...
Leveraging Artificial Intelligence in Marketing and Advertising: Unleashing t...Leveraging Artificial Intelligence in Marketing and Advertising: Unleashing t...
Leveraging Artificial Intelligence in Marketing and Advertising: Unleashing t...
IRJET Journal
 
Machine Learning in Customer Analytics
Machine Learning in Customer AnalyticsMachine Learning in Customer Analytics
Machine Learning in Customer Analytics
Course5i
 
Business analytics course with NSE India certification
Business analytics course with NSE India certificationBusiness analytics course with NSE India certification
Business analytics course with NSE India certification
IMS Proschool
 
Business analytics course with NSE India Certification
Business analytics course with NSE India CertificationBusiness analytics course with NSE India Certification
Business analytics course with NSE India Certification
IMS Proschool
 

Similar to YELUBANDI ARAVIND-PPT-FODS (1).pptx (20)

YouGov Plan and Track.pptx
YouGov Plan and Track.pptxYouGov Plan and Track.pptx
YouGov Plan and Track.pptx
 
Break through the Analytics Barrier
Break through the Analytics BarrierBreak through the Analytics Barrier
Break through the Analytics Barrier
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?
 
Big Data Analytics for Predicting Consumer Behaviour
Big Data Analytics for Predicting Consumer BehaviourBig Data Analytics for Predicting Consumer Behaviour
Big Data Analytics for Predicting Consumer Behaviour
 
Data Mining Concepts with Customer Relationship Management
Data Mining Concepts with Customer Relationship ManagementData Mining Concepts with Customer Relationship Management
Data Mining Concepts with Customer Relationship Management
 
AHP Based Data Mining for Customer Segmentation Based on Customer Lifetime Value
AHP Based Data Mining for Customer Segmentation Based on Customer Lifetime ValueAHP Based Data Mining for Customer Segmentation Based on Customer Lifetime Value
AHP Based Data Mining for Customer Segmentation Based on Customer Lifetime Value
 
IRJET-User Profile based Behavior Identificaton using Data Mining Technique
IRJET-User Profile based Behavior Identificaton using Data Mining TechniqueIRJET-User Profile based Behavior Identificaton using Data Mining Technique
IRJET-User Profile based Behavior Identificaton using Data Mining Technique
 
Ankita's resume
Ankita's resume Ankita's resume
Ankita's resume
 
Driving Customer Loyalty with Azure Machine Learning
Driving Customer Loyalty with Azure Machine LearningDriving Customer Loyalty with Azure Machine Learning
Driving Customer Loyalty with Azure Machine Learning
 
How to Create a Data Analytics Roadmap
How to Create a Data Analytics RoadmapHow to Create a Data Analytics Roadmap
How to Create a Data Analytics Roadmap
 
Improved Customer Churn Behaviour by using SVM
Improved Customer Churn Behaviour by using SVMImproved Customer Churn Behaviour by using SVM
Improved Customer Churn Behaviour by using SVM
 
Customer_Setntiment_Analysis_LAST_YEAR_PROJECT.pptx
Customer_Setntiment_Analysis_LAST_YEAR_PROJECT.pptxCustomer_Setntiment_Analysis_LAST_YEAR_PROJECT.pptx
Customer_Setntiment_Analysis_LAST_YEAR_PROJECT.pptx
 
Clustering customer data dr sankar rajagopal
Clustering customer data   dr sankar rajagopalClustering customer data   dr sankar rajagopal
Clustering customer data dr sankar rajagopal
 
Most Demanding Freelancing Skills in 2024 - freelancingtools.com-.pdf
Most Demanding Freelancing Skills in 2024 - freelancingtools.com-.pdfMost Demanding Freelancing Skills in 2024 - freelancingtools.com-.pdf
Most Demanding Freelancing Skills in 2024 - freelancingtools.com-.pdf
 
Smart Data Module 4 d drive_business models
Smart Data Module 4 d drive_business modelsSmart Data Module 4 d drive_business models
Smart Data Module 4 d drive_business models
 
Final pm ppt.pptx
Final pm ppt.pptxFinal pm ppt.pptx
Final pm ppt.pptx
 
Leveraging Artificial Intelligence in Marketing and Advertising: Unleashing t...
Leveraging Artificial Intelligence in Marketing and Advertising: Unleashing t...Leveraging Artificial Intelligence in Marketing and Advertising: Unleashing t...
Leveraging Artificial Intelligence in Marketing and Advertising: Unleashing t...
 
Machine Learning in Customer Analytics
Machine Learning in Customer AnalyticsMachine Learning in Customer Analytics
Machine Learning in Customer Analytics
 
Business analytics course with NSE India certification
Business analytics course with NSE India certificationBusiness analytics course with NSE India certification
Business analytics course with NSE India certification
 
Business analytics course with NSE India Certification
Business analytics course with NSE India CertificationBusiness analytics course with NSE India Certification
Business analytics course with NSE India Certification
 

Recently uploaded

Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
correoyaya
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
ukgaet
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
MaleehaSheikh2
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
theahmadsaood
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
NABLAS株式会社
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
StarCompliance.io
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 

Recently uploaded (20)

Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 

YELUBANDI ARAVIND-PPT-FODS (1).pptx

  • 1. FOUNDATIONS OF DATA SCIENCE Course Based Project CUSTOMER SEGMENTATION USING K-MEANS CLUSTERING NAME: Yelubandi Aravind Course: M.Tech (Embedded Systems) Roll No: UCC22ECES11 1 National Institute of Electronics and Information Technology Ministry of Electronics and Information Technology, Government of India Calicut, Kerala-673601
  • 2. 2 Background of the project  Nowadays, success in business relies on generating innovative ideas, especially because there are many potential customers who are unsure about what products or services to choose.  This is where machine learning comes into play, by applying various algorithms, we can uncover hidden patterns in data, enabling better decision making.  To achieve this, we can use a clustering technique called the K-means and DBSCAN algorithms, which divides customers into clusters based on similarities.  To determine the optimal number of clusters, we can utilize the elbow method, which helps us find the right balance between capturing enough distinct segments while avoiding excessive complexity.  we doing Silhouette Score validation for checking number of clusters determined through elbow method.
  • 3. Literature Survey 3 S.No CITATION KEY IDEA/APPROACH LIMITATIONS/REMARKS 1 R. Gupta, A. Verma and H. O. Topal, "Customer Segmentation of Indian restaurants on the basis of geographical locations using Machine Learning," 2021 International Conference on Technological Advancements and Innovations (ICTAI), Tashkent, Uzbekistan, 2021, pp. 382-387, doi: 10.1109/ICTAI53825.2021.9673153. In this paper, They have implemented the K-Means clustering algorithm on the dataset of all the restaurants present in Bangalore based in Python Language. 2 Tushar Kansal, Suraj Bahuguna, Vishal Singh, Tanupriya Choudhury (June 2018). Customer Segmentation using K-means Clustering. 3 Mr. M. Sathyanarayana, S. Dhanish, P. Shiva Kumar, A. Niranjan Reddy (December 2022). Mall Customer Segmentation Using Clustering Algorithm.
  • 4. 4 S.No CITATION KEY IDEA/APPROACH LIMITATIONS/REMARKS 4 Hemashree Kilari, Sailesh Edara, Guna Ratna Sai Yarra, Dileep Varma Gadhiraju (March 2022). Customer Segmentation using K-Means Clustering. 5 Pavithra M, Ayushman Prashar, Abirami (July 2022). Maximizing Strategy in Customer Segmentation Using Different Clustering Techniques. 6 Mathesh T, Sumathy G, Maheshwari A ( May 2023). A Machine Learning approach to segment the customers of online sales data for better and efficient marketing purposes.
  • 5. 5 Problem Statement  In Business the owners wants to know the data like which type of customers are coming for purchasing. So, to get this type of data using dataset that they have is complex.  The dataset that the Business units having is unsupervised data so it is complex to derive required data from that dataset.  To overcome this, In machine learning there is method called clustering which will divide customers into different groups based on their purchasing history.  The project aims to create a simple machine-learning model in Python for customers segmentation using K-means and DBSCAN algorithms in clustering.  The model will clustering algorithms based on important features present in the data set.  It will handle complex data and capture relationships effectively.
  • 6. 6 Objectives of the project  Identify Meaningful Customer Segments.  Understand Customer Preferences and Behavior.  Personalize Marketing and Offerings.  Improve Customer Retention and Loyalty.  Optimize Marketing Resource Allocation.  Enhance Customer Experience.  Gain Competitive Advantage.  Measure and Evaluate Cluster Quality.  Provide Actionable Insights and Recommendations.  Monitor and adapt.
  • 7. 7 Scope of the Project  Data Collection.  Data Preprocessing.  Feature Selection.  Algorithm Selection.  Clustering Implementation.  Evaluation and Validation.  Interpretation and Insights.  Visualization and Reporting.  Recommendations and Actionable Strategies.
  • 8. 8 Tools & Resources  Google Colab, platform was used for coding and model development.  Python Programming language was utilized within the Google Colab environment.  Pandas, NumPy, and Scikit-learn libraries were employed for data processing, manipulation, and machine-learning tasks.  Matplotlib and Seaborn libraries were used for data visualization within the Google Colab Notebook.
  • 11. 11
  • 12. 12
  • 13. 13
  • 14. 14
  • 15. 15
  • 16. 16 Conclusion customer segmentation using K-means and DBSCAN algorithms, along with the validation of the segmentation through Silhouette score, provides businesses with a powerful framework for understanding their customer base, identifying distinct customer groups, and making data-driven decisions to optimize their marketing efforts.  High-income, high-spending customers: Target them as they have more money to spend.  High-income, low-spending customers: Engage with them by seeking feedback and improving advertising to increase their spending.  Average-income, average-spending customers: May or may not be beneficial to mall owners. It depends on individual circumstances.  Low-income, high-spending customers: Target them by offering affordable payment options like low- cost EMI plans.  Low-income, low-spending customers: Avoid targeting them as they have limited income and spend less.
  • 17. 17 Future Scope  Feature Engineering: Explore and incorporate additional customer features or variables that can provide richer insights for segmentation.  Algorithm Enhancement: Assess their performance in customer segmentation and compare them with the existing methods.  Dynamic Segmentation: This could involve developing an automated system that continuously analyses and updates clusters based on new data.  Predictive Analytics: Apply predictive modelling techniques to forecast future customer behavior within each segment.  Multi-channel Analysis: Extend the segmentation analysis to incorporate multiple channels, such as offline and online customer interactions.  Segmentation Visualization: Develop visualizations and dashboards to effectively communicate the segmented customer groups and their characteristics to stakeholders within the organization. This can aid in decision-making, resource allocation, and marketing strategy formulation.
  • 18. 18 References 1. V. Vijilesh, A. Harini, M. Hari Dharshini, R. Priyadharshini (May 2021). Customer Segmentation Using Machine Learning. https://www.irjet.net/archives/V8/i5/IRJET-V8I5163.pdf 2. Tushar Kansal, Suraj Bahuguna, Vishal Singh, Tanupriya Choudhury (June 2018). Customer Segmentation using K-means Clustering. https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8769171&tag=1 3. Mr. M. Sathyanarayana, S. Dhanish, P. Shiva Kumar, A. Niranjan Reddy (December 2022). Mall Customer Segmentation Using Clustering Algorithm. https://www.ijraset.com/research-paper/mall-customer- segmentation-usingclustering-algorithm 4. Hemashree Kilari, Sailesh Edara, Guna Ratna Sai Yarra, Dileep Varma Gadhiraju (March 2022). Customer Segmentation using K-Means Clustering. https://www.ijert.org/research/customer-segmentation-using-k- meansclustering-IJERTV11IS030152.pdf 5. Pavithra M, Ayushman Prashar, Abirami (July 2022). Maximizing Strategy in Customer Segmentation Using Different Clustering Techniques. https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9774200 6. Mathesh T, Sumathy G, Maheshwari A ( May 2023). A Machine Learning approach to segment the customers of online sales data for better and efficient marketing purposes. https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10084339
  • 19. 19