SlideShare a Scribd company logo
1 of 13
K-MEANS CLUSTERING
WITH ORANGE
IDENTIFY CUSTOMER SEGMENTS
OF A SOCIAL ENTERPRISE TO
CREATE CUSTOMER OFFERS FOR EACH SEGMENT
AUTHOR: ANTHONY MOK
DATE: 18 NOV 2023
EMAIL: XXIAOHAO@YAHOO.COM
WHAT IS ORANGE
Open-source
and Extensible
Freely available,
adaptable, and
customisable
data mining tool
Visual
Programming
Drag-and-drop
interface for
building data
analysis
workflows
Interactive Data
Exploration
Quickly
understand data
patterns and
trends using
visualisations
Wide Range of
Data Mining
Algorithms
Identify patterns,
make predictions,
and solve data mining
problems
PROJECT’S CONTEXT, OBJECTIVE & STRATEGIES
To identify customer
segments to customised
offers for each segment
Social Enterprise
collected data on
customers & wants to make
insight-informed decisions
• Explore & Clean data for
analysis
• Perform K-Means Clustering,
in Orange, to find possible
segments in the customer
data
• Tune the model to improve its
performance
• Visualise the findings, share
conclusions, and give insight-
driven recommendations
EXPLORATORY DATA ANALYSIS
Findings
• Target = Recency_in_Day
• Provides insights into customer behavior,
preferences, and churn risk
• Feature Columns = 9
• Instances = 2,240
• Blanks & Outliers
Age Column Income Column
23 Blanks -
1 Outlier 3 Outliers
K-MEANS CLUSTERING WORKFLOW IN ORANGE
LOADING DATA & DEALING WITH BLANKS
Customer.csv file imported into
workflow with the ‘Role’ of
Recency_days set as ‘Target’,‘ID’ as
“meta’, with the rest as ‘features’
Exploratory Data Analysis (EDA) was
considered, and blanks are imputed
by ‘Average’ of sum of values in the
‘Income’ column
EXAMINING RELATIONSHIPS & PATTERNS
Scatter Plots were created
to explore the relationships
and patterns in the dataset
‘Recency_days’is the ‘Target’
with Four feature columns
selected for the model:
‘Income’ & ‘Age’ (Numerical
Data) & ‘Marital Status’ &
‘Education’, since these are
more informative
IDENTIFYING IDEAL NUMBER OF CLUSTERS
• To determine the ideal number of
clusters, the Silhouette Scores in the
range of 2 to 12 clusters were
calculated
• Overall, the Silhouette Scores are
positive, but relatively low, suggesting
the clustering is fair, but there is still
some overlaps between clusters
• Clustering parameters can be
adjusted to improve the separation
between clusters
BOOSTING MODEL’S PERFORMANCE & LIMITATIONS
• By default,‘K-Means++’ & ‘Normalise Columns’
are enabled in the Hyperparameters
• So only ‘Maximum Iterations’ was set to 100,000
(from 300) and ‘Re-runs’ at 100 (from 10) to
boost the performance of the model
• But the Silhouette Scores haven’t improved in
the range of 2 to 12 clusters after these changes,
suggesting that the K-Means Clustering
Algorithm has converged to a stable solution
BOOSTING MODEL’S PERFORMANCE & LIMITATIONS
In this stable state, scores can be
increased at the upper ranges of
the clusters, but will result to
overfitting the model to the dataset
To avoid this outcome, the
conservative number of 3 Clusters
was chosen (Silhouette Score =
0.217) instead
FINDINGS & CONCLUSIONS
• Maximum income of customer base is
$100,000/annum
• For customers in the age range of 30 to 55, half of
these earned below $50,000/annum, who could
be price sensitive and are bargain hunters, while
the other half earned above this threshold, who
may be able to pay a premium for quality
• Higher concentration of customers is found to
have undergraduate degrees, who are more
educated, and they are separated equally into
two clusters: singles, with more ability for
discretionary spending, and married couples,
with less spending power given children/teens in
their households
• Customers above 55 are even distributed across
all income groups
* More comprehensive findings and conclusions were provided in the project report, which
are not released at the request of the Social Enterprise
RECOMMENDATIONS*
Segment 1 - Customers in the age range of 30 to 55
who earned below $50,000/annum
• Offer value-for-money products and services
• Highlight discounts and promotions
• Offer bundle deals and loyalty programs
• Target them with personalised marketing campaigns
based on their purchase history and interests
* More recommendations were provided for each identified cluster in the project report,
which are not released at the request of the Social Enterprise
Segment 3 - Customers with undergraduate degrees
• Offer educational and informative content
• Highlight the benefits of products and services for their
careers and personal development
• Partner with other businesses that offer complementary
products and services
• Target them with personalised marketing campaigns
based on their interests and areas of expertise
K-MEANS CLUSTERING
WITH ORANGE
IDENTIFY CUSTOMER SEGMENTS
OF A SOCIAL ENTERPRISE TO
CREATE CUSTOMER OFFERS FOR EACH SEGMENT
AUTHOR: ANTHONY MOK
DATE: 18 NOV 2023
EMAIL: XXIAOHAO@YAHOO.COM

More Related Content

Similar to Identify Customer Segments to Create Customer Offers for Each Segment - Application of K-Means Clustering With Orange

MonetizingStatistics
MonetizingStatisticsMonetizingStatistics
MonetizingStatistics
Aaron Sankey
 
Liferay overview of predicitve analytics
Liferay overview of predicitve analyticsLiferay overview of predicitve analytics
Liferay overview of predicitve analytics
Joe Brandenburg
 
MA- UNIT -1.pptx for ipu bba sem 5, complete pdf
MA- UNIT -1.pptx for ipu bba sem 5, complete pdfMA- UNIT -1.pptx for ipu bba sem 5, complete pdf
MA- UNIT -1.pptx for ipu bba sem 5, complete pdf
zm2pfgpcdt
 

Similar to Identify Customer Segments to Create Customer Offers for Each Segment - Application of K-Means Clustering With Orange (20)

Predictive Analytics Demystified
Predictive Analytics DemystifiedPredictive Analytics Demystified
Predictive Analytics Demystified
 
Unit I-Final MArketing analytics unit 1 ppt
Unit I-Final MArketing analytics unit 1 pptUnit I-Final MArketing analytics unit 1 ppt
Unit I-Final MArketing analytics unit 1 ppt
 
image classifier.pptx
image classifier.pptximage classifier.pptx
image classifier.pptx
 
MonetizingStatistics
MonetizingStatisticsMonetizingStatistics
MonetizingStatistics
 
dasat
dasatdasat
dasat
 
Liferay overview of predicitve analytics
Liferay overview of predicitve analyticsLiferay overview of predicitve analytics
Liferay overview of predicitve analytics
 
Employee Churn Prediction: Artificial Intelligence Project Presentation
Employee Churn Prediction: Artificial Intelligence Project PresentationEmployee Churn Prediction: Artificial Intelligence Project Presentation
Employee Churn Prediction: Artificial Intelligence Project Presentation
 
Bank Marketing Analysis: Data Analysis Project
Bank Marketing Analysis: Data Analysis ProjectBank Marketing Analysis: Data Analysis Project
Bank Marketing Analysis: Data Analysis Project
 
Customer Segmentation
Customer SegmentationCustomer Segmentation
Customer Segmentation
 
Customer Analytics Overview
Customer Analytics OverviewCustomer Analytics Overview
Customer Analytics Overview
 
Recency/Frequency and Predictive Analytics in the gaming industry
Recency/Frequency and Predictive Analytics in the gaming industryRecency/Frequency and Predictive Analytics in the gaming industry
Recency/Frequency and Predictive Analytics in the gaming industry
 
Hair_EOMA_1e_Chap001_PPT.pptx
Hair_EOMA_1e_Chap001_PPT.pptxHair_EOMA_1e_Chap001_PPT.pptx
Hair_EOMA_1e_Chap001_PPT.pptx
 
MA- UNIT -1.pptx for ipu bba sem 5, complete pdf
MA- UNIT -1.pptx for ipu bba sem 5, complete pdfMA- UNIT -1.pptx for ipu bba sem 5, complete pdf
MA- UNIT -1.pptx for ipu bba sem 5, complete pdf
 
Business analytics & strategy
Business analytics & strategyBusiness analytics & strategy
Business analytics & strategy
 
Stark Consulting Services Inc.- BDAS Capabilities Presentation
Stark Consulting Services Inc.- BDAS Capabilities PresentationStark Consulting Services Inc.- BDAS Capabilities Presentation
Stark Consulting Services Inc.- BDAS Capabilities Presentation
 
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjnWHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
 
E-commerce Berlin Expo 2018 - How to boost your online sales using machine le...
E-commerce Berlin Expo 2018 - How to boost your online sales using machine le...E-commerce Berlin Expo 2018 - How to boost your online sales using machine le...
E-commerce Berlin Expo 2018 - How to boost your online sales using machine le...
 
Leveraging Data Analysis for Sales
Leveraging Data Analysis for SalesLeveraging Data Analysis for Sales
Leveraging Data Analysis for Sales
 
Predictive Price Optimization 2018 - Achilles - QueBIT Consulting
Predictive Price Optimization 2018 - Achilles - QueBIT ConsultingPredictive Price Optimization 2018 - Achilles - QueBIT Consulting
Predictive Price Optimization 2018 - Achilles - QueBIT Consulting
 
BI Knowledge Sharing Session 1
BI Knowledge Sharing Session 1BI Knowledge Sharing Session 1
BI Knowledge Sharing Session 1
 

More from ThinkInnovation

Decision Making Under Uncertainty - Decide Whether Or Not to Take Precautions
Decision Making Under Uncertainty - Decide Whether Or Not to Take PrecautionsDecision Making Under Uncertainty - Decide Whether Or Not to Take Precautions
Decision Making Under Uncertainty - Decide Whether Or Not to Take Precautions
ThinkInnovation
 
Breakfast Talk - Manage Projects
Breakfast Talk - Manage ProjectsBreakfast Talk - Manage Projects
Breakfast Talk - Manage Projects
ThinkInnovation
 
Think innovation issue 4 share - scamper
Think innovation issue 4   share - scamperThink innovation issue 4   share - scamper
Think innovation issue 4 share - scamper
ThinkInnovation
 

More from ThinkInnovation (19)

Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
 
Ordinary Least Square Regression & Stage-2 Regression - Factors Influencing M...
Ordinary Least Square Regression & Stage-2 Regression - Factors Influencing M...Ordinary Least Square Regression & Stage-2 Regression - Factors Influencing M...
Ordinary Least Square Regression & Stage-2 Regression - Factors Influencing M...
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
 
Decision Making Under Uncertainty - Predict the Chances of a Person Suffering...
Decision Making Under Uncertainty - Predict the Chances of a Person Suffering...Decision Making Under Uncertainty - Predict the Chances of a Person Suffering...
Decision Making Under Uncertainty - Predict the Chances of a Person Suffering...
 
Decision Making Under Uncertainty - Is It Better Off Joining a Partnership or...
Decision Making Under Uncertainty - Is It Better Off Joining a Partnership or...Decision Making Under Uncertainty - Is It Better Off Joining a Partnership or...
Decision Making Under Uncertainty - Is It Better Off Joining a Partnership or...
 
Predictive Analysis - Using Insight-informed Data to Plan Inventory in Next 6...
Predictive Analysis - Using Insight-informed Data to Plan Inventory in Next 6...Predictive Analysis - Using Insight-informed Data to Plan Inventory in Next 6...
Predictive Analysis - Using Insight-informed Data to Plan Inventory in Next 6...
 
Decision Making Under Uncertainty - Decide Whether Or Not to Take Precautions
Decision Making Under Uncertainty - Decide Whether Or Not to Take PrecautionsDecision Making Under Uncertainty - Decide Whether Or Not to Take Precautions
Decision Making Under Uncertainty - Decide Whether Or Not to Take Precautions
 
Optimal Decision Making - Cost Reduction in Logistics
Optimal Decision Making - Cost Reduction in LogisticsOptimal Decision Making - Cost Reduction in Logistics
Optimal Decision Making - Cost Reduction in Logistics
 
Create Data Model & Conduct Visualisation in Power BI Desktop
Create Data Model & Conduct Visualisation in Power BI DesktopCreate Data Model & Conduct Visualisation in Power BI Desktop
Create Data Model & Conduct Visualisation in Power BI Desktop
 
Using DAX & Time-based Analysis in Data Warehouse
Using DAX & Time-based Analysis in Data WarehouseUsing DAX & Time-based Analysis in Data Warehouse
Using DAX & Time-based Analysis in Data Warehouse
 
Creating Data Warehouse Using Power Query & Power Pivot
Creating Data Warehouse Using Power Query & Power PivotCreating Data Warehouse Using Power Query & Power Pivot
Creating Data Warehouse Using Power Query & Power Pivot
 
Unlocking New Insights Into the World of European Soccer Through the European...
Unlocking New Insights Into the World of European Soccer Through the European...Unlocking New Insights Into the World of European Soccer Through the European...
Unlocking New Insights Into the World of European Soccer Through the European...
 
Breakfast Talk - Manage Projects
Breakfast Talk - Manage ProjectsBreakfast Talk - Manage Projects
Breakfast Talk - Manage Projects
 
Think innovation issue 4 share - scamper
Think innovation issue 4   share - scamperThink innovation issue 4   share - scamper
Think innovation issue 4 share - scamper
 
SCAMPER
SCAMPERSCAMPER
SCAMPER
 
Reverse Assumption Method
Reverse Assumption MethodReverse Assumption Method
Reverse Assumption Method
 
Psyche of Facilitation - The New Language of Facilitating Conversations
Psyche of Facilitation - The New Language of Facilitating ConversationsPsyche of Facilitation - The New Language of Facilitating Conversations
Psyche of Facilitation - The New Language of Facilitating Conversations
 
Visual Connection - Ideation Through Word Association
Visual Connection - Ideation Through Word AssociationVisual Connection - Ideation Through Word Association
Visual Connection - Ideation Through Word Association
 

Recently uploaded

Fuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertaintyFuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertainty
RafigAliyev2
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理
cyebo
 
Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptx
DilipVasan
 
一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理
cyebo
 
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotecAbortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理
pyhepag
 
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
pyhepag
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理
pyhepag
 

Recently uploaded (20)

Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
 
Fuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertaintyFuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertainty
 
Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理
 
Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptx
 
一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理
 
AI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdfAI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdf
 
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotecAbortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
 
basics of data science with application areas.pdf
basics of data science with application areas.pdfbasics of data science with application areas.pdf
basics of data science with application areas.pdf
 
2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting
 
Data analytics courses in Nepal Presentation
Data analytics courses in Nepal PresentationData analytics courses in Nepal Presentation
Data analytics courses in Nepal Presentation
 
一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理
 
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理
 
Easy and simple project file on mp online
Easy and simple project file on mp onlineEasy and simple project file on mp online
Easy and simple project file on mp online
 
2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call
 
Generative AI for Trailblazers_ Unlock the Future of AI.pdf
Generative AI for Trailblazers_ Unlock the Future of AI.pdfGenerative AI for Trailblazers_ Unlock the Future of AI.pdf
Generative AI for Trailblazers_ Unlock the Future of AI.pdf
 
Pre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxPre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptx
 
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
 
How I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prisonHow I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prison
 

Identify Customer Segments to Create Customer Offers for Each Segment - Application of K-Means Clustering With Orange

  • 1. K-MEANS CLUSTERING WITH ORANGE IDENTIFY CUSTOMER SEGMENTS OF A SOCIAL ENTERPRISE TO CREATE CUSTOMER OFFERS FOR EACH SEGMENT AUTHOR: ANTHONY MOK DATE: 18 NOV 2023 EMAIL: XXIAOHAO@YAHOO.COM
  • 2. WHAT IS ORANGE Open-source and Extensible Freely available, adaptable, and customisable data mining tool Visual Programming Drag-and-drop interface for building data analysis workflows Interactive Data Exploration Quickly understand data patterns and trends using visualisations Wide Range of Data Mining Algorithms Identify patterns, make predictions, and solve data mining problems
  • 3. PROJECT’S CONTEXT, OBJECTIVE & STRATEGIES To identify customer segments to customised offers for each segment Social Enterprise collected data on customers & wants to make insight-informed decisions • Explore & Clean data for analysis • Perform K-Means Clustering, in Orange, to find possible segments in the customer data • Tune the model to improve its performance • Visualise the findings, share conclusions, and give insight- driven recommendations
  • 4. EXPLORATORY DATA ANALYSIS Findings • Target = Recency_in_Day • Provides insights into customer behavior, preferences, and churn risk • Feature Columns = 9 • Instances = 2,240 • Blanks & Outliers Age Column Income Column 23 Blanks - 1 Outlier 3 Outliers
  • 6. LOADING DATA & DEALING WITH BLANKS Customer.csv file imported into workflow with the ‘Role’ of Recency_days set as ‘Target’,‘ID’ as “meta’, with the rest as ‘features’ Exploratory Data Analysis (EDA) was considered, and blanks are imputed by ‘Average’ of sum of values in the ‘Income’ column
  • 7. EXAMINING RELATIONSHIPS & PATTERNS Scatter Plots were created to explore the relationships and patterns in the dataset ‘Recency_days’is the ‘Target’ with Four feature columns selected for the model: ‘Income’ & ‘Age’ (Numerical Data) & ‘Marital Status’ & ‘Education’, since these are more informative
  • 8. IDENTIFYING IDEAL NUMBER OF CLUSTERS • To determine the ideal number of clusters, the Silhouette Scores in the range of 2 to 12 clusters were calculated • Overall, the Silhouette Scores are positive, but relatively low, suggesting the clustering is fair, but there is still some overlaps between clusters • Clustering parameters can be adjusted to improve the separation between clusters
  • 9. BOOSTING MODEL’S PERFORMANCE & LIMITATIONS • By default,‘K-Means++’ & ‘Normalise Columns’ are enabled in the Hyperparameters • So only ‘Maximum Iterations’ was set to 100,000 (from 300) and ‘Re-runs’ at 100 (from 10) to boost the performance of the model • But the Silhouette Scores haven’t improved in the range of 2 to 12 clusters after these changes, suggesting that the K-Means Clustering Algorithm has converged to a stable solution
  • 10. BOOSTING MODEL’S PERFORMANCE & LIMITATIONS In this stable state, scores can be increased at the upper ranges of the clusters, but will result to overfitting the model to the dataset To avoid this outcome, the conservative number of 3 Clusters was chosen (Silhouette Score = 0.217) instead
  • 11. FINDINGS & CONCLUSIONS • Maximum income of customer base is $100,000/annum • For customers in the age range of 30 to 55, half of these earned below $50,000/annum, who could be price sensitive and are bargain hunters, while the other half earned above this threshold, who may be able to pay a premium for quality • Higher concentration of customers is found to have undergraduate degrees, who are more educated, and they are separated equally into two clusters: singles, with more ability for discretionary spending, and married couples, with less spending power given children/teens in their households • Customers above 55 are even distributed across all income groups * More comprehensive findings and conclusions were provided in the project report, which are not released at the request of the Social Enterprise
  • 12. RECOMMENDATIONS* Segment 1 - Customers in the age range of 30 to 55 who earned below $50,000/annum • Offer value-for-money products and services • Highlight discounts and promotions • Offer bundle deals and loyalty programs • Target them with personalised marketing campaigns based on their purchase history and interests * More recommendations were provided for each identified cluster in the project report, which are not released at the request of the Social Enterprise Segment 3 - Customers with undergraduate degrees • Offer educational and informative content • Highlight the benefits of products and services for their careers and personal development • Partner with other businesses that offer complementary products and services • Target them with personalised marketing campaigns based on their interests and areas of expertise
  • 13. K-MEANS CLUSTERING WITH ORANGE IDENTIFY CUSTOMER SEGMENTS OF A SOCIAL ENTERPRISE TO CREATE CUSTOMER OFFERS FOR EACH SEGMENT AUTHOR: ANTHONY MOK DATE: 18 NOV 2023 EMAIL: XXIAOHAO@YAHOO.COM