SlideShare a Scribd company logo
1 of 11
CLUSTERING
Stores Dataset
N RAMACHANDRAN
Data Exploration
• The percentage sales is the highest for the category 1 ,Fresh Foods
section(38%)
• The percentage sales is the lowest for the category 3 , Health and
Beauty section at 13%
• Average Sale is around 210 Rs/sqft.
Clustering Methodology used
• K – Means technique used to cluster the different stores.
• Worked to understand the techniques with different number of
clusters : 4 , 5 and 6 at the initial clustering level.
• The 4 initial cluster works the best with the different clusters having a
z score with a good difference from that of the population. The
cluster strengths are greater too .So this looks to be the optimal
number of cluster.
• One of the clusters is then again processed to get 3 more clusters .
Cluster Profiling
• Cluster 1
• No of stores in the cluster : 91
• Cluster Strength : 4.71
• The significant difference between the cluster and the population is the avg
sale amount.It is around 291 Rs/sqft compared to 210Rs/sqft for the
population.
• Frozen food higher than in the population.(28% > 25%)
• Tobacco sales are less compared to the population. (20% < 23%)
Cluster Profiling
• Cluster 2
• No of stores in the cluster : 151 , Cluster Strength : 3.10
• The biggest differentiator here is the Tobacco and Alcohol sections.(27.5% >
23% in the population)
• Average sale is significantly lower than that of the population(163 Rs/sqft <<
210 Rs/sqft)
• The fresh food sections sales are also very low compared to the overall
population.(32% << 38%)
• Health and Beauty sales is slightly better than the population(16.2% > 13.7%)
Cluster Profiling
• Cluster 3
• No of stores in the cluster : 3 , Cluster Strength : 8.33
• Very high difference between the average sales in these stores(471Rs/sqft >
210Rs/sqft)
• Tobacco and Alcohol forms considerably low percentage if compared to the
population.(18% < 23%)
• Fresh Foods among these stores has a higher share of the sales compared to
that in the overall population.
• Frozen food share also is a little greater than in the population.(26.4% >
24.9%)
• Looks to be an outlier group with very less no of stores and significantly
different characteristics from the population.
Cluster Profiling
• Cluster 4
• No of stores in the cluster : 270 .A much higher than expected no of
stores(100-140) in the cluster .
• Cluster Strength : 3.12
• Not very significant changes from the characteristics of the whole population.
• The highest differentiator being the Fresh Food Section ( 41.4% > 38.4%)
• The Tobacco and Alcohol section sales accounts a little less than the average
population(21.4% < 22.8%)
• Another iteration on the stores of this clusters is done to get more details.
Cluster Profiling
• Cluster 4_1
• No of stores in the cluster : 101 , Cluster Strength : 3.26
• Significant difference in the Fresh Food sales in this cluster compared to the
population.(45% > 38%)
• Average sales is around 184 Rs/sqft compared to the 210 Rs/sqft for the
population.
• Tobacco & Alcohol , Health & Beauty sections sales are a tad lower to the
overall population.
Cluster Profiling
• Cluster 4_2
• No of stores in the cluster : 85 , Cluster Strength : 3.42
• It has a very similar characteristics for all the store sections and average sales
when compared to the overall population.
Cluster Profiling
• Cluster 4_3
• No of stores in the cluster : 84 , Cluster Strength : 3.55
• Frozen Food section sales are a little lower than the overall population(20% <
22%)
• Sales/sqft a little higher than the population for the stores in this cluster (233
Rs/sqft > 210 Rs/sqft)
Recommendations
• We find that the cluster 3 with the 3 stores is an outlier .On further
check(as mentioned in the class videos) , its found to be the open
area outside the store to sell the fresh foods that is not accounted for.
• Hence , the stores can maybe try to sell the fresh food items in the
extra open area outside the stores.
• We would need more data to analyze why certain stores and sections
have greater sales in different clusters.

More Related Content

Similar to Cluster Analysis

Harvard business review:Natureview Farm
Harvard business review:Natureview FarmHarvard business review:Natureview Farm
Harvard business review:Natureview FarmSukkhadaJoshii
 
Ayush Srivastava NatureView Farm Case Study
Ayush Srivastava NatureView Farm Case StudyAyush Srivastava NatureView Farm Case Study
Ayush Srivastava NatureView Farm Case StudyAyush Srivastava
 
Natureview farm case study
Natureview farm case studyNatureview farm case study
Natureview farm case studyVeena Desai
 
Superior super market case study analysis
Superior super market case study analysisSuperior super market case study analysis
Superior super market case study analysiskkambli
 
Natureview Farm Harvard Case Analysis
Natureview Farm Harvard Case AnalysisNatureview Farm Harvard Case Analysis
Natureview Farm Harvard Case AnalysisShreyans Hinger
 
Birds, Bees and Buds: How to Talk About Cannabis and How to Label it Correctly
Birds, Bees and Buds:  How to Talk About Cannabis and How to Label it CorrectlyBirds, Bees and Buds:  How to Talk About Cannabis and How to Label it Correctly
Birds, Bees and Buds: How to Talk About Cannabis and How to Label it CorrectlyMarkus Roggen
 
Natureview case analysis
Natureview case analysisNatureview case analysis
Natureview case analysisAnkit Raj
 
Case 2 natureview farm case study by soumya jaiswal,nitrr
Case 2   natureview farm case study by soumya jaiswal,nitrrCase 2   natureview farm case study by soumya jaiswal,nitrr
Case 2 natureview farm case study by soumya jaiswal,nitrrSoumya Jaiswal
 

Similar to Cluster Analysis (13)

Harvard business review:Natureview Farm
Harvard business review:Natureview FarmHarvard business review:Natureview Farm
Harvard business review:Natureview Farm
 
Ayush Srivastava NatureView Farm Case Study
Ayush Srivastava NatureView Farm Case StudyAyush Srivastava NatureView Farm Case Study
Ayush Srivastava NatureView Farm Case Study
 
Natureview farm case study
Natureview farm case studyNatureview farm case study
Natureview farm case study
 
Superior super market case study analysis
Superior super market case study analysisSuperior super market case study analysis
Superior super market case study analysis
 
Crescent Pure
Crescent PureCrescent Pure
Crescent Pure
 
Natureview Farm Harvard Case Analysis
Natureview Farm Harvard Case AnalysisNatureview Farm Harvard Case Analysis
Natureview Farm Harvard Case Analysis
 
NATUREVIEW FARM
NATUREVIEW FARMNATUREVIEW FARM
NATUREVIEW FARM
 
Birds, Bees and Buds: How to Talk About Cannabis and How to Label it Correctly
Birds, Bees and Buds:  How to Talk About Cannabis and How to Label it CorrectlyBirds, Bees and Buds:  How to Talk About Cannabis and How to Label it Correctly
Birds, Bees and Buds: How to Talk About Cannabis and How to Label it Correctly
 
Natureview case analysis
Natureview case analysisNatureview case analysis
Natureview case analysis
 
Natureview frame
Natureview frameNatureview frame
Natureview frame
 
Case 2 natureview farm case study by soumya jaiswal,nitrr
Case 2   natureview farm case study by soumya jaiswal,nitrrCase 2   natureview farm case study by soumya jaiswal,nitrr
Case 2 natureview farm case study by soumya jaiswal,nitrr
 
Natureview case study
Natureview case studyNatureview case study
Natureview case study
 
Natureview farm case study
Natureview farm case studyNatureview farm case study
Natureview farm case study
 

Recently uploaded

RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 

Recently uploaded (20)

RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 

Cluster Analysis

  • 2. Data Exploration • The percentage sales is the highest for the category 1 ,Fresh Foods section(38%) • The percentage sales is the lowest for the category 3 , Health and Beauty section at 13% • Average Sale is around 210 Rs/sqft.
  • 3. Clustering Methodology used • K – Means technique used to cluster the different stores. • Worked to understand the techniques with different number of clusters : 4 , 5 and 6 at the initial clustering level. • The 4 initial cluster works the best with the different clusters having a z score with a good difference from that of the population. The cluster strengths are greater too .So this looks to be the optimal number of cluster. • One of the clusters is then again processed to get 3 more clusters .
  • 4. Cluster Profiling • Cluster 1 • No of stores in the cluster : 91 • Cluster Strength : 4.71 • The significant difference between the cluster and the population is the avg sale amount.It is around 291 Rs/sqft compared to 210Rs/sqft for the population. • Frozen food higher than in the population.(28% > 25%) • Tobacco sales are less compared to the population. (20% < 23%)
  • 5. Cluster Profiling • Cluster 2 • No of stores in the cluster : 151 , Cluster Strength : 3.10 • The biggest differentiator here is the Tobacco and Alcohol sections.(27.5% > 23% in the population) • Average sale is significantly lower than that of the population(163 Rs/sqft << 210 Rs/sqft) • The fresh food sections sales are also very low compared to the overall population.(32% << 38%) • Health and Beauty sales is slightly better than the population(16.2% > 13.7%)
  • 6. Cluster Profiling • Cluster 3 • No of stores in the cluster : 3 , Cluster Strength : 8.33 • Very high difference between the average sales in these stores(471Rs/sqft > 210Rs/sqft) • Tobacco and Alcohol forms considerably low percentage if compared to the population.(18% < 23%) • Fresh Foods among these stores has a higher share of the sales compared to that in the overall population. • Frozen food share also is a little greater than in the population.(26.4% > 24.9%) • Looks to be an outlier group with very less no of stores and significantly different characteristics from the population.
  • 7. Cluster Profiling • Cluster 4 • No of stores in the cluster : 270 .A much higher than expected no of stores(100-140) in the cluster . • Cluster Strength : 3.12 • Not very significant changes from the characteristics of the whole population. • The highest differentiator being the Fresh Food Section ( 41.4% > 38.4%) • The Tobacco and Alcohol section sales accounts a little less than the average population(21.4% < 22.8%) • Another iteration on the stores of this clusters is done to get more details.
  • 8. Cluster Profiling • Cluster 4_1 • No of stores in the cluster : 101 , Cluster Strength : 3.26 • Significant difference in the Fresh Food sales in this cluster compared to the population.(45% > 38%) • Average sales is around 184 Rs/sqft compared to the 210 Rs/sqft for the population. • Tobacco & Alcohol , Health & Beauty sections sales are a tad lower to the overall population.
  • 9. Cluster Profiling • Cluster 4_2 • No of stores in the cluster : 85 , Cluster Strength : 3.42 • It has a very similar characteristics for all the store sections and average sales when compared to the overall population.
  • 10. Cluster Profiling • Cluster 4_3 • No of stores in the cluster : 84 , Cluster Strength : 3.55 • Frozen Food section sales are a little lower than the overall population(20% < 22%) • Sales/sqft a little higher than the population for the stores in this cluster (233 Rs/sqft > 210 Rs/sqft)
  • 11. Recommendations • We find that the cluster 3 with the 3 stores is an outlier .On further check(as mentioned in the class videos) , its found to be the open area outside the store to sell the fresh foods that is not accounted for. • Hence , the stores can maybe try to sell the fresh food items in the extra open area outside the stores. • We would need more data to analyze why certain stores and sections have greater sales in different clusters.