SlideShare a Scribd company logo
1 of 13
Download to read offline
Cluster Analysis
For segmentation
Clustering
what is it?
why do we use it?
how do we do it?
What is it?
• Cluster analysis is the
process of grouping a set of
data into clusters.
• A cluster is a collection of data
points where each observation is
1) similar to other observations in
the same cluster, and 2) dissimilar
to observations in other clusters
What is it?
• Cluster analysis is a statistical tool for discovering
hidden patterns in groups of observations - e.g., on what
criteria are these “clusters” made?
• Cluster analysis is still quite subjective in nature. Does it
make sense?
In Marketing…
• Clustering is used to discover
distinct groups in customer
bases (e.g., segments), and
use this knowledge to develop
targeted marketing programs
• Another example: Insurance
companies use clustering to
determine “what type” of
drivers are risky, and safe -
and charge premiums
accordingly!
Good Clusters have:
• High: Intra-class similarity
(observations in the cluster
share qualities)
• Low: Inter-class similarity
(distinct clusters are different
from one-another)
Consider- two important
characteristics
Student grades work hours
a 3.5 0
b 3.7 5
c 2.9 10
d 2.0 12
e 3.0 15
f 2.8 14
work hours
grades
a
d
c
b
ef
cluster 1
cluster 2
How do we use this
information?
We have 2 distinct
segments.
Other data we have:
age, gender,
hometown, grade
level, major, hair color.
What is the segment
profile of each?
work hours
grades
a
d
c
b
ef
cluster 1
cluster 2
Are
these both viable
targets?
That depends on ….?
Are all of these characteristics useful?
How do we use this
information?
We have 2 distinct
segments.
Descriptive Statistics
age gender hometown major haircolor
segment 1
- works a
lot
Mean = 20 57% male 90% NKY
65%
Business
50%
blonde
segment 2
- good
students
Mean = 20 75% male
66% OH,
IN
50% Arts 75% brown
How to do it!
• You need access to SPSS.
You can either 1) log in to
NKU’s virtual network (VPN)
using the virtual desktop, or
you can use a University
computer. (I suggest VPN)
• Use this link to learn how to
use the virtual desktop. You
first have to install the VPN
software if you want to do it
off-campus: click here.
Steps to follow…
• Open your data set and save it
to a portable drive or your NKU
“j” drive
• We will be using “Two-Step”
cluster analysis.
• From SPSS file:
Analyze —->Classify —> two-step cluster
The Youtube tutorial is linked here if you need to review it.
Then follow the instructions on the YouTube
tutorial.
Warnings
Don’t use binary variables in the clustering process (e.g., gender,
team (yes/no)). These are “swamping variables” and will hijack your
clusters.
Clusters of 3-4 are ideal, even if you have to force it and the
criteria are not very good. You only have what you have…
Your data set might not ever give you “perfect” results based on the
criteria discussed in the video tutorial. Thats ok. Do the best you
can.
More on Profiles to come…

More Related Content

What's hot

Hierarchical Clustering in Data Mining
Hierarchical Clustering in Data MiningHierarchical Clustering in Data Mining
Hierarchical Clustering in Data MiningYashraj Nigam
 
Introduction to Linear Discriminant Analysis
Introduction to Linear Discriminant AnalysisIntroduction to Linear Discriminant Analysis
Introduction to Linear Discriminant AnalysisJaclyn Kokx
 
Program_Cluster_Analysis
Program_Cluster_AnalysisProgram_Cluster_Analysis
Program_Cluster_AnalysisSammya Sengupta
 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysis緯鈞 沈
 
pratik meshram-Unit 5 (contemporary mkt r sch)
pratik meshram-Unit 5 (contemporary mkt r sch)pratik meshram-Unit 5 (contemporary mkt r sch)
pratik meshram-Unit 5 (contemporary mkt r sch)Pratik Meshram
 
Cluster Analysis Assignment 2013-2014(2)
Cluster Analysis Assignment 2013-2014(2)Cluster Analysis Assignment 2013-2014(2)
Cluster Analysis Assignment 2013-2014(2)TIEZHENG YUAN
 
Clustering & classification
Clustering & classificationClustering & classification
Clustering & classificationJamshed Khan
 
Graph Clustering and cluster
Graph Clustering and clusterGraph Clustering and cluster
Graph Clustering and clusterAdil Mehmoood
 
Methods for feature/variable selection in Regression Analysis
Methods for feature/variable selection in Regression AnalysisMethods for feature/variable selection in Regression Analysis
Methods for feature/variable selection in Regression AnalysisRupak Roy
 
Data Reduction
Data ReductionData Reduction
Data ReductionRajan Shah
 
Clustering in data Mining (Data Mining)
Clustering in data Mining (Data Mining)Clustering in data Mining (Data Mining)
Clustering in data Mining (Data Mining)Mustafa Sherazi
 

What's hot (20)

Clustering
ClusteringClustering
Clustering
 
Hierarchical Clustering in Data Mining
Hierarchical Clustering in Data MiningHierarchical Clustering in Data Mining
Hierarchical Clustering in Data Mining
 
Introduction to Linear Discriminant Analysis
Introduction to Linear Discriminant AnalysisIntroduction to Linear Discriminant Analysis
Introduction to Linear Discriminant Analysis
 
Cluster Validation
Cluster ValidationCluster Validation
Cluster Validation
 
Program_Cluster_Analysis
Program_Cluster_AnalysisProgram_Cluster_Analysis
Program_Cluster_Analysis
 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysis
 
Cluster Analysis
Cluster Analysis Cluster Analysis
Cluster Analysis
 
Machine learning clustering
Machine learning clusteringMachine learning clustering
Machine learning clustering
 
Clustering in Data Mining
Clustering in Data MiningClustering in Data Mining
Clustering in Data Mining
 
pratik meshram-Unit 5 (contemporary mkt r sch)
pratik meshram-Unit 5 (contemporary mkt r sch)pratik meshram-Unit 5 (contemporary mkt r sch)
pratik meshram-Unit 5 (contemporary mkt r sch)
 
Cluster Analysis Assignment 2013-2014(2)
Cluster Analysis Assignment 2013-2014(2)Cluster Analysis Assignment 2013-2014(2)
Cluster Analysis Assignment 2013-2014(2)
 
Clustering
ClusteringClustering
Clustering
 
Clustering & classification
Clustering & classificationClustering & classification
Clustering & classification
 
Malhotra20
Malhotra20Malhotra20
Malhotra20
 
Graph Clustering and cluster
Graph Clustering and clusterGraph Clustering and cluster
Graph Clustering and cluster
 
Data discretization
Data discretizationData discretization
Data discretization
 
Methods for feature/variable selection in Regression Analysis
Methods for feature/variable selection in Regression AnalysisMethods for feature/variable selection in Regression Analysis
Methods for feature/variable selection in Regression Analysis
 
Data Reduction
Data ReductionData Reduction
Data Reduction
 
Clustering in data Mining (Data Mining)
Clustering in data Mining (Data Mining)Clustering in data Mining (Data Mining)
Clustering in data Mining (Data Mining)
 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysis
 

Similar to Basics of Clustering

Cluster Analysis in Data Science.pptx
Cluster Analysis in Data Science.pptxCluster Analysis in Data Science.pptx
Cluster Analysis in Data Science.pptxinfosec train
 
Cluster Analysis in Data Science.pptx
Cluster Analysis in Data Science.pptxCluster Analysis in Data Science.pptx
Cluster Analysis in Data Science.pptxInfosectrain3
 
Barga Data Science lecture 10
Barga Data Science lecture 10Barga Data Science lecture 10
Barga Data Science lecture 10Roger Barga
 
CodeLess Machine Learning
CodeLess Machine LearningCodeLess Machine Learning
CodeLess Machine LearningSharjeel Imtiaz
 
Data mining chapter04and5-best
Data mining chapter04and5-bestData mining chapter04and5-best
Data mining chapter04and5-bestABDUmomo
 
Research trends in data warehousing and data mining
Research trends in data warehousing and data miningResearch trends in data warehousing and data mining
Research trends in data warehousing and data miningEr. Nawaraj Bhandari
 
لموعد الإثنين 03 يناير 2022 143 مبادرة #تواصل_تطوير المحاضرة ال 143 من المباد...
لموعد الإثنين 03 يناير 2022 143 مبادرة #تواصل_تطوير المحاضرة ال 143 من المباد...لموعد الإثنين 03 يناير 2022 143 مبادرة #تواصل_تطوير المحاضرة ال 143 من المباد...
لموعد الإثنين 03 يناير 2022 143 مبادرة #تواصل_تطوير المحاضرة ال 143 من المباد...Egyptian Engineers Association
 
Nss power point_machine_learning
Nss power point_machine_learningNss power point_machine_learning
Nss power point_machine_learningGauravsd2014
 
Supervised Learning
Supervised LearningSupervised Learning
Supervised LearningFEG
 
CS 402 DATAMINING AND WAREHOUSING -MODULE 3
CS 402 DATAMINING AND WAREHOUSING -MODULE 3CS 402 DATAMINING AND WAREHOUSING -MODULE 3
CS 402 DATAMINING AND WAREHOUSING -MODULE 3NIMMYRAJU
 
TQM UNIT 2.pptx presentation with images
TQM UNIT 2.pptx presentation with imagesTQM UNIT 2.pptx presentation with images
TQM UNIT 2.pptx presentation with imagesPradeep482741
 
IRJET- A Detailed Study on Classification Techniques for Data Mining
IRJET- A Detailed Study on Classification Techniques for Data MiningIRJET- A Detailed Study on Classification Techniques for Data Mining
IRJET- A Detailed Study on Classification Techniques for Data MiningIRJET Journal
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learningSanghamitra Deb
 
Data Analytics Using R - Report
Data Analytics Using R - ReportData Analytics Using R - Report
Data Analytics Using R - ReportAkanksha Gohil
 
Module 7: Unsupervised Learning
Module 7:  Unsupervised LearningModule 7:  Unsupervised Learning
Module 7: Unsupervised LearningSara Hooker
 
التقنيات المستخدمة لتطوير المكتبات
التقنيات المستخدمة لتطوير المكتباتالتقنيات المستخدمة لتطوير المكتبات
التقنيات المستخدمة لتطوير المكتباتMohammed El Rafie Tarabay
 

Similar to Basics of Clustering (20)

Classification
ClassificationClassification
Classification
 
Cluster Analysis in Data Science.pptx
Cluster Analysis in Data Science.pptxCluster Analysis in Data Science.pptx
Cluster Analysis in Data Science.pptx
 
Cluster Analysis in Data Science.pptx
Cluster Analysis in Data Science.pptxCluster Analysis in Data Science.pptx
Cluster Analysis in Data Science.pptx
 
Barga Data Science lecture 10
Barga Data Science lecture 10Barga Data Science lecture 10
Barga Data Science lecture 10
 
CodeLess Machine Learning
CodeLess Machine LearningCodeLess Machine Learning
CodeLess Machine Learning
 
Data mining chapter04and5-best
Data mining chapter04and5-bestData mining chapter04and5-best
Data mining chapter04and5-best
 
Research trends in data warehousing and data mining
Research trends in data warehousing and data miningResearch trends in data warehousing and data mining
Research trends in data warehousing and data mining
 
لموعد الإثنين 03 يناير 2022 143 مبادرة #تواصل_تطوير المحاضرة ال 143 من المباد...
لموعد الإثنين 03 يناير 2022 143 مبادرة #تواصل_تطوير المحاضرة ال 143 من المباد...لموعد الإثنين 03 يناير 2022 143 مبادرة #تواصل_تطوير المحاضرة ال 143 من المباد...
لموعد الإثنين 03 يناير 2022 143 مبادرة #تواصل_تطوير المحاضرة ال 143 من المباد...
 
Nss power point_machine_learning
Nss power point_machine_learningNss power point_machine_learning
Nss power point_machine_learning
 
Supervised Learning
Supervised LearningSupervised Learning
Supervised Learning
 
Classification
ClassificationClassification
Classification
 
CS 402 DATAMINING AND WAREHOUSING -MODULE 3
CS 402 DATAMINING AND WAREHOUSING -MODULE 3CS 402 DATAMINING AND WAREHOUSING -MODULE 3
CS 402 DATAMINING AND WAREHOUSING -MODULE 3
 
TQM UNIT 2.pptx presentation with images
TQM UNIT 2.pptx presentation with imagesTQM UNIT 2.pptx presentation with images
TQM UNIT 2.pptx presentation with images
 
IRJET- A Detailed Study on Classification Techniques for Data Mining
IRJET- A Detailed Study on Classification Techniques for Data MiningIRJET- A Detailed Study on Classification Techniques for Data Mining
IRJET- A Detailed Study on Classification Techniques for Data Mining
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
 
Rapid Miner
Rapid MinerRapid Miner
Rapid Miner
 
Data Analytics Using R - Report
Data Analytics Using R - ReportData Analytics Using R - Report
Data Analytics Using R - Report
 
Module 7: Unsupervised Learning
Module 7:  Unsupervised LearningModule 7:  Unsupervised Learning
Module 7: Unsupervised Learning
 
Octave
OctaveOctave
Octave
 
التقنيات المستخدمة لتطوير المكتبات
التقنيات المستخدمة لتطوير المكتباتالتقنيات المستخدمة لتطوير المكتبات
التقنيات المستخدمة لتطوير المكتبات
 

More from B. Nichols

Cyclone final group project presentation
Cyclone final group project presentationCyclone final group project presentation
Cyclone final group project presentationB. Nichols
 
Ch 10 licensing nichols
Ch 10 licensing nicholsCh 10 licensing nichols
Ch 10 licensing nicholsB. Nichols
 
Sports pricing spring 2015.key
Sports pricing spring 2015.keySports pricing spring 2015.key
Sports pricing spring 2015.keyB. Nichols
 
Segmentation & Targeting
Segmentation & TargetingSegmentation & Targeting
Segmentation & TargetingB. Nichols
 
Licensing in Sports
Licensing in SportsLicensing in Sports
Licensing in SportsB. Nichols
 
Sma p resentation rev oct 25
Sma p resentation rev oct 25Sma p resentation rev oct 25
Sma p resentation rev oct 25B. Nichols
 
Ch 8 segmentation
Ch 8 segmentationCh 8 segmentation
Ch 8 segmentationB. Nichols
 
Chapter 7 - SPB 309
Chapter 7 - SPB 309Chapter 7 - SPB 309
Chapter 7 - SPB 309B. Nichols
 
Chapter 6 - SPB 309
Chapter 6 - SPB 309Chapter 6 - SPB 309
Chapter 6 - SPB 309B. Nichols
 
Chapter 8 - SBP 309
Chapter 8 - SBP 309Chapter 8 - SBP 309
Chapter 8 - SBP 309B. Nichols
 
Chapter 5 - SPB 309
Chapter 5 - SPB 309Chapter 5 - SPB 309
Chapter 5 - SPB 309B. Nichols
 
Chapter 3 Presentation - SPB 309
Chapter 3 Presentation - SPB 309Chapter 3 Presentation - SPB 309
Chapter 3 Presentation - SPB 309B. Nichols
 
Chapter 1 Presentation - SPB 309
Chapter 1 Presentation - SPB 309Chapter 1 Presentation - SPB 309
Chapter 1 Presentation - SPB 309B. Nichols
 

More from B. Nichols (17)

Cyclone final group project presentation
Cyclone final group project presentationCyclone final group project presentation
Cyclone final group project presentation
 
Ch 10 licensing nichols
Ch 10 licensing nicholsCh 10 licensing nichols
Ch 10 licensing nichols
 
Sports pricing spring 2015.key
Sports pricing spring 2015.keySports pricing spring 2015.key
Sports pricing spring 2015.key
 
Segmentation & Targeting
Segmentation & TargetingSegmentation & Targeting
Segmentation & Targeting
 
Pricing
PricingPricing
Pricing
 
Licensing in Sports
Licensing in SportsLicensing in Sports
Licensing in Sports
 
Sma p resentation rev oct 25
Sma p resentation rev oct 25Sma p resentation rev oct 25
Sma p resentation rev oct 25
 
Ch 8 segmentation
Ch 8 segmentationCh 8 segmentation
Ch 8 segmentation
 
Chapter 12
Chapter 12Chapter 12
Chapter 12
 
Chapter 11
Chapter 11Chapter 11
Chapter 11
 
Chapter 10
Chapter 10Chapter 10
Chapter 10
 
Chapter 7 - SPB 309
Chapter 7 - SPB 309Chapter 7 - SPB 309
Chapter 7 - SPB 309
 
Chapter 6 - SPB 309
Chapter 6 - SPB 309Chapter 6 - SPB 309
Chapter 6 - SPB 309
 
Chapter 8 - SBP 309
Chapter 8 - SBP 309Chapter 8 - SBP 309
Chapter 8 - SBP 309
 
Chapter 5 - SPB 309
Chapter 5 - SPB 309Chapter 5 - SPB 309
Chapter 5 - SPB 309
 
Chapter 3 Presentation - SPB 309
Chapter 3 Presentation - SPB 309Chapter 3 Presentation - SPB 309
Chapter 3 Presentation - SPB 309
 
Chapter 1 Presentation - SPB 309
Chapter 1 Presentation - SPB 309Chapter 1 Presentation - SPB 309
Chapter 1 Presentation - SPB 309
 

Recently uploaded

Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,Virag Sontakke
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Science lesson Moon for 4th quarter lesson
Science lesson Moon for 4th quarter lessonScience lesson Moon for 4th quarter lesson
Science lesson Moon for 4th quarter lessonJericReyAuditor
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaVirag Sontakke
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfMahmoud M. Sallam
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 

Recently uploaded (20)

Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Science lesson Moon for 4th quarter lesson
Science lesson Moon for 4th quarter lessonScience lesson Moon for 4th quarter lesson
Science lesson Moon for 4th quarter lesson
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of India
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdf
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 

Basics of Clustering

  • 2. Clustering what is it? why do we use it? how do we do it?
  • 3. What is it? • Cluster analysis is the process of grouping a set of data into clusters. • A cluster is a collection of data points where each observation is 1) similar to other observations in the same cluster, and 2) dissimilar to observations in other clusters
  • 4. What is it? • Cluster analysis is a statistical tool for discovering hidden patterns in groups of observations - e.g., on what criteria are these “clusters” made? • Cluster analysis is still quite subjective in nature. Does it make sense?
  • 5. In Marketing… • Clustering is used to discover distinct groups in customer bases (e.g., segments), and use this knowledge to develop targeted marketing programs • Another example: Insurance companies use clustering to determine “what type” of drivers are risky, and safe - and charge premiums accordingly!
  • 6. Good Clusters have: • High: Intra-class similarity (observations in the cluster share qualities) • Low: Inter-class similarity (distinct clusters are different from one-another)
  • 7. Consider- two important characteristics Student grades work hours a 3.5 0 b 3.7 5 c 2.9 10 d 2.0 12 e 3.0 15 f 2.8 14 work hours grades a d c b ef cluster 1 cluster 2
  • 8. How do we use this information? We have 2 distinct segments. Other data we have: age, gender, hometown, grade level, major, hair color. What is the segment profile of each? work hours grades a d c b ef cluster 1 cluster 2
  • 9. Are these both viable targets? That depends on ….? Are all of these characteristics useful? How do we use this information? We have 2 distinct segments. Descriptive Statistics age gender hometown major haircolor segment 1 - works a lot Mean = 20 57% male 90% NKY 65% Business 50% blonde segment 2 - good students Mean = 20 75% male 66% OH, IN 50% Arts 75% brown
  • 10. How to do it! • You need access to SPSS. You can either 1) log in to NKU’s virtual network (VPN) using the virtual desktop, or you can use a University computer. (I suggest VPN) • Use this link to learn how to use the virtual desktop. You first have to install the VPN software if you want to do it off-campus: click here.
  • 11. Steps to follow… • Open your data set and save it to a portable drive or your NKU “j” drive • We will be using “Two-Step” cluster analysis. • From SPSS file: Analyze —->Classify —> two-step cluster The Youtube tutorial is linked here if you need to review it. Then follow the instructions on the YouTube tutorial.
  • 12. Warnings Don’t use binary variables in the clustering process (e.g., gender, team (yes/no)). These are “swamping variables” and will hijack your clusters. Clusters of 3-4 are ideal, even if you have to force it and the criteria are not very good. You only have what you have… Your data set might not ever give you “perfect” results based on the criteria discussed in the video tutorial. Thats ok. Do the best you can.
  • 13. More on Profiles to come…