SlideShare a Scribd company logo
1 of 25
Download to read offline
1
Cluster Analysis:
A practical example
2
Content
Introduction: the necessity to reduce the
complexity
Recall: what cluster analysis does
An example : cluster analysis in consumer
research on fair trade coffee
Discussion
(…)
“Where is the life, we have lost in living?
Where is the wisdom, we have lost in knowledge?
Where is the knowledge, we have lost in information?”
(…)
T. S. Elliot,
Choruses from the Rock
(1888 – 1965)
Intro
(…)
Where is the wisdom, we have lost in knowledge?
Where is the knowledge, we have lost in information?
(…)
“Where is the information we have lost in data?”
Intro
In order to go from data to information, to
knowledge and to wisdom,
we need to reduce the complexity of the data.
Complexity can be reduced on
- case level : cluster analysis
- on variable level: factor analysis
Intro
Cluster analysis can get you from this:
To this:
What cluster analysis does
a
b
d
e
f
c
Cluster analysis
• generate groups which are similar
• homogeneous within the group and as much as
possible heterogeneous to other groups
• data consists usually of objects or persons
• segmentation based on more than two variables
What cluster analysis does
Cluster analysis
• generates groups which are similar
• the groups are homogeneous within themselves
and as much as possible heterogeneous to other
groups
• data consists usually of objects or persons
• segmentation is based on more than two
variables
What cluster analysis does
Examples for datasets used for cluster
analysis:
• socio-economic criteria: income, education, profession,
age, number of children, size of city of residence ....
• psychographic criteria: interest, life style, motivation,
values, involvement
• criteria linked to the buying behaviour: price range, type
of media used, intensity of use, choice of retail outlet,
fidelity, buyer/non-buyer, buying intensity
What cluster analysis does
Proximity Measures
• Proximity measures are used to represent the nearness of two
objects
• relate objects with a high similarity to the same cluster and objects
with low similarity to different clusters
• differentiation of nominal-scaled and metric-scaled variables
What cluster analysis does
m
d(yi,ys) = [∑ |yij-ysj|r
]1/r
j=1
y = vector
i,s = different objects
j = the different characteristics
r = changes the weight of assigned distances
the calculation of the distances measures is the basis of the
cluster analysis.
Two phases:
1. Forming of clusters by the chosen data set – resulting
in a new variable that identifies cluster members
among the cases
2. Description of clusters by re-crossing with the data
What cluster analysis does
Cluster Algorithm in agglomerative hierarchical
clustering methods – seven steps to get clusters
1. each object is a independent cluster, n
2. two clusters with the lowest distance are merged to
one cluster. reduce the number of clusters by 1 (n-1)
3. calculate the the distance matrix between the new
cluster and all remaining clusters
4. repeat step 2 and 3, (n-1) times until all objects form
one reminding cluster
What cluster analysis does
Finally…
1. decide upon the number of clusters you want to keep
(decision often based on the size of the clusters)
2. description of the clusters by means of the cluster-
forming variables
3. appellation of the clusters with catchy titles
What cluster analysis does
What cluster analysis does
Cluster 5Cluster 4Cluster 3Cluster 2Cluster 1
Practical Example
Consumers and Fair Trade Coffee (1997!)
214 interviews of consumers of fair trade
coffee (personal and telephone interviews)
Cluster analysis in order to identify consumer
typologies
Identification of 6 clusters
Description of these clusters by further
analysis: comparison of means, crosstabs etc.
Consumers and Fair Trade Coffee
Description of clusters:
Cluster 1 (11,6%): “self-oriented fair trade buyer”
Cluster 2 (13,6%): “less ready to take personal
constraints”
Cluster 3 (18,2%): ”less engaged about fair trade”
Cluster 4 (32,2%): “intensive buyer”
Cluster 5 (18,7%): “value-oriented”
Cluster 6 (5,6%): “does not like the taste of fair trade
coffee”
Consumers and Fair Trade Coffee
Description of Cluster 1 (11,6%): “self-oriented fair trade
buyer” :
Searches satisfaction by doing the good thing
Is not altruistic
Buys occasionally
Sticks to his conventional coffee brand
High level of formal education
Frequently religious (catholic or protestant)
Consumers and Fair Trade Coffee
Description of Cluster 2 (13,6%): “less ready to take
personal constraints”
States that “fair trade coffee is hard to find”
Feels responsible for fare development issues
Believes that fair trade is efficient for developing
countries
Is less ready to go to special fair trade outlets
Buys conventional coffee
Likes the taste of fair trade coffee
Consumers and Fair Trade Coffee
Description of clusters Cluster 3 (18,2%): ”less engaged
about fair trade” :
Feels no personal responsibility with regard to
development questions
Doesn’t see the efficiency of the consumption of fair
trade goods
The only thing that can make him change is the
influence of friends
Is older then the average fair trade buyer and has less
formal education
Consumers and Fair Trade Coffee
Description of clusters: Cluster 4 (32,2%): “intensive
buyer”
Has abandoned conventional coffee brands
Has started to buy fair trade quite a while ago (> 3
years)
Shops frequently in fair trade stores (and not in organic
retail)
Is ready to act for fair development and talks to friends
about it
Relatively young, with low incomes and high
educational values
Consumers and Fair Trade Coffee
Description of clusters: Cluster 5 (18,7%): “value-
oriented”
Together with cluster 4 highly aware of development
issues
Ready to act and to constraint consumption habits
Buys for altruistic reasons
Highly involved in social / political action
Most frequently women, highest household income
among all clusters
Own security is the basis for solidary action
Consumers and Fair Trade Coffee
Description of clusters: Cluster 6 (5,6%): “does not like
the taste of fair trade coffee”
Lowest purchase intensity of all clusters
Not willing to accept constraints in consumption habits
or higher prices
Most members of these group are attached to a
conventional coffee brand
Relatively high incomes, age within the average of all
groups, lower level of formal education
Less religious than other groups.
Conclusion /
discussion
Advantages
• no special scales of measurement necessary
• high persuasiveness and good assignment to realisable
recommendations in practice
Disadvantages
• choice of cluster-forming variables often not based on
theory but at random
• determination of the right number of clusters often time-
consuming – often decided upon arbitrarily
• high influence on the interpretation of the scientist, difficult
to control (good documentation is needed)
Conclusion /
discussion
Russell .L. Ackoff, "From Data to Wisdom," Journal of Applied Systems
Analysis 16 (1989): 3-9.
Milan Zeleny, "Management Support Systems: Towards Integrated
Knowledge Management," Human Systems Management 7, no 1
(1987): 59-70.
Tashakkori, A. and Ch. Teddlie: Combining Qualitative and Quantitaive
Approaches. Applied Social Research Methods Series, Volume 46.
Thousand Oaks, London, New Delhi, 1998.
xyxy
Sources

More Related Content

Recently uploaded

Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
KarakKing
 

Recently uploaded (20)

Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 

Featured

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Featured (20)

Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 

Data science courses in hyderabad.pdf

  • 2. 2 Content Introduction: the necessity to reduce the complexity Recall: what cluster analysis does An example : cluster analysis in consumer research on fair trade coffee Discussion
  • 3. (…) “Where is the life, we have lost in living? Where is the wisdom, we have lost in knowledge? Where is the knowledge, we have lost in information?” (…) T. S. Elliot, Choruses from the Rock (1888 – 1965) Intro
  • 4. (…) Where is the wisdom, we have lost in knowledge? Where is the knowledge, we have lost in information? (…) “Where is the information we have lost in data?” Intro
  • 5. In order to go from data to information, to knowledge and to wisdom, we need to reduce the complexity of the data. Complexity can be reduced on - case level : cluster analysis - on variable level: factor analysis Intro
  • 6. Cluster analysis can get you from this: To this: What cluster analysis does a b d e f c
  • 7. Cluster analysis • generate groups which are similar • homogeneous within the group and as much as possible heterogeneous to other groups • data consists usually of objects or persons • segmentation based on more than two variables What cluster analysis does
  • 8. Cluster analysis • generates groups which are similar • the groups are homogeneous within themselves and as much as possible heterogeneous to other groups • data consists usually of objects or persons • segmentation is based on more than two variables What cluster analysis does
  • 9. Examples for datasets used for cluster analysis: • socio-economic criteria: income, education, profession, age, number of children, size of city of residence .... • psychographic criteria: interest, life style, motivation, values, involvement • criteria linked to the buying behaviour: price range, type of media used, intensity of use, choice of retail outlet, fidelity, buyer/non-buyer, buying intensity What cluster analysis does
  • 10. Proximity Measures • Proximity measures are used to represent the nearness of two objects • relate objects with a high similarity to the same cluster and objects with low similarity to different clusters • differentiation of nominal-scaled and metric-scaled variables What cluster analysis does m d(yi,ys) = [∑ |yij-ysj|r ]1/r j=1 y = vector i,s = different objects j = the different characteristics r = changes the weight of assigned distances the calculation of the distances measures is the basis of the cluster analysis.
  • 11. Two phases: 1. Forming of clusters by the chosen data set – resulting in a new variable that identifies cluster members among the cases 2. Description of clusters by re-crossing with the data What cluster analysis does
  • 12. Cluster Algorithm in agglomerative hierarchical clustering methods – seven steps to get clusters 1. each object is a independent cluster, n 2. two clusters with the lowest distance are merged to one cluster. reduce the number of clusters by 1 (n-1) 3. calculate the the distance matrix between the new cluster and all remaining clusters 4. repeat step 2 and 3, (n-1) times until all objects form one reminding cluster What cluster analysis does
  • 13. Finally… 1. decide upon the number of clusters you want to keep (decision often based on the size of the clusters) 2. description of the clusters by means of the cluster- forming variables 3. appellation of the clusters with catchy titles What cluster analysis does
  • 14. What cluster analysis does Cluster 5Cluster 4Cluster 3Cluster 2Cluster 1
  • 15. Practical Example Consumers and Fair Trade Coffee (1997!) 214 interviews of consumers of fair trade coffee (personal and telephone interviews) Cluster analysis in order to identify consumer typologies Identification of 6 clusters Description of these clusters by further analysis: comparison of means, crosstabs etc.
  • 16. Consumers and Fair Trade Coffee Description of clusters: Cluster 1 (11,6%): “self-oriented fair trade buyer” Cluster 2 (13,6%): “less ready to take personal constraints” Cluster 3 (18,2%): ”less engaged about fair trade” Cluster 4 (32,2%): “intensive buyer” Cluster 5 (18,7%): “value-oriented” Cluster 6 (5,6%): “does not like the taste of fair trade coffee”
  • 17. Consumers and Fair Trade Coffee Description of Cluster 1 (11,6%): “self-oriented fair trade buyer” : Searches satisfaction by doing the good thing Is not altruistic Buys occasionally Sticks to his conventional coffee brand High level of formal education Frequently religious (catholic or protestant)
  • 18. Consumers and Fair Trade Coffee Description of Cluster 2 (13,6%): “less ready to take personal constraints” States that “fair trade coffee is hard to find” Feels responsible for fare development issues Believes that fair trade is efficient for developing countries Is less ready to go to special fair trade outlets Buys conventional coffee Likes the taste of fair trade coffee
  • 19. Consumers and Fair Trade Coffee Description of clusters Cluster 3 (18,2%): ”less engaged about fair trade” : Feels no personal responsibility with regard to development questions Doesn’t see the efficiency of the consumption of fair trade goods The only thing that can make him change is the influence of friends Is older then the average fair trade buyer and has less formal education
  • 20. Consumers and Fair Trade Coffee Description of clusters: Cluster 4 (32,2%): “intensive buyer” Has abandoned conventional coffee brands Has started to buy fair trade quite a while ago (> 3 years) Shops frequently in fair trade stores (and not in organic retail) Is ready to act for fair development and talks to friends about it Relatively young, with low incomes and high educational values
  • 21. Consumers and Fair Trade Coffee Description of clusters: Cluster 5 (18,7%): “value- oriented” Together with cluster 4 highly aware of development issues Ready to act and to constraint consumption habits Buys for altruistic reasons Highly involved in social / political action Most frequently women, highest household income among all clusters Own security is the basis for solidary action
  • 22. Consumers and Fair Trade Coffee Description of clusters: Cluster 6 (5,6%): “does not like the taste of fair trade coffee” Lowest purchase intensity of all clusters Not willing to accept constraints in consumption habits or higher prices Most members of these group are attached to a conventional coffee brand Relatively high incomes, age within the average of all groups, lower level of formal education Less religious than other groups.
  • 23. Conclusion / discussion Advantages • no special scales of measurement necessary • high persuasiveness and good assignment to realisable recommendations in practice Disadvantages • choice of cluster-forming variables often not based on theory but at random • determination of the right number of clusters often time- consuming – often decided upon arbitrarily • high influence on the interpretation of the scientist, difficult to control (good documentation is needed)
  • 25. Russell .L. Ackoff, "From Data to Wisdom," Journal of Applied Systems Analysis 16 (1989): 3-9. Milan Zeleny, "Management Support Systems: Towards Integrated Knowledge Management," Human Systems Management 7, no 1 (1987): 59-70. Tashakkori, A. and Ch. Teddlie: Combining Qualitative and Quantitaive Approaches. Applied Social Research Methods Series, Volume 46. Thousand Oaks, London, New Delhi, 1998. xyxy Sources