Sorting Customers into Groups
Web Analytics Wednesday
Michael Levin
mlevin@otterbein.edu
@MichaelALevin
Sorting the Cards
Differing Forms of Machine Learning
                  Supervised       Unsupervised
Categorical Data  Classification   Association Analysis
Continuous Data   Regression       Clustering & Dimension Reduction
Seeing Similarity, Dissimilarity
Relishing Parsimony
Taking Action
Half Off
50% Off
Limited Time Offer
Types of Clustering
Hierarchical K-Means
Step 0: Select Variables
Revenue ($)
Number of Downloads (#)
Frequency of Purchase (#)
Activities Registered (#)
Number of Items in Basket (#)
Percent of Emails Clicked (%)
Cost of Acquisition ($)
Number of Pages (#)
Time Since Last Purchase
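The variables listed for Step 0 are measured in different units ($, #, %). K-means groups points by distance, so a variable with a large numeric range can dominate the result. One common remedy, which the slides do not mention and is shown here only as an assumption, is to convert each variable to z-scores before clustering. The function name `standardize` and the customer rows are hypothetical.

```python
from statistics import mean, pstdev


def standardize(rows):
    """Rescale each column to mean 0 and standard deviation 1 (z-scores)."""
    columns = list(zip(*rows))
    means = [mean(col) for col in columns]
    stdevs = [pstdev(col) for col in columns]
    return [
        # A constant column has stdev 0; map it to 0.0 instead of dividing by zero.
        tuple((v - m) / s if s else 0.0 for v, m, s in zip(row, means, stdevs))
        for row in rows
    ]


# Hypothetical customers: (revenue in $, number of downloads, % of emails clicked)
customers = [(120.0, 2, 10.0), (950.0, 12, 45.0), (430.0, 5, 30.0), (60.0, 1, 5.0)]
scaled = standardize(customers)
```

After scaling, every variable contributes to the distance on a comparable footing, whatever its original unit.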
Step 1: Specify Number of Clusters
Step 2: Random Assignment
Step 3: Compute Initial Centroids
Step 4: Re-Assign Each Point
Step 5: Re-Compute Centroids
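Steps 1 through 5 can be sketched in plain Python. This is a minimal sketch, not the presenter's implementation: the function names, the five sample points, and the seed are hypothetical. Dealing the shuffled points out round-robin in Step 2 is an assumption made so that no cluster starts empty, and Steps 4 and 5 are repeated until no point changes cluster.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def compute_centroids(points, labels, k, rng):
    """Steps 3 and 5: each centroid is the mean of its cluster's points."""
    centroids = []
    for c in range(k):
        members = [p for p, lab in zip(points, labels) if lab == c]
        if not members:
            # Assumption: an empty cluster is re-seeded with a random point.
            members = [rng.choice(points)]
        centroids.append(tuple(sum(col) / len(members) for col in zip(*members)))
    return centroids


def kmeans(points, k, seed=0, max_iter=100):
    """Step 1 is the choice of k; returns (labels, centroids)."""
    rng = random.Random(seed)
    # Step 2: random assignment (shuffle, then deal the points out round-robin).
    order = list(range(len(points)))
    rng.shuffle(order)
    labels = [0] * len(points)
    for position, idx in enumerate(order):
        labels[idx] = position % k
    # Step 3: compute the initial centroids.
    centroids = compute_centroids(points, labels, k, rng)
    for _ in range(max_iter):
        # Step 4: re-assign each point to its closest centroid.
        new_labels = [
            min(range(k), key=lambda c: sq_dist(p, centroids[c])) for p in points
        ]
        if new_labels == labels:
            break  # no point moved, so the solution is stable
        labels = new_labels
        # Step 5: re-compute the centroids from the new assignment.
        centroids = compute_centroids(points, labels, k, rng)
    return labels, centroids


# Hypothetical data: 5 points in 2-D space, clustered with k = 2.
points = [(1, 1), (2, 1), (1, 2), (8, 8), (9, 9)]
labels, centroids = kmeans(points, k=2)
```

With these points, the three low-valued points end up in one cluster and the two high-valued points in the other. Which cluster receives which label number depends on the random start.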
3 Cluster Solution
Cluster  % Black  % Hispanic  % Asian  Median Age  Unemp. Rate  Per Capita Income
1        12       27          3        30          8            19
2        29       3           2        32          7            21
3        10       5           29       36          5            28
Distance 207.64
4 Cluster Solution
Cluster  % Black  % Hispanic  % Asian  Median Age  Unemp. Rate  Per Capita Income
1        14       28          6        30          9            21
2        15       4           2        32          5            20
3        48       3           1        32          9            22
4        10       5           29       36          5            30
Distance 165.35
5 Cluster Solution
Cluster  % Black  % Hispanic  % Asian  Median Age  Unemp. Rate  Per Capita Income
1        14       28          6        30          9            21
2        15       4           2        32          5            20
3        48       3           1        32          9            22
4        11       9           21       36          6            30
5        1        5           71       37          5            24
Distance 145.75
Decide on Number
City      3 Clusters  4 Clusters  5 Clusters
Dallas    1           1           1
Austin    1           2           2
Atlanta   2           3           3
Boston    2           2           2
Seattle   3           4           4
Honolulu  3           4           5
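One way to weigh the three solutions is to see how much the reported distance falls with each added cluster. The sketch below uses the Distance figures from the 3, 4, and 5 cluster slides and assumes that a lower distance means tighter, more homogeneous groups; the slides do not define the measure. The variable names are hypothetical.

```python
# Distance reported on the slides for each number of clusters.
distance_by_k = {3: 207.64, 4: 165.35, 5: 145.75}

ks = sorted(distance_by_k)
drops = {}
for prev_k, k in zip(ks, ks[1:]):
    drop = distance_by_k[prev_k] - distance_by_k[k]   # absolute improvement
    pct = 100 * drop / distance_by_k[prev_k]          # improvement relative to the previous solution
    drops[k] = (round(drop, 2), round(pct, 1))

for k, (drop, pct) in drops.items():
    print(f"{k - 1} -> {k} clusters: distance falls by {drop} ({pct}%)")
```

A shrinking improvement is one signal that further clusters add complexity faster than they add homogeneity, which is the parsimony trade-off. Whether the segments are actionable is still a judgment call.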
Add to Supervised Learning
City      Cluster 1  Cluster 2  Cluster 3  Cluster 4
Dallas    1          0          0          0
Austin    0          1          0          0
Atlanta   0          0          1          0
Boston    0          1          0          0
Seattle   0          0          0          1
Honolulu  0          0          0          1
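The table above can be produced from the 4 cluster assignments with a few lines of Python. This is a minimal sketch; the function name `to_dummies` is hypothetical, and the city assignments are taken from the 4 Clusters column of the Decide on Number slide.

```python
# 4-cluster assignment for each city, as shown on the slides.
assignments = {
    "Dallas": 1,
    "Austin": 2,
    "Atlanta": 3,
    "Boston": 2,
    "Seattle": 4,
    "Honolulu": 4,
}


def to_dummies(assignments, k):
    """Turn each cluster number into k indicator (0/1) variables."""
    return {
        city: [1 if cluster == c else 0 for c in range(1, k + 1)]
        for city, cluster in assignments.items()
    }


dummies = to_dummies(assignments, k=4)
for city, row in dummies.items():
    print(f"{city:<9}", *row)
```

When these indicators feed a regression that includes an intercept, one of the k columns is usually left out as the reference group, because the full set always sums to 1.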

Columbus Web Analytics Wednesday September 2019

Editor's Notes

  1. Title slide with playing cards
  2. How can you sort these cards? By color. Number vs. non-number. Suit. High value. Low value. Blackjack. Runs by number. Runs by order. The whole deck of cards. Each card. Sorting the cards gets us to clusters: similar/dissimilar, parsimony, actionable.
  3. Supervised: we need a teacher or analyst to make decisions about the model, which is trained on data with known outcomes. Regression: predict the value of a house or the price someone is willing to pay. Classification: is this email spam or not? (I am simplifying here.) Unsupervised: the algorithm makes the decisions. Association: people who buy X also buy Y (basket analysis). Clustering: grouping items, observations, or people based on variables. There are other forms besides clustering, such as preference or perceptual maps and factor analysis.
  4. Photos: a row of different mobile devices; a row of different tablets; a row of different laptops.
  5. Photos: a laptop and a mobile device (2); 4 laptops and 4 mobile devices (8); 8 laptops, 8 mobile devices, and 8 tablets.
  6. Photos: email, responsive web design, offers.
  7. Four types of clustering. Most stats-oriented packages include hierarchical and non-hierarchical methods. Hierarchical: use when you do not know how many groups there are. K-Means: you need to specify the number of groups to start; it performs better with large datasets.
  8. Select the variables. Consider what is important to your customer profile.
  9. Specify the desired number of clusters, k: let us choose k = 2 for these 5 data points in 2-D space.
  10. Randomly assign each data point to a cluster: let's assign three points to cluster 1, shown in green, and two points to cluster 2, shown in grey.
  11. Compute cluster centroids: the centroid of the data points in the green cluster is shown as a green cross, and that of the grey cluster as a grey cross.
  12. Re-assign each point to the closest cluster centroid: note that only the data point at the bottom is assigned to the green cluster even though it is closer to the centroid of the grey cluster. Thus, we re-assign that data point to the grey cluster.
  13. Re-compute cluster centroids: now re-compute the centroids for both clusters. Steps 4 and 5 are repeated until no point changes cluster.
  14. Photos: parsimony.
  15. Photos: parsimony.
  16. Photos: parsimony.
  17. As I add more clusters, I get more homogeneous groups: the distance falls from 207.64 with 3 clusters to 165.35 with 4 and 145.75 with 5.
  18. Once I have decided on a solution, I can link the cluster results to regression or other supervised learning by converting the cluster assignments to dummy (0/1) variables.
  19. Title slide with playing cards