SlideShare a Scribd company logo
1 of 19
Customer Opinions Analysis
for Starbucks in Yelp
Web Analytics, Fall 2014
Professor Yilu Zhou
ISGB 7978
Team:
Yixi Zhang, Xiaoshan Jin, Yi
Chun Chien, Yi Ting Kao
Agenda
1. Problem Statement
2. Project Design
3. Stage 1 - Analytic Pre-define
4. Stage 2 - Unstructured Data Analysis
• Correlation Analysis (Overall rating)
• How Rating Differs from Location (Overall rating)
• Feature Selection (Low rating)
• Python Feature Counts Algorithm (Low rating)
• Definition of Top Bad Performance Areas (Low rating)
• Analytics – Manhattan Visualization (Low rating)
5. Analytics Summary & Recommendation
2
3
Problem Statement
There are 212 Starbucks stores in Manhattan. The average rating on
Yelp is 2.8 stars. Some comments have Low rating with 1~2 star.
Project Goal:
Find out the factors causing Starbucks stores’ bad performance to
ensure the highest level of customer satisfaction.
4
Project Design
5
Stage 1 - Analytic Pre-define
• Platform & Tool Selection: Python, Content Analyzer and JMP
• Data collection:
• Use Python to craw 176 Starbucks stores in Yelp
• Variables: Store location, user location, user comment, user rating
• Reviews Distribution:
- Total review number: 3052
- Average Rating: 2.8
- 74% customers from NY;
26% customers from other places
• Pre-define Complaints Categories
• Product, Service, Waiting-time & Environment
User Rating User Location
Review#
6
Stage 2 - Correlation Analysis (Overall rating)
• Target variable: User Rating Group :High (4,5stars) vs. Low (1,2stars)
• Independent Variable: Store Area, User Location, Comment Length
• Use Goodness-of-Fit Test to see correlation between target and independent variables
- Comment Length &
Store location correlated
to User Rating Group
Significant
Significant
7
Stage 2: How Rating Differs from Location (Overall Rating)
Review # Rating
Why Midtown East is better than Midtown West
When both area have similar numbers of review and
user location?
• Top 3 Bad Areas:
• Lower East Side
• Greenwich Village and SOHO
• Chelsea and Clinton
• Top 3 Good Areas:
• Central Park and Murray Hill
• Lower Manhattan
• Inwood and Washington Heights
Low
Rating
> 62%
High
Rating
> 52%
8
Stage 2 - Feature Selection (Low rating)
Assumptions:
1) All comments from Low rating only talk about negative opinions about Starbucks;
2) An index for each feature is set as Features counts numbers/Bad Comments numbers to
every zip code in order to compare features based on zip code level.
Content Analyzer output cleansing: Stop Words and Word Stemming.
Finalized Feature list:
Product – (coffee, drink, drinks, cup, latte, tea, iced, milk, food, wrong)
Waiting time – (time, line, minutes, long, wait, slow, waiting, busy)
Environment – (bathroom, small, clean, seating)
Service – (people, service, staff, barista, baristas, rude, cashier, manager, friendly, attitude)
9
Stage 2 - Python Feature Counts Algorithm:(Low rating)
Calculation rule:
Any feature occurrence in the feature lists labels as “1”. Otherwise, “0”.
• Assess every user review by Product,
Service, Waiting time, and Environment
features;
• Group all of the feature counts based on
store location(Zip Code) .
10
Stage 2 – Definition of Top Bad Performance Areas
(Low rating)
Definition Rules(%)
Environment Complaint Product Complaint Service Complaint Waiting time Complaint
Index Range 10.71-
60
Index Range 46.43-
100
Index Range 43.75-
100
Index Range 44.44-
100
Index Median 35.36 Index Median 73.21 Index Median 71.88 Index Median 72.22
Top Bad
Performance
Index Point
35 Top Bad
Performance
Index Point
85 Top Bad
Performance
Index Point
85 Top Bad Performance
Index Point
65
11
Analytics Summary
Manhattan Top Bad Performance Areas
Environment
Complaint
Product Complaint Service Complaint Waiting Complaint
Upper West Side Lower East Side Lower Manhattan Central Park and
Murray Hill
Chelsea and Clinton Central Park and
Murray Hill
Upper East Side Chelsea and Clinton
Greenwich Village and
Soho
Upper East Side Inwood and
Washington
Heights
Lower Manhattan
N/A Inwood and Washington
Heights
N/A Inwood and
Washington Heights
N/A Central Harlem N/A
12
Analytics – Manhattan Visualization (Low rating)
Environment Complaint Product Complaint
13
Analytics – Manhattan Visualization (Low rating)
Waiting Time Complaint Service Complaint
Midtown East has lower “Service
Complaints” rate than Midtown
West
14
Recommendations
To Manager of Manhattan area:
1. The common concerns for customers in all Manhattan
area are long waiting time and bad service.
• Hire more cashiers and baristas based on each store’s
situation (financially efficient)
• Train current employees to provide more professional,
flexible and efficient services in a high quality.
• Establish an awards and penalty system for employees.
(Attitude, Efficiency)
2. Give priority to areas with high number of reviews but
relative Low rating. E.g. downtown, west midtown
15
Recommendations
3. Each zip code area should try to
improve the top three concerns of the
customers no matter what the overall
rating it get.
E.g. Inwood and Washington Heights
16
Thank You
Q&A
17
Appendix 1 Manhattan Zip Code
18
Appendix 2 Content Analyzer Output One
19
Appendix 3 Content Analyzer Output Two

More Related Content

Viewers also liked

[DSC 2016] 系列活動:李泳泉 / 星火燎原 - Spark 機器學習初探
[DSC 2016] 系列活動:李泳泉 / 星火燎原 - Spark 機器學習初探[DSC 2016] 系列活動:李泳泉 / 星火燎原 - Spark 機器學習初探
[DSC 2016] 系列活動:李泳泉 / 星火燎原 - Spark 機器學習初探台灣資料科學年會
 
[系列活動] 資料探勘速遊 - Session4 case-studies
[系列活動] 資料探勘速遊 - Session4 case-studies[系列活動] 資料探勘速遊 - Session4 case-studies
[系列活動] 資料探勘速遊 - Session4 case-studies台灣資料科學年會
 
[DSC 2016] 系列活動:李祈均 / 人類行為大數據分析
[DSC 2016] 系列活動:李祈均 / 人類行為大數據分析[DSC 2016] 系列活動:李祈均 / 人類行為大數據分析
[DSC 2016] 系列活動:李祈均 / 人類行為大數據分析台灣資料科學年會
 
[DSC 2016] 系列活動:吳牧恩、林佳緯 / 用 R 輕鬆做交易策略分析及自動下單
[DSC 2016] 系列活動:吳牧恩、林佳緯 / 用 R 輕鬆做交易策略分析及自動下單[DSC 2016] 系列活動:吳牧恩、林佳緯 / 用 R 輕鬆做交易策略分析及自動下單
[DSC 2016] 系列活動:吳牧恩、林佳緯 / 用 R 輕鬆做交易策略分析及自動下單台灣資料科學年會
 
[DSC 2016] 系列活動:許懷中 / R 語言資料探勘實務
[DSC 2016] 系列活動:許懷中 / R 語言資料探勘實務[DSC 2016] 系列活動:許懷中 / R 語言資料探勘實務
[DSC 2016] 系列活動:許懷中 / R 語言資料探勘實務台灣資料科學年會
 
[系列活動] 手把手教你R語言資料分析實務
[系列活動] 手把手教你R語言資料分析實務[系列活動] 手把手教你R語言資料分析實務
[系列活動] 手把手教你R語言資料分析實務台灣資料科學年會
 
[系列活動] 給工程師的統計學及資料分析 123
[系列活動] 給工程師的統計學及資料分析 123[系列活動] 給工程師的統計學及資料分析 123
[系列活動] 給工程師的統計學及資料分析 123台灣資料科學年會
 
[系列活動] 使用 R 語言建立自己的演算法交易事業
[系列活動] 使用 R 語言建立自己的演算法交易事業[系列活動] 使用 R 語言建立自己的演算法交易事業
[系列活動] 使用 R 語言建立自己的演算法交易事業台灣資料科學年會
 
[系列活動] Machine Learning 機器學習課程
[系列活動] Machine Learning 機器學習課程[系列活動] Machine Learning 機器學習課程
[系列活動] Machine Learning 機器學習課程台灣資料科學年會
 
[系列活動] 智慧製造與生產線上的資料科學 (製造資料科學:從預測性思維到處方性決策)
[系列活動] 智慧製造與生產線上的資料科學 (製造資料科學:從預測性思維到處方性決策)[系列活動] 智慧製造與生產線上的資料科學 (製造資料科學:從預測性思維到處方性決策)
[系列活動] 智慧製造與生產線上的資料科學 (製造資料科學:從預測性思維到處方性決策)台灣資料科學年會
 
[系列活動] 手把手的深度學習實務
[系列活動] 手把手的深度學習實務[系列活動] 手把手的深度學習實務
[系列活動] 手把手的深度學習實務台灣資料科學年會
 
[系列活動] 智慧城市中的時空大數據應用
[系列活動] 智慧城市中的時空大數據應用[系列活動] 智慧城市中的時空大數據應用
[系列活動] 智慧城市中的時空大數據應用台灣資料科學年會
 
[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用
[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用
[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用台灣資料科學年會
 

Viewers also liked (16)

[DSC 2016] 系列活動:李泳泉 / 星火燎原 - Spark 機器學習初探
[DSC 2016] 系列活動:李泳泉 / 星火燎原 - Spark 機器學習初探[DSC 2016] 系列活動:李泳泉 / 星火燎原 - Spark 機器學習初探
[DSC 2016] 系列活動:李泳泉 / 星火燎原 - Spark 機器學習初探
 
[系列活動] 資料探勘速遊 - Session4 case-studies
[系列活動] 資料探勘速遊 - Session4 case-studies[系列活動] 資料探勘速遊 - Session4 case-studies
[系列活動] 資料探勘速遊 - Session4 case-studies
 
[DSC 2016] 系列活動:李祈均 / 人類行為大數據分析
[DSC 2016] 系列活動:李祈均 / 人類行為大數據分析[DSC 2016] 系列活動:李祈均 / 人類行為大數據分析
[DSC 2016] 系列活動:李祈均 / 人類行為大數據分析
 
[系列活動] 資料探勘速遊
[系列活動] 資料探勘速遊[系列活動] 資料探勘速遊
[系列活動] 資料探勘速遊
 
[DSC 2016] 系列活動:吳牧恩、林佳緯 / 用 R 輕鬆做交易策略分析及自動下單
[DSC 2016] 系列活動:吳牧恩、林佳緯 / 用 R 輕鬆做交易策略分析及自動下單[DSC 2016] 系列活動:吳牧恩、林佳緯 / 用 R 輕鬆做交易策略分析及自動下單
[DSC 2016] 系列活動:吳牧恩、林佳緯 / 用 R 輕鬆做交易策略分析及自動下單
 
[DSC 2016] 系列活動:許懷中 / R 語言資料探勘實務
[DSC 2016] 系列活動:許懷中 / R 語言資料探勘實務[DSC 2016] 系列活動:許懷中 / R 語言資料探勘實務
[DSC 2016] 系列活動:許懷中 / R 語言資料探勘實務
 
[系列活動] 手把手教你R語言資料分析實務
[系列活動] 手把手教你R語言資料分析實務[系列活動] 手把手教你R語言資料分析實務
[系列活動] 手把手教你R語言資料分析實務
 
[系列活動] 給工程師的統計學及資料分析 123
[系列活動] 給工程師的統計學及資料分析 123[系列活動] 給工程師的統計學及資料分析 123
[系列活動] 給工程師的統計學及資料分析 123
 
[系列活動] 使用 R 語言建立自己的演算法交易事業
[系列活動] 使用 R 語言建立自己的演算法交易事業[系列活動] 使用 R 語言建立自己的演算法交易事業
[系列活動] 使用 R 語言建立自己的演算法交易事業
 
[系列活動] Machine Learning 機器學習課程
[系列活動] Machine Learning 機器學習課程[系列活動] Machine Learning 機器學習課程
[系列活動] Machine Learning 機器學習課程
 
[系列活動] 智慧製造與生產線上的資料科學 (製造資料科學:從預測性思維到處方性決策)
[系列活動] 智慧製造與生產線上的資料科學 (製造資料科學:從預測性思維到處方性決策)[系列活動] 智慧製造與生產線上的資料科學 (製造資料科學:從預測性思維到處方性決策)
[系列活動] 智慧製造與生產線上的資料科學 (製造資料科學:從預測性思維到處方性決策)
 
[系列活動] 機器學習速遊
[系列活動] 機器學習速遊[系列活動] 機器學習速遊
[系列活動] 機器學習速遊
 
[系列活動] 手把手的深度學習實務
[系列活動] 手把手的深度學習實務[系列活動] 手把手的深度學習實務
[系列活動] 手把手的深度學習實務
 
[系列活動] 智慧城市中的時空大數據應用
[系列活動] 智慧城市中的時空大數據應用[系列活動] 智慧城市中的時空大數據應用
[系列活動] 智慧城市中的時空大數據應用
 
[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用
[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用
[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用
 
SlideShare 101
SlideShare 101SlideShare 101
SlideShare 101
 

Similar to Starbucks Customer Ratings Analysis in Manhattan

PAS: The Planning Quality Framework
PAS: The Planning Quality FrameworkPAS: The Planning Quality Framework
PAS: The Planning Quality FrameworkPAS_Team
 
Social Media Monitoring/Reputation Management Consulting Project for Restaura...
Social Media Monitoring/Reputation Management Consulting Project for Restaura...Social Media Monitoring/Reputation Management Consulting Project for Restaura...
Social Media Monitoring/Reputation Management Consulting Project for Restaura...Elizabeth Kelly
 
Localyser - An Introduction
Localyser - An IntroductionLocalyser - An Introduction
Localyser - An Introductionsps:affinity
 
Measuring the Quality of Online Service - Jinyoung kim
Measuring the Quality of Online Service - Jinyoung kimMeasuring the Quality of Online Service - Jinyoung kim
Measuring the Quality of Online Service - Jinyoung kimJin Young Kim
 
Measuring and Managing Customer Satisfaction Programs
Measuring and Managing Customer Satisfaction ProgramsMeasuring and Managing Customer Satisfaction Programs
Measuring and Managing Customer Satisfaction ProgramsMARY MALASZEK
 
Us retail online store customer ratings and reviews full version
Us retail online store customer ratings and reviews full versionUs retail online store customer ratings and reviews full version
Us retail online store customer ratings and reviews full versionDana Harrold
 
Yelp's Review Filtering Algorithm Poster
Yelp's Review Filtering Algorithm PosterYelp's Review Filtering Algorithm Poster
Yelp's Review Filtering Algorithm PosterYao Yao
 
A/B Testing Best Practices - Do's and Don'ts
A/B Testing Best Practices - Do's and Don'tsA/B Testing Best Practices - Do's and Don'ts
A/B Testing Best Practices - Do's and Don'tsRamkumar Ravichandran
 
Best Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesBest Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesAlan Said
 
Support Center Back Office Model
Support Center Back Office ModelSupport Center Back Office Model
Support Center Back Office Modelbarnesrf
 
How to Benchmark Your Online Customer Experience Against Competition
How to Benchmark Your Online Customer Experience Against CompetitionHow to Benchmark Your Online Customer Experience Against Competition
How to Benchmark Your Online Customer Experience Against CompetitionUserZoom
 
Positivity Bias in Customer Satisfaction Ratings
Positivity Bias in Customer Satisfaction RatingsPositivity Bias in Customer Satisfaction Ratings
Positivity Bias in Customer Satisfaction RatingsKunwoo Park
 
Introduction to Core Assessments
Introduction to Core AssessmentsIntroduction to Core Assessments
Introduction to Core AssessmentsResolver Inc.
 
Empowering Businesses using Yelp Reviews Mining
Empowering Businesses using Yelp Reviews MiningEmpowering Businesses using Yelp Reviews Mining
Empowering Businesses using Yelp Reviews MiningVipul Munot
 

Similar to Starbucks Customer Ratings Analysis in Manhattan (20)

PAS: The Planning Quality Framework
PAS: The Planning Quality FrameworkPAS: The Planning Quality Framework
PAS: The Planning Quality Framework
 
Social Media Monitoring/Reputation Management Consulting Project for Restaura...
Social Media Monitoring/Reputation Management Consulting Project for Restaura...Social Media Monitoring/Reputation Management Consulting Project for Restaura...
Social Media Monitoring/Reputation Management Consulting Project for Restaura...
 
Localyser - An Introduction
Localyser - An IntroductionLocalyser - An Introduction
Localyser - An Introduction
 
JA, PA, Selection 2016
JA, PA, Selection 2016JA, PA, Selection 2016
JA, PA, Selection 2016
 
Measuring the Quality of Online Service - Jinyoung kim
Measuring the Quality of Online Service - Jinyoung kimMeasuring the Quality of Online Service - Jinyoung kim
Measuring the Quality of Online Service - Jinyoung kim
 
Measuring and Managing Customer Satisfaction Programs
Measuring and Managing Customer Satisfaction ProgramsMeasuring and Managing Customer Satisfaction Programs
Measuring and Managing Customer Satisfaction Programs
 
Call center assessment[1]
Call center assessment[1]Call center assessment[1]
Call center assessment[1]
 
Us retail online store customer ratings and reviews full version
Us retail online store customer ratings and reviews full versionUs retail online store customer ratings and reviews full version
Us retail online store customer ratings and reviews full version
 
Yelp's Review Filtering Algorithm Poster
Yelp's Review Filtering Algorithm PosterYelp's Review Filtering Algorithm Poster
Yelp's Review Filtering Algorithm Poster
 
Product market fit gap
Product market fit gapProduct market fit gap
Product market fit gap
 
A/B Testing Best Practices - Do's and Don'ts
A/B Testing Best Practices - Do's and Don'tsA/B Testing Best Practices - Do's and Don'ts
A/B Testing Best Practices - Do's and Don'ts
 
Best Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesBest Practices in Recommender System Challenges
Best Practices in Recommender System Challenges
 
PQF Overview
PQF OverviewPQF Overview
PQF Overview
 
How Google works
How Google worksHow Google works
How Google works
 
Support Center Back Office Model
Support Center Back Office ModelSupport Center Back Office Model
Support Center Back Office Model
 
How to Benchmark Your Online Customer Experience Against Competition
How to Benchmark Your Online Customer Experience Against CompetitionHow to Benchmark Your Online Customer Experience Against Competition
How to Benchmark Your Online Customer Experience Against Competition
 
Positivity Bias in Customer Satisfaction Ratings
Positivity Bias in Customer Satisfaction RatingsPositivity Bias in Customer Satisfaction Ratings
Positivity Bias in Customer Satisfaction Ratings
 
Introduction to Core Assessments
Introduction to Core AssessmentsIntroduction to Core Assessments
Introduction to Core Assessments
 
Empowering Businesses using Yelp Reviews Mining
Empowering Businesses using Yelp Reviews MiningEmpowering Businesses using Yelp Reviews Mining
Empowering Businesses using Yelp Reviews Mining
 
Benchmarking
BenchmarkingBenchmarking
Benchmarking
 

Recently uploaded

Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfSwapnil Therkar
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Patrick Diehl
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 

Recently uploaded (20)

Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 

Starbucks Customer Ratings Analysis in Manhattan

  • 1. Customer Opinions Analysis for Starbucks in Yelp Web Analytics, Fall 2014 Professor Yilu Zhou ISGB 7978 Team: Yixi Zhang, Xiaoshan Jin, Yi Chun Chien, Yi Ting Kao
  • 2. Agenda 1. Problem Statement 2. Project Design 3. Stage 1 - Analytic Pre-define 4. Stage 2 - Unstructured Data Analysis • Correlation Analysis (Overall rating) • How Rating Differs from Location (Overall rating) • Feature Selection (Low rating) • Python Feature Counts Algorithm (Low rating) • Definition of Top Bad Performance Areas (Low rating) • Analytics – Manhattan Visualization (Low rating) 5. Analytics Summary & Recommendation 2
  • 3. 3 Problem Statement There are 212 Starbucks stores in Manhattan. The average rating on Yelp is 2.8 stars. Some comments have Low rating with 1~2 star. Project Goal: Find out the factors causing Starbucks stores’ bad performance to ensure the highest level of customer satisfaction.
  • 5. 5 Stage 1 - Analytic Pre-define • Platform & Tool Selection: Python, Content Analyzer and JMP • Data collection: • Use Python to craw 176 Starbucks stores in Yelp • Variables: Store location, user location, user comment, user rating • Reviews Distribution: - Total review number: 3052 - Average Rating: 2.8 - 74% customers from NY; 26% customers from other places • Pre-define Complaints Categories • Product, Service, Waiting-time & Environment User Rating User Location Review#
  • 6. 6 Stage 2 - Correlation Analysis (Overall rating) • Target variable: User Rating Group :High (4,5stars) vs. Low (1,2stars) • Independent Variable: Store Area, User Location, Comment Length • Use Goodness-of-Fit Test to see correlation between target and independent variables - Comment Length & Store location correlated to User Rating Group Significant Significant
  • 7. 7 Stage 2: How Rating Differs from Location (Overall Rating) Review # Rating Why Midtown East is better than Midtown West When both area have similar numbers of review and user location? • Top 3 Bad Areas: • Lower East Side • Greenwich Village and SOHO • Chelsea and Clinton • Top 3 Good Areas: • Central Park and Murray Hill • Lower Manhattan • Inwood and Washington Heights Low Rating > 62% High Rating > 52%
  • 8. 8 Stage 2 - Feature Selection (Low rating) Assumptions: 1) All comments from Low rating only talk about negative opinions about Starbucks; 2) An index for each feature is set as Features counts numbers/Bad Comments numbers to every zip code in order to compare features based on zip code level. Content Analyzer output cleansing: Stop Words and Word Stemming. Finalized Feature list: Product – (coffee, drink, drinks, cup, latte, tea, iced, milk, food, wrong) Waiting time – (time, line, minutes, long, wait, slow, waiting, busy) Environment – (bathroom, small, clean, seating) Service – (people, service, staff, barista, baristas, rude, cashier, manager, friendly, attitude)
  • 9. 9 Stage 2 - Python Feature Counts Algorithm:(Low rating) Calculation rule: Any feature occurrence in the feature lists labels as “1”. Otherwise, “0”. • Assess every user review by Product, Service, Waiting time, and Environment features; • Group all of the feature counts based on store location(Zip Code) .
  • 10. 10 Stage 2 – Definition of Top Bad Performance Areas (Low rating) Definition Rules(%) Environment Complaint Product Complaint Service Complaint Waiting time Complaint Index Range 10.71- 60 Index Range 46.43- 100 Index Range 43.75- 100 Index Range 44.44- 100 Index Median 35.36 Index Median 73.21 Index Median 71.88 Index Median 72.22 Top Bad Performance Index Point 35 Top Bad Performance Index Point 85 Top Bad Performance Index Point 85 Top Bad Performance Index Point 65
  • 11. 11 Analytics Summary Manhattan Top Bad Performance Areas Environment Complaint Product Complaint Service Complaint Waiting Complaint Upper West Side Lower East Side Lower Manhattan Central Park and Murray Hill Chelsea and Clinton Central Park and Murray Hill Upper East Side Chelsea and Clinton Greenwich Village and Soho Upper East Side Inwood and Washington Heights Lower Manhattan N/A Inwood and Washington Heights N/A Inwood and Washington Heights N/A Central Harlem N/A
  • 12. 12 Analytics – Manhattan Visualization (Low rating) Environment Complaint Product Complaint
  • 13. 13 Analytics – Manhattan Visualization (Low rating) Waiting Time Complaint Service Complaint Midtown East has lower “Service Complaints” rate than Midtown West
  • 14. 14 Recommendations To Manager of Manhattan area: 1. The common concerns for customers in all Manhattan area are long waiting time and bad service. • Hire more cashiers and baristas based on each store’s situation (financially efficient) • Train current employees to provide more professional, flexible and efficient services in a high quality. • Establish an awards and penalty system for employees. (Attitude, Efficiency) 2. Give priority to areas with high number of reviews but relative Low rating. E.g. downtown, west midtown
  • 15. 15 Recommendations 3. Each zip code area should try to improve the top three concerns of the customers no matter what the overall rating it get. E.g. Inwood and Washington Heights
  • 18. 18 Appendix 2 Content Analyzer Output One
  • 19. 19 Appendix 3 Content Analyzer Output Two