SlideShare a Scribd company logo
1 of 19
Customer Opinions Analysis
for Starbucks in Yelp
Web Analytics, Fall 2014
Professor Yilu Zhou
ISGB 7978
Team:
Yixi Zhang, Xiaoshan Jin, Yi
Chun Chien, Yi Ting Kao
Agenda
1. Problem Statement
2. Project Design
3. Stage 1 - Analytic Pre-define
4. Stage 2 - Unstructured Data Analysis
• Correlation Analysis (Overall rating)
• How Rating Differs from Location (Overall rating)
• Feature Selection (Low rating)
• Python Feature Counts Algorithm (Low rating)
• Definition of Top Bad Performance Areas (Low rating)
• Analytics – Manhattan Visualization (Low rating)
5. Analytics Summary & Recommendation
2
3
Problem Statement
There are 212 Starbucks stores in Manhattan. The average rating on
Yelp is 2.8 stars. Some comments have Low rating with 1~2 star.
Project Goal:
Find out the factors causing Starbucks stores’ bad performance to
ensure the highest level of customer satisfaction.
4
Project Design
5
Stage 1 - Analytic Pre-define
• Platform & Tool Selection: Python, Content Analyzer and JMP
• Data collection:
• Use Python to craw 176 Starbucks stores in Yelp
• Variables: Store location, user location, user comment, user rating
• Reviews Distribution:
- Total review number: 3052
- Average Rating: 2.8
- 74% customers from NY;
26% customers from other places
• Pre-define Complaints Categories
• Product, Service, Waiting-time & Environment
User Rating User Location
Review#
6
Stage 2 - Correlation Analysis (Overall rating)
• Target variable: User Rating Group :High (4,5stars) vs. Low (1,2stars)
• Independent Variable: Store Area, User Location, Comment Length
• Use Goodness-of-Fit Test to see correlation between target and independent variables
- Comment Length &
Store location correlated
to User Rating Group
Significant
Significant
7
Stage 2: How Rating Differs from Location (Overall Rating)
Review # Rating
Why Midtown East is better than Midtown West
When both area have similar numbers of review and
user location?
• Top 3 Bad Areas:
• Lower East Side
• Greenwich Village and SOHO
• Chelsea and Clinton
• Top 3 Good Areas:
• Central Park and Murray Hill
• Lower Manhattan
• Inwood and Washington Heights
Low
Rating
> 62%
High
Rating
> 52%
8
Stage 2 - Feature Selection (Low rating)
Assumptions:
1) All comments from Low rating only talk about negative opinions about Starbucks;
2) An index for each feature is set as Features counts numbers/Bad Comments numbers to
every zip code in order to compare features based on zip code level.
Content Analyzer output cleansing: Stop Words and Word Stemming.
Finalized Feature list:
Product – (coffee, drink, drinks, cup, latte, tea, iced, milk, food, wrong)
Waiting time – (time, line, minutes, long, wait, slow, waiting, busy)
Environment – (bathroom, small, clean, seating)
Service – (people, service, staff, barista, baristas, rude, cashier, manager, friendly, attitude)
9
Stage 2 - Python Feature Counts Algorithm:(Low rating)
Calculation rule:
Any feature occurrence in the feature lists labels as “1”. Otherwise, “0”.
• Assess every user review by Product,
Service, Waiting time, and Environment
features;
• Group all of the feature counts based on
store location(Zip Code) .
10
Stage 2 – Definition of Top Bad Performance Areas
(Low rating)
Definition Rules(%)
Environment Complaint Product Complaint Service Complaint Waiting time Complaint
Index Range 10.71-
60
Index Range 46.43-
100
Index Range 43.75-
100
Index Range 44.44-
100
Index Median 35.36 Index Median 73.21 Index Median 71.88 Index Median 72.22
Top Bad
Performance
Index Point
35 Top Bad
Performance
Index Point
85 Top Bad
Performance
Index Point
85 Top Bad Performance
Index Point
65
11
Analytics Summary
Manhattan Top Bad Performance Areas
Environment
Complaint
Product Complaint Service Complaint Waiting Complaint
Upper West Side Lower East Side Lower Manhattan Central Park and
Murray Hill
Chelsea and Clinton Central Park and
Murray Hill
Upper East Side Chelsea and Clinton
Greenwich Village and
Soho
Upper East Side Inwood and
Washington
Heights
Lower Manhattan
N/A Inwood and Washington
Heights
N/A Inwood and
Washington Heights
N/A Central Harlem N/A
12
Analytics – Manhattan Visualization (Low rating)
Environment Complaint Product Complaint
13
Analytics – Manhattan Visualization (Low rating)
Waiting Time Complaint Service Complaint
Midtown East has lower “Service
Complaints” rate than Midtown
West
14
Recommendations
To Manager of Manhattan area:
1. The common concerns for customers in all Manhattan
area are long waiting time and bad service.
• Hire more cashiers and baristas based on each store’s
situation (financially efficient)
• Train current employees to provide more professional,
flexible and efficient services in a high quality.
• Establish an awards and penalty system for employees.
(Attitude, Efficiency)
2. Give priority to areas with high number of reviews but
relative Low rating. E.g. downtown, west midtown
15
Recommendations
3. Each zip code area should try to
improve the top three concerns of the
customers no matter what the overall
rating it get.
E.g. Inwood and Washington Heights
16
Thank You
Q&A
17
Appendix 1 Manhattan Zip Code
18
Appendix 2 Content Analyzer Output One
19
Appendix 3 Content Analyzer Output Two

More Related Content

Viewers also liked

[DSC 2016] 系列活動:李泳泉 / 星火燎原 - Spark 機器學習初探
[DSC 2016] 系列活動:李泳泉 / 星火燎原 - Spark 機器學習初探[DSC 2016] 系列活動:李泳泉 / 星火燎原 - Spark 機器學習初探
[DSC 2016] 系列活動:李泳泉 / 星火燎原 - Spark 機器學習初探台灣資料科學年會
 
[系列活動] 資料探勘速遊 - Session4 case-studies
[系列活動] 資料探勘速遊 - Session4 case-studies[系列活動] 資料探勘速遊 - Session4 case-studies
[系列活動] 資料探勘速遊 - Session4 case-studies台灣資料科學年會
 
[DSC 2016] 系列活動:李祈均 / 人類行為大數據分析
[DSC 2016] 系列活動:李祈均 / 人類行為大數據分析[DSC 2016] 系列活動:李祈均 / 人類行為大數據分析
[DSC 2016] 系列活動:李祈均 / 人類行為大數據分析台灣資料科學年會
 
[DSC 2016] 系列活動:吳牧恩、林佳緯 / 用 R 輕鬆做交易策略分析及自動下單
[DSC 2016] 系列活動:吳牧恩、林佳緯 / 用 R 輕鬆做交易策略分析及自動下單[DSC 2016] 系列活動:吳牧恩、林佳緯 / 用 R 輕鬆做交易策略分析及自動下單
[DSC 2016] 系列活動:吳牧恩、林佳緯 / 用 R 輕鬆做交易策略分析及自動下單台灣資料科學年會
 
[DSC 2016] 系列活動:許懷中 / R 語言資料探勘實務
[DSC 2016] 系列活動:許懷中 / R 語言資料探勘實務[DSC 2016] 系列活動:許懷中 / R 語言資料探勘實務
[DSC 2016] 系列活動:許懷中 / R 語言資料探勘實務台灣資料科學年會
 
[系列活動] 手把手教你R語言資料分析實務
[系列活動] 手把手教你R語言資料分析實務[系列活動] 手把手教你R語言資料分析實務
[系列活動] 手把手教你R語言資料分析實務台灣資料科學年會
 
[系列活動] 給工程師的統計學及資料分析 123
[系列活動] 給工程師的統計學及資料分析 123[系列活動] 給工程師的統計學及資料分析 123
[系列活動] 給工程師的統計學及資料分析 123台灣資料科學年會
 
[系列活動] 使用 R 語言建立自己的演算法交易事業
[系列活動] 使用 R 語言建立自己的演算法交易事業[系列活動] 使用 R 語言建立自己的演算法交易事業
[系列活動] 使用 R 語言建立自己的演算法交易事業台灣資料科學年會
 
[系列活動] Machine Learning 機器學習課程
[系列活動] Machine Learning 機器學習課程[系列活動] Machine Learning 機器學習課程
[系列活動] Machine Learning 機器學習課程台灣資料科學年會
 
[系列活動] 智慧製造與生產線上的資料科學 (製造資料科學:從預測性思維到處方性決策)
[系列活動] 智慧製造與生產線上的資料科學 (製造資料科學:從預測性思維到處方性決策)[系列活動] 智慧製造與生產線上的資料科學 (製造資料科學:從預測性思維到處方性決策)
[系列活動] 智慧製造與生產線上的資料科學 (製造資料科學:從預測性思維到處方性決策)台灣資料科學年會
 
[系列活動] 手把手的深度學習實務
[系列活動] 手把手的深度學習實務[系列活動] 手把手的深度學習實務
[系列活動] 手把手的深度學習實務台灣資料科學年會
 
[系列活動] 智慧城市中的時空大數據應用
[系列活動] 智慧城市中的時空大數據應用[系列活動] 智慧城市中的時空大數據應用
[系列活動] 智慧城市中的時空大數據應用台灣資料科學年會
 
[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用
[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用
[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用台灣資料科學年會
 

Viewers also liked (16)

[DSC 2016] 系列活動:李泳泉 / 星火燎原 - Spark 機器學習初探
[DSC 2016] 系列活動:李泳泉 / 星火燎原 - Spark 機器學習初探[DSC 2016] 系列活動:李泳泉 / 星火燎原 - Spark 機器學習初探
[DSC 2016] 系列活動:李泳泉 / 星火燎原 - Spark 機器學習初探
 
[系列活動] 資料探勘速遊 - Session4 case-studies
[系列活動] 資料探勘速遊 - Session4 case-studies[系列活動] 資料探勘速遊 - Session4 case-studies
[系列活動] 資料探勘速遊 - Session4 case-studies
 
[DSC 2016] 系列活動:李祈均 / 人類行為大數據分析
[DSC 2016] 系列活動:李祈均 / 人類行為大數據分析[DSC 2016] 系列活動:李祈均 / 人類行為大數據分析
[DSC 2016] 系列活動:李祈均 / 人類行為大數據分析
 
[系列活動] 資料探勘速遊
[系列活動] 資料探勘速遊[系列活動] 資料探勘速遊
[系列活動] 資料探勘速遊
 
[DSC 2016] 系列活動:吳牧恩、林佳緯 / 用 R 輕鬆做交易策略分析及自動下單
[DSC 2016] 系列活動:吳牧恩、林佳緯 / 用 R 輕鬆做交易策略分析及自動下單[DSC 2016] 系列活動:吳牧恩、林佳緯 / 用 R 輕鬆做交易策略分析及自動下單
[DSC 2016] 系列活動:吳牧恩、林佳緯 / 用 R 輕鬆做交易策略分析及自動下單
 
[DSC 2016] 系列活動:許懷中 / R 語言資料探勘實務
[DSC 2016] 系列活動:許懷中 / R 語言資料探勘實務[DSC 2016] 系列活動:許懷中 / R 語言資料探勘實務
[DSC 2016] 系列活動:許懷中 / R 語言資料探勘實務
 
[系列活動] 手把手教你R語言資料分析實務
[系列活動] 手把手教你R語言資料分析實務[系列活動] 手把手教你R語言資料分析實務
[系列活動] 手把手教你R語言資料分析實務
 
[系列活動] 給工程師的統計學及資料分析 123
[系列活動] 給工程師的統計學及資料分析 123[系列活動] 給工程師的統計學及資料分析 123
[系列活動] 給工程師的統計學及資料分析 123
 
[系列活動] 使用 R 語言建立自己的演算法交易事業
[系列活動] 使用 R 語言建立自己的演算法交易事業[系列活動] 使用 R 語言建立自己的演算法交易事業
[系列活動] 使用 R 語言建立自己的演算法交易事業
 
[系列活動] Machine Learning 機器學習課程
[系列活動] Machine Learning 機器學習課程[系列活動] Machine Learning 機器學習課程
[系列活動] Machine Learning 機器學習課程
 
[系列活動] 智慧製造與生產線上的資料科學 (製造資料科學:從預測性思維到處方性決策)
[系列活動] 智慧製造與生產線上的資料科學 (製造資料科學:從預測性思維到處方性決策)[系列活動] 智慧製造與生產線上的資料科學 (製造資料科學:從預測性思維到處方性決策)
[系列活動] 智慧製造與生產線上的資料科學 (製造資料科學:從預測性思維到處方性決策)
 
[系列活動] 機器學習速遊
[系列活動] 機器學習速遊[系列活動] 機器學習速遊
[系列活動] 機器學習速遊
 
[系列活動] 手把手的深度學習實務
[系列活動] 手把手的深度學習實務[系列活動] 手把手的深度學習實務
[系列活動] 手把手的深度學習實務
 
[系列活動] 智慧城市中的時空大數據應用
[系列活動] 智慧城市中的時空大數據應用[系列活動] 智慧城市中的時空大數據應用
[系列活動] 智慧城市中的時空大數據應用
 
[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用
[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用
[DSC x TAAI 2016] 林守德 / 人工智慧與機器學習在推薦系統上的應用
 
SlideShare 101
SlideShare 101SlideShare 101
SlideShare 101
 

Similar to Starbucks Customer Ratings Analysis in Manhattan

PAS: The Planning Quality Framework
PAS: The Planning Quality FrameworkPAS: The Planning Quality Framework
PAS: The Planning Quality FrameworkPAS_Team
 
Social Media Monitoring/Reputation Management Consulting Project for Restaura...
Social Media Monitoring/Reputation Management Consulting Project for Restaura...Social Media Monitoring/Reputation Management Consulting Project for Restaura...
Social Media Monitoring/Reputation Management Consulting Project for Restaura...Elizabeth Kelly
 
Localyser - An Introduction
Localyser - An IntroductionLocalyser - An Introduction
Localyser - An Introductionsps:affinity
 
Measuring the Quality of Online Service - Jinyoung kim
Measuring the Quality of Online Service - Jinyoung kimMeasuring the Quality of Online Service - Jinyoung kim
Measuring the Quality of Online Service - Jinyoung kimJin Young Kim
 
Measuring and Managing Customer Satisfaction Programs
Measuring and Managing Customer Satisfaction ProgramsMeasuring and Managing Customer Satisfaction Programs
Measuring and Managing Customer Satisfaction ProgramsMARY MALASZEK
 
Us retail online store customer ratings and reviews full version
Us retail online store customer ratings and reviews full versionUs retail online store customer ratings and reviews full version
Us retail online store customer ratings and reviews full versionDana Harrold
 
Yelp's Review Filtering Algorithm Poster
Yelp's Review Filtering Algorithm PosterYelp's Review Filtering Algorithm Poster
Yelp's Review Filtering Algorithm PosterYao Yao
 
A/B Testing Best Practices - Do's and Don'ts
A/B Testing Best Practices - Do's and Don'tsA/B Testing Best Practices - Do's and Don'ts
A/B Testing Best Practices - Do's and Don'tsRamkumar Ravichandran
 
Best Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesBest Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesAlan Said
 
Support Center Back Office Model
Support Center Back Office ModelSupport Center Back Office Model
Support Center Back Office Modelbarnesrf
 
How to Benchmark Your Online Customer Experience Against Competition
How to Benchmark Your Online Customer Experience Against CompetitionHow to Benchmark Your Online Customer Experience Against Competition
How to Benchmark Your Online Customer Experience Against CompetitionUserZoom
 
Positivity Bias in Customer Satisfaction Ratings
Positivity Bias in Customer Satisfaction RatingsPositivity Bias in Customer Satisfaction Ratings
Positivity Bias in Customer Satisfaction RatingsKunwoo Park
 
Introduction to Core Assessments
Introduction to Core AssessmentsIntroduction to Core Assessments
Introduction to Core AssessmentsResolver Inc.
 
Empowering Businesses using Yelp Reviews Mining
Empowering Businesses using Yelp Reviews MiningEmpowering Businesses using Yelp Reviews Mining
Empowering Businesses using Yelp Reviews MiningVipul Munot
 

Similar to Starbucks Customer Ratings Analysis in Manhattan (20)

PAS: The Planning Quality Framework
PAS: The Planning Quality FrameworkPAS: The Planning Quality Framework
PAS: The Planning Quality Framework
 
Social Media Monitoring/Reputation Management Consulting Project for Restaura...
Social Media Monitoring/Reputation Management Consulting Project for Restaura...Social Media Monitoring/Reputation Management Consulting Project for Restaura...
Social Media Monitoring/Reputation Management Consulting Project for Restaura...
 
Localyser - An Introduction
Localyser - An IntroductionLocalyser - An Introduction
Localyser - An Introduction
 
JA, PA, Selection 2016
JA, PA, Selection 2016JA, PA, Selection 2016
JA, PA, Selection 2016
 
Measuring the Quality of Online Service - Jinyoung kim
Measuring the Quality of Online Service - Jinyoung kimMeasuring the Quality of Online Service - Jinyoung kim
Measuring the Quality of Online Service - Jinyoung kim
 
Measuring and Managing Customer Satisfaction Programs
Measuring and Managing Customer Satisfaction ProgramsMeasuring and Managing Customer Satisfaction Programs
Measuring and Managing Customer Satisfaction Programs
 
Call center assessment[1]
Call center assessment[1]Call center assessment[1]
Call center assessment[1]
 
Us retail online store customer ratings and reviews full version
Us retail online store customer ratings and reviews full versionUs retail online store customer ratings and reviews full version
Us retail online store customer ratings and reviews full version
 
Yelp's Review Filtering Algorithm Poster
Yelp's Review Filtering Algorithm PosterYelp's Review Filtering Algorithm Poster
Yelp's Review Filtering Algorithm Poster
 
Product market fit gap
Product market fit gapProduct market fit gap
Product market fit gap
 
A/B Testing Best Practices - Do's and Don'ts
A/B Testing Best Practices - Do's and Don'tsA/B Testing Best Practices - Do's and Don'ts
A/B Testing Best Practices - Do's and Don'ts
 
Best Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesBest Practices in Recommender System Challenges
Best Practices in Recommender System Challenges
 
PQF Overview
PQF OverviewPQF Overview
PQF Overview
 
How Google works
How Google worksHow Google works
How Google works
 
Support Center Back Office Model
Support Center Back Office ModelSupport Center Back Office Model
Support Center Back Office Model
 
How to Benchmark Your Online Customer Experience Against Competition
How to Benchmark Your Online Customer Experience Against CompetitionHow to Benchmark Your Online Customer Experience Against Competition
How to Benchmark Your Online Customer Experience Against Competition
 
Positivity Bias in Customer Satisfaction Ratings
Positivity Bias in Customer Satisfaction RatingsPositivity Bias in Customer Satisfaction Ratings
Positivity Bias in Customer Satisfaction Ratings
 
Introduction to Core Assessments
Introduction to Core AssessmentsIntroduction to Core Assessments
Introduction to Core Assessments
 
Empowering Businesses using Yelp Reviews Mining
Empowering Businesses using Yelp Reviews MiningEmpowering Businesses using Yelp Reviews Mining
Empowering Businesses using Yelp Reviews Mining
 
Benchmarking
BenchmarkingBenchmarking
Benchmarking
 

Recently uploaded

Genomic DNA And Complementary DNA Libraries construction.
Genomic DNA And Complementary DNA Libraries construction.Genomic DNA And Complementary DNA Libraries construction.
Genomic DNA And Complementary DNA Libraries construction.k64182334
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physicsvishikhakeshava1
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trssuser06f238
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tantaDashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tantaPraksha3
 
Recombination DNA Technology (Microinjection)
Recombination DNA Technology (Microinjection)Recombination DNA Technology (Microinjection)
Recombination DNA Technology (Microinjection)Jshifa
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfnehabiju2046
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 

Recently uploaded (20)

Genomic DNA And Complementary DNA Libraries construction.
Genomic DNA And Complementary DNA Libraries construction.Genomic DNA And Complementary DNA Libraries construction.
Genomic DNA And Complementary DNA Libraries construction.
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physics
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 tr
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tantaDashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
 
Recombination DNA Technology (Microinjection)
Recombination DNA Technology (Microinjection)Recombination DNA Technology (Microinjection)
Recombination DNA Technology (Microinjection)
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdf
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 

Starbucks Customer Ratings Analysis in Manhattan

  • 1. Customer Opinions Analysis for Starbucks in Yelp Web Analytics, Fall 2014 Professor Yilu Zhou ISGB 7978 Team: Yixi Zhang, Xiaoshan Jin, Yi Chun Chien, Yi Ting Kao
  • 2. Agenda 1. Problem Statement 2. Project Design 3. Stage 1 - Analytic Pre-define 4. Stage 2 - Unstructured Data Analysis • Correlation Analysis (Overall rating) • How Rating Differs from Location (Overall rating) • Feature Selection (Low rating) • Python Feature Counts Algorithm (Low rating) • Definition of Top Bad Performance Areas (Low rating) • Analytics – Manhattan Visualization (Low rating) 5. Analytics Summary & Recommendation 2
  • 3. 3 Problem Statement There are 212 Starbucks stores in Manhattan. The average rating on Yelp is 2.8 stars. Some comments have Low rating with 1~2 star. Project Goal: Find out the factors causing Starbucks stores’ bad performance to ensure the highest level of customer satisfaction.
  • 5. 5 Stage 1 - Analytic Pre-define • Platform & Tool Selection: Python, Content Analyzer and JMP • Data collection: • Use Python to craw 176 Starbucks stores in Yelp • Variables: Store location, user location, user comment, user rating • Reviews Distribution: - Total review number: 3052 - Average Rating: 2.8 - 74% customers from NY; 26% customers from other places • Pre-define Complaints Categories • Product, Service, Waiting-time & Environment User Rating User Location Review#
  • 6. 6 Stage 2 - Correlation Analysis (Overall rating) • Target variable: User Rating Group :High (4,5stars) vs. Low (1,2stars) • Independent Variable: Store Area, User Location, Comment Length • Use Goodness-of-Fit Test to see correlation between target and independent variables - Comment Length & Store location correlated to User Rating Group Significant Significant
  • 7. 7 Stage 2: How Rating Differs from Location (Overall Rating) Review # Rating Why Midtown East is better than Midtown West When both area have similar numbers of review and user location? • Top 3 Bad Areas: • Lower East Side • Greenwich Village and SOHO • Chelsea and Clinton • Top 3 Good Areas: • Central Park and Murray Hill • Lower Manhattan • Inwood and Washington Heights Low Rating > 62% High Rating > 52%
  • 8. 8 Stage 2 - Feature Selection (Low rating) Assumptions: 1) All comments from Low rating only talk about negative opinions about Starbucks; 2) An index for each feature is set as Features counts numbers/Bad Comments numbers to every zip code in order to compare features based on zip code level. Content Analyzer output cleansing: Stop Words and Word Stemming. Finalized Feature list: Product – (coffee, drink, drinks, cup, latte, tea, iced, milk, food, wrong) Waiting time – (time, line, minutes, long, wait, slow, waiting, busy) Environment – (bathroom, small, clean, seating) Service – (people, service, staff, barista, baristas, rude, cashier, manager, friendly, attitude)
  • 9. 9 Stage 2 - Python Feature Counts Algorithm:(Low rating) Calculation rule: Any feature occurrence in the feature lists labels as “1”. Otherwise, “0”. • Assess every user review by Product, Service, Waiting time, and Environment features; • Group all of the feature counts based on store location(Zip Code) .
  • 10. 10 Stage 2 – Definition of Top Bad Performance Areas (Low rating) Definition Rules(%) Environment Complaint Product Complaint Service Complaint Waiting time Complaint Index Range 10.71- 60 Index Range 46.43- 100 Index Range 43.75- 100 Index Range 44.44- 100 Index Median 35.36 Index Median 73.21 Index Median 71.88 Index Median 72.22 Top Bad Performance Index Point 35 Top Bad Performance Index Point 85 Top Bad Performance Index Point 85 Top Bad Performance Index Point 65
  • 11. 11 Analytics Summary Manhattan Top Bad Performance Areas Environment Complaint Product Complaint Service Complaint Waiting Complaint Upper West Side Lower East Side Lower Manhattan Central Park and Murray Hill Chelsea and Clinton Central Park and Murray Hill Upper East Side Chelsea and Clinton Greenwich Village and Soho Upper East Side Inwood and Washington Heights Lower Manhattan N/A Inwood and Washington Heights N/A Inwood and Washington Heights N/A Central Harlem N/A
  • 12. 12 Analytics – Manhattan Visualization (Low rating) Environment Complaint Product Complaint
  • 13. 13 Analytics – Manhattan Visualization (Low rating) Waiting Time Complaint Service Complaint Midtown East has lower “Service Complaints” rate than Midtown West
  • 14. 14 Recommendations To Manager of Manhattan area: 1. The common concerns for customers in all Manhattan area are long waiting time and bad service. • Hire more cashiers and baristas based on each store’s situation (financially efficient) • Train current employees to provide more professional, flexible and efficient services in a high quality. • Establish an awards and penalty system for employees. (Attitude, Efficiency) 2. Give priority to areas with high number of reviews but relative Low rating. E.g. downtown, west midtown
  • 15. 15 Recommendations 3. Each zip code area should try to improve the top three concerns of the customers no matter what the overall rating it get. E.g. Inwood and Washington Heights
  • 18. 18 Appendix 2 Content Analyzer Output One
  • 19. 19 Appendix 3 Content Analyzer Output Two