Comparative analysis of the quality and popularity of Indian restaurants in the US
--Manzil Mudbari
Introduction
+ Since I can't share the projects I did for my previous employers, I put together
a small market research project to showcase my skill set.
Source code and Main project:
https://colab.research.google.com/drive/1tJyxqIDbrrLm9LjUfcBcgFT9RLuk-Vum
Motivation
+ In this project, I used Python to extract data and prepare a report that
evaluates the popularity and quality of Indian restaurants in three different regions of
the US.
+ I used the Yelp API with Python to fetch the relevant data and Python's
statistical packages for the analysis.
Data Fetching
+ I filtered the API responses by the keyword "Indian Restaurant" and by
location (Chicago, New York, LA), producing one table per city.
+ I used Python packages such as Pandas, JSON, and Matplotlib.
+ The process produced the following table, which you can also see in detail
in the source code linked at the bottom of the second slide.
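The fetching step described above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: it assumes the Yelp Fusion business search endpoint with `term`, `location`, and `limit` parameters and a bearer API key, and the helper names `fetch_businesses` and `to_rows` are hypothetical. The sample payload is made up so the flattening step runs without a key.

```python
import json
import urllib.parse
import urllib.request

# Assumed endpoint: Yelp Fusion business search.
YELP_SEARCH_URL = "https://api.yelp.com/v3/businesses/search"


def fetch_businesses(api_key, location, term="Indian Restaurant", limit=50):
    """Call the search endpoint once and return the parsed JSON payload."""
    query = urllib.parse.urlencode(
        {"term": term, "location": location, "limit": limit}
    )
    request = urllib.request.Request(
        f"{YELP_SEARCH_URL}?{query}",
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def to_rows(payload, city):
    """Keep only the fields the analysis uses: name, rating, review count."""
    return [
        {
            "city": city,
            "name": business.get("name"),
            "rating": business.get("rating"),
            "review_count": business.get("review_count"),
        }
        for business in payload.get("businesses", [])
    ]


# Made-up payload in the shape of a search response (not real data).
sample_payload = {
    "businesses": [
        {"name": "Example Tandoor", "rating": 4.5, "review_count": 320},
        {"name": "Example Curry House", "rating": 4.0, "review_count": 510},
    ]
}
rows = to_rows(sample_payload, "New York")
print(rows[0])
```

With a real key, `to_rows(fetch_businesses(key, "Chicago"), "Chicago")` would give one list of rows per city, which can then be loaded into a Pandas table.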
Analysis Overview
+ I started by analyzing the distribution of ratings and review counts in
each location.
+ I used customer ratings as a signal to evaluate and compare quality
in these three regions.
+ I have used both ratings and the number of ratings & reviews as a
signal of popularity.
+ Using the information from the table on the slide above, I have plotted
histograms for the number of reviews and the ratings for each city.
+ I have also generated a summary stats table at the end of it.
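The counting behind the two histograms can be sketched as below. This is only the binning step, written with the standard library; the deck's plots were drawn with Matplotlib, and the bin width of 100 and the sample values are assumptions for illustration, not the deck's data.

```python
from collections import Counter


def rating_histogram(ratings):
    """Count restaurants at each distinct rating value."""
    return dict(sorted(Counter(ratings).items()))


def review_count_histogram(review_counts, bin_width=100):
    """Count restaurants per bin, keyed by the lower edge of each bin."""
    bins = Counter((count // bin_width) * bin_width for count in review_counts)
    return dict(sorted(bins.items()))


# Hypothetical values for illustration only.
ratings = [4.0, 4.5, 4.0, 3.5, 4.5, 4.0]
review_counts = [320, 455, 380, 1200, 610, 95]

print(rating_histogram(ratings))
print(review_count_histogram(review_counts))
```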
Data Analysis: New York City
+ The histogram on the left shows the distribution
of the number of reviews per restaurant.
+ Very few restaurants have more than 1,000
reviews. Many restaurants have between 300
and 500 reviews, and a moderate number have
between 500 and 1,000.
+ But how do we know whether these reviews are
positive or negative? We cannot know for sure
without some form of sentiment analysis.
+ Looking at the bar graph of ratings on the right,
we see that most restaurants are rated either 4
or 4.5. Out of 50 restaurants, around 5 have a
3.5 rating, and every other restaurant is rated
higher.
+ The high ratings support the conjecture that
most reviews are positive. Based on the histogram
on the left, we can therefore say that Indian
restaurants are popular in New York City, and
from the graph on the right, that customers
highly approve of the overall quality of the
restaurants in the city.
SUMMARY STATS
+ The summary statistics validate
our intuition from the graphs above.
+ The mean rating across restaurants
in New York City is 4.17 and the
mean review count is about 472.
+ This indicates that, on average,
customers rate Indian restaurants
above 4 (on a scale of 5).
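A summary table like the one described here can be sketched with the standard `statistics` module. This stands in for whatever the notebook used (possibly a Pandas summary), and the `summarize` helper and sample ratings are hypothetical.

```python
from statistics import mean, median, stdev


def summarize(values):
    """Return basic summary statistics for one numeric column."""
    return {
        "count": len(values),
        "mean": round(mean(values), 2),
        "median": median(values),
        "std": round(stdev(values), 2),  # sample standard deviation
        "min": min(values),
        "max": max(values),
    }


# Hypothetical ratings for illustration only; not the deck's data.
sample_ratings = [4.0, 4.5, 4.0, 3.5, 4.5]
summary = summarize(sample_ratings)
print(summary)
```

Running the same helper on the `rating` and `review_count` columns of each city would give the per-city means quoted on the slides.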
Data Analysis: Los Angeles
We see a similar trend in Los Angeles as in New York. Both the average review count (~595) and
the average rating (4.18) are very high. By the same argument we made for New York City, the
quality and popularity of Indian restaurants in Los Angeles are also impressive.
Data Analysis: Chicago
• The mean review count is approximately 295, which is much lower than in New
York City and LA. The mean rating of Indian restaurants in Chicago is 4.06,
slightly lower than in LA and New York City.
• We can conclude that Indian restaurants in Chicago aren’t as popular as in the other two cities, and
that quality is also slightly lower based on the average rating.
Correlation between rating and the
number of reviews per restaurant: -0.1328
+ The correlation is slightly negative, but its small
magnitude and the low variance hardly
give us any room to interpret these statistics.
+ Therefore, we will compare the average ratings and review
counts across the three cities using bar graphs.
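For readers who want to reproduce a coefficient like this one, here is a minimal sketch of the Pearson correlation, assuming that is the measure reported on the slide (the deck does not say). The sample ratings and review counts are hypothetical. For scale, a coefficient of -0.1328 squared is about 0.018, so under 2% of the variance in one variable is linearly associated with the other.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sum((x - mean_x) ** 2 for x in xs)
    spread_y = sum((y - mean_y) ** 2 for y in ys)
    return covariance / sqrt(spread_x * spread_y)


# Hypothetical values for illustration only; not the deck's data.
sample_ratings = [3.5, 4.0, 4.0, 4.5, 4.5]
sample_review_counts = [900, 400, 650, 300, 500]

r = pearson(sample_ratings, sample_review_counts)
print(round(r, 4))
```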
Visual representation of combined data.
• The high average rating across all cities gives us reason to believe that, in general,
customers highly approve of Indian restaurants.
• It is hard to say what is causing the lower average number of reviews in Chicago
despite its high ratings.
• But this also gives us scope for further exploration and research.
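The gap between the two metrics is easier to see when each city's mean is expressed relative to the highest city. The sketch below uses the approximate per-city means quoted on the earlier slides; the `relative_to_max` helper is hypothetical.

```python
# Per-city means as quoted on the slides (review counts are approximate).
city_means = {
    "New York City": {"rating": 4.17, "review_count": 472},
    "Los Angeles": {"rating": 4.18, "review_count": 595},
    "Chicago": {"rating": 4.06, "review_count": 295},
}


def relative_to_max(metric):
    """Express each city's mean as a fraction of the highest city's mean."""
    top = max(stats[metric] for stats in city_means.values())
    return {
        city: round(stats[metric] / top, 2)
        for city, stats in city_means.items()
    }


rating_share = relative_to_max("rating")
review_share = relative_to_max("review_count")
print(rating_share)
print(review_share)
```

On these figures the mean ratings differ by less than 3%, while Chicago's mean review count is about half of Los Angeles's.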
Further scope of exploration.
+ We can further examine why the number of reviews is
lower in NY and Chicago even though their average
ratings are almost the same as LA's.
+ Perhaps restaurants are marketed more in one
region than in the others; in that case, we can explore data
relevant to marketing in each city.
+ Maybe it has to do with the relative distribution of the Indian
diaspora and the income distribution in different cities,
in which case we can explore income and demographic data.
+ We can also investigate more cities to see whether any of
these cities is an outlier for its region.
Conclusion:
• We don’t have sufficient data to comment
on the discrepancy in the number of reviews across
cities given their nearly identical
ratings. But it also gives us further scope to explore.
• However, the combined average review count
and average rating are approximately 445 and 4.13
respectively, which are both good numbers in the
context of the restaurant business.
• These two statistics show that Indian restaurants
are very popular across the three cities.
• Although generalizing this result to all cities
would be a big jump, this analysis
does give us an idea of the general vibe
surrounding the Indian restaurant business in the
US.
Thank You!
+ Manzil Mudbari
+ manzil@mudbari.com
+ San Jose, California
