SlideShare a Scribd company logo
1 of 13
Comparative analysis
on the quality and
popularity of the
Indian restaurants in
the US.
--Manzil Mudbari
Introduction
+ Since I couldn’t share projects I did for my previous employers, I have done
a mini-Market Research to showcase my skillset.
Source code and Main project:
https://colab.research.google.com/drive/1tJyxqIDbrrLm9LjUfcBcgFT9RLuk-Vum
Motivation
+ In this project, I have used Python to extract information and prepared a report to
evaluate the popularity and quality of Indian restaurants in three different regions of
America.
+ I used Yelp API and Python to fetch pertinent information and used Python's
statistical programming packages for analysis.
Data Fetching
+ I have filtered API responses for the keyword "Indian Restaurant" and
location (Chicago, New York, LA) in three different tables.
+ I used Python packages such as Pandas, JSON, Matplotlib, etc.
+ I got the following table out of the process, which you can also see in detail
in the source code provided at the bottom of the second slide.
Analysis Overview
+ I started by analyzing the distribution of ratings and review counts in
each location.
+ I used customer ratings as a signal to evaluate and compare quality
in these three regions.
+ I have used both ratings and the number of ratings & reviews as a
signal of popularity.
+ Using the information from the table on the slide above, I have plotted
histograms for the number of reviews and the ratings for each city.
+ I have also generated a summary stats table at the end of it.
Data Analysis: New York City
+ Histogram on the left side shows the distribution
of the number of reviews for restaurants.
+ We can observe that there are very few
restaurants that have more than 1000 reviews. A
lot of the restaurants have reviews between 300-
500. A moderate number of restaurants have
reviews between 500-100.
+ But how do we know these reviews are positive or
negative? We cannot exactly know for sure until
we do some form of sentiment analysis.
+ So, we look at the right bar graph showing ratings,
we see most of the restaurants have ratings of
either 4 or 4.5. Out of 50 restaurants, around 5
have a 3.5 rating, and every other restaurant are
ranked higher.
+ As discussed earlier, the high ratings do back the
conjecture that most reviews are positive and
hence, based on the histogram on the left, we can
say that the Indian restaurants are popular in New
York City. While, from the right graph, we can say
that the customers highly approve of the overall
quality of the restaurants in the city.
SUMMARY STATS
+ The summary statistics just validates
our intuition from the graphs above.
+ The mean rating across restaurants
in New York City is 4.17 and the
mean review count is around ~472.
+ This indicates there is high number
of people rating Indian restaurants
more than 4 (on a scale of 5.)
Data Analysis: Los Angeles
We see similar a trend in Los Angeles as we did in New York. Both the average review count (~595) and
the ratings (4.18) are very high. With the same line of argument that we discussed for New York City, the
quality and popularity of Indian restaurants in Los Angeles are also impressive.
Data Analysis: Chicago
• We can see that the mean review count is approximately 295 which is much lower than that of New
York City and LA. In terms of ratings, the mean rating of Chicago Indian restaurants is 4.06, which is a
slight decrease compared to that of LA and New York City.
• We can conclude that Indian restaurants in Chicago aren’t as popular as in the other two cities, and
there is also a slight decrease in quality based on the average rating.
+ The correlation is slightly negative. But the small
magnitude of the correlation and the low variance hardly
gives us any room for interpretation of these statistics.
+ Therefore, we will compare the average ratings and review
counts across three cities by using bar graphs.
Correlation between rating and the
number of review per restaurants: -0.1328
Visual representation of combined data.
• The high average rating across all cities gives us reason to believe that, in general,
customers highly approve of Indian restaurants.
• It is hard to say what is causing this lower number of average reviews in Chicago
despite having high ratings.
• But this also gives us further scope of exploration and research.
+ We can further evaluate the difference between the low
number of reviews in NY and Chicago despite having
almost similar average ratings as LA.
+ Perhaps the restaurant business is marketed more in one
region than the others, in that case we can explore data
relevant to marketing in each cities.
+ Maybe, it has to do with the relative distribution of Indian
diaspora and the income distribution in different cities,
where we can explore income and demographic data.
+ We can also always investigate more cities to see if any of
these cities is an outliers for their respective regions.
Further scope of exploration.
Conclusion:
• We don’t have sufficient data to comment anything
on the discrepancy in the number of reviews in
different cities compared to their relatively same
ratings. But it also gives us further scope to explore.
• However, the combined the average review count
and average rating is approximately 445 and 4.13
respectively, which are both good numbers in the
context of restaurant business.
• These two statistics show that Indian restaurants
are very popular across three cities.
• Although generalizing this result across all cities
would be a big jump, nevertheless, this analysis
does give us an idea about the general vibe
surrounding the Indian restaurant business in the
US.
Thank You!
+ Manzil Mudbari
+ manzil@mudbari.com
+ San Jose, California

More Related Content

Similar to Indian Restaurant Quality and Popularity in US Cities

Intro to Bubble Charts by BECKON
Intro to Bubble Charts by BECKONIntro to Bubble Charts by BECKON
Intro to Bubble Charts by BECKONAmanda Roberts
 
Text Data Mining and Predictive Modeling of Online Reviews
Text Data Mining and Predictive Modeling of Online ReviewsText Data Mining and Predictive Modeling of Online Reviews
Text Data Mining and Predictive Modeling of Online ReviewsMark Chesney
 
10-Point-Guide-Digital-eCommerce_LucidFusion
10-Point-Guide-Digital-eCommerce_LucidFusion10-Point-Guide-Digital-eCommerce_LucidFusion
10-Point-Guide-Digital-eCommerce_LucidFusionMark Stimpfig
 
Restaurants of Seoul - "likes" prediction report
Restaurants of Seoul - "likes" prediction reportRestaurants of Seoul - "likes" prediction report
Restaurants of Seoul - "likes" prediction reportAmelia Choi
 
How to rank higher on Yelp
How to rank higher on YelpHow to rank higher on Yelp
How to rank higher on YelpSteven Moody
 
College Essay Writers Block. Online assignment writing service.
College Essay Writers Block. Online assignment writing service.College Essay Writers Block. Online assignment writing service.
College Essay Writers Block. Online assignment writing service.Erica Spivey
 
Sentiment analysis of Restaurant reviews ppt
Sentiment analysis of Restaurant reviews pptSentiment analysis of Restaurant reviews ppt
Sentiment analysis of Restaurant reviews pptbhaskargani46
 
Yelp's Review Filtering Algorithm Paper
Yelp's Review Filtering Algorithm PaperYelp's Review Filtering Algorithm Paper
Yelp's Review Filtering Algorithm PaperYao Yao
 
ACC201 (MyEducator) Course Project - OverviewFor your Course Pro.docx
ACC201 (MyEducator) Course Project - OverviewFor your Course Pro.docxACC201 (MyEducator) Course Project - OverviewFor your Course Pro.docx
ACC201 (MyEducator) Course Project - OverviewFor your Course Pro.docxbartholomeocoombs
 
APA Style Main Body And In Text Citations
APA Style Main Body And In Text CitationsAPA Style Main Body And In Text Citations
APA Style Main Body And In Text CitationsKerry Lewis
 
Starbucks Data Analysis
Starbucks Data AnalysisStarbucks Data Analysis
Starbucks Data Analysiscrmowbray
 
Consumer Market Research with ReferenceUSA
Consumer Market Research with ReferenceUSAConsumer Market Research with ReferenceUSA
Consumer Market Research with ReferenceUSALeah Jackson, PMP
 
Andrea Alajbegovic CBRF Poster copy
Andrea Alajbegovic CBRF Poster copyAndrea Alajbegovic CBRF Poster copy
Andrea Alajbegovic CBRF Poster copyAndrea Alajbegović
 
SMX Melbourne 2012 - Auditing PPC Campaigns - Leigh Hanney
SMX Melbourne 2012 - Auditing PPC Campaigns - Leigh HanneySMX Melbourne 2012 - Auditing PPC Campaigns - Leigh Hanney
SMX Melbourne 2012 - Auditing PPC Campaigns - Leigh HanneyLeigh Hanney
 
PPT Chapter 1_Stat 1.pptx
PPT Chapter 1_Stat 1.pptxPPT Chapter 1_Stat 1.pptx
PPT Chapter 1_Stat 1.pptxDiyPro
 

Similar to Indian Restaurant Quality and Popularity in US Cities (20)

Intro to Bubble Charts by BECKON
Intro to Bubble Charts by BECKONIntro to Bubble Charts by BECKON
Intro to Bubble Charts by BECKON
 
Text Data Mining and Predictive Modeling of Online Reviews
Text Data Mining and Predictive Modeling of Online ReviewsText Data Mining and Predictive Modeling of Online Reviews
Text Data Mining and Predictive Modeling of Online Reviews
 
10-Point-Guide-Digital-eCommerce_LucidFusion
10-Point-Guide-Digital-eCommerce_LucidFusion10-Point-Guide-Digital-eCommerce_LucidFusion
10-Point-Guide-Digital-eCommerce_LucidFusion
 
Restaurants of Seoul - "likes" prediction report
Restaurants of Seoul - "likes" prediction reportRestaurants of Seoul - "likes" prediction report
Restaurants of Seoul - "likes" prediction report
 
How to rank higher on Yelp
How to rank higher on YelpHow to rank higher on Yelp
How to rank higher on Yelp
 
Market Research Analysis
Market Research AnalysisMarket Research Analysis
Market Research Analysis
 
College Essay Writers Block. Online assignment writing service.
College Essay Writers Block. Online assignment writing service.College Essay Writers Block. Online assignment writing service.
College Essay Writers Block. Online assignment writing service.
 
Sentiment analysis of Restaurant reviews ppt
Sentiment analysis of Restaurant reviews pptSentiment analysis of Restaurant reviews ppt
Sentiment analysis of Restaurant reviews ppt
 
Yelp's Review Filtering Algorithm Paper
Yelp's Review Filtering Algorithm PaperYelp's Review Filtering Algorithm Paper
Yelp's Review Filtering Algorithm Paper
 
ACC201 (MyEducator) Course Project - OverviewFor your Course Pro.docx
ACC201 (MyEducator) Course Project - OverviewFor your Course Pro.docxACC201 (MyEducator) Course Project - OverviewFor your Course Pro.docx
ACC201 (MyEducator) Course Project - OverviewFor your Course Pro.docx
 
Yelp Presentation
Yelp PresentationYelp Presentation
Yelp Presentation
 
APA Style Main Body And In Text Citations
APA Style Main Body And In Text CitationsAPA Style Main Body And In Text Citations
APA Style Main Body And In Text Citations
 
Starbucks Data Analysis
Starbucks Data AnalysisStarbucks Data Analysis
Starbucks Data Analysis
 
Consumer Market Research with ReferenceUSA
Consumer Market Research with ReferenceUSAConsumer Market Research with ReferenceUSA
Consumer Market Research with ReferenceUSA
 
Get your Yelp on!
Get your Yelp on!Get your Yelp on!
Get your Yelp on!
 
Andrea Alajbegovic CBRF Poster copy
Andrea Alajbegovic CBRF Poster copyAndrea Alajbegovic CBRF Poster copy
Andrea Alajbegovic CBRF Poster copy
 
Anth390_Final
Anth390_FinalAnth390_Final
Anth390_Final
 
SMX Melbourne 2012 - Auditing PPC Campaigns - Leigh Hanney
SMX Melbourne 2012 - Auditing PPC Campaigns - Leigh HanneySMX Melbourne 2012 - Auditing PPC Campaigns - Leigh Hanney
SMX Melbourne 2012 - Auditing PPC Campaigns - Leigh Hanney
 
PPT Chapter 1_Stat 1.pptx
PPT Chapter 1_Stat 1.pptxPPT Chapter 1_Stat 1.pptx
PPT Chapter 1_Stat 1.pptx
 
Lemon effect
Lemon effect Lemon effect
Lemon effect
 

Recently uploaded

9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAbdelrhman abooda
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 

Recently uploaded (20)

9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Decoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in ActionDecoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in Action
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 

Indian Restaurant Quality and Popularity in US Cities

  • 1. Comparative analysis on the quality and popularity of the Indian restaurants in the US. --Manzil Mudbari
  • 2. Introduction + Since I couldn’t share projects I did for my previous employers, I have done a mini-Market Research to showcase my skillset. Source code and Main project: https://colab.research.google.com/drive/1tJyxqIDbrrLm9LjUfcBcgFT9RLuk-Vum Motivation + In this project, I have used Python to extract information and prepared a report to evaluate the popularity and quality of Indian restaurants in three different regions of America. + I used Yelp API and Python to fetch pertinent information and used Python's statistical programming packages for analysis.
  • 3. Data Fetching + I have filtered API responses for the keyword "Indian Restaurant" and location (Chicago, New York, LA) in three different tables. + I used Python packages such as Pandas, JSON, Matplotlib, etc. + I got the following table out of the process, which you can also see in detail in the source code provided at the bottom of the second slide.
  • 4. Analysis Overview + I started by analyzing the distribution of ratings and review counts in each location. + I used customer ratings as a signal to evaluate and compare quality in these three regions. + I have used both ratings and the number of ratings & reviews as a signal of popularity. + Using the information from the table on the slide above, I have plotted histograms for the number of reviews and the ratings for each city. + I have also generated a summary stats table at the end of it.
  • 5. Data Analysis: New York City + Histogram on the left side shows the distribution of the number of reviews for restaurants. + We can observe that there are very few restaurants that have more than 1000 reviews. A lot of the restaurants have reviews between 300- 500. A moderate number of restaurants have reviews between 500-100. + But how do we know these reviews are positive or negative? We cannot exactly know for sure until we do some form of sentiment analysis. + So, we look at the right bar graph showing ratings, we see most of the restaurants have ratings of either 4 or 4.5. Out of 50 restaurants, around 5 have a 3.5 rating, and every other restaurant are ranked higher. + As discussed earlier, the high ratings do back the conjecture that most reviews are positive and hence, based on the histogram on the left, we can say that the Indian restaurants are popular in New York City. While, from the right graph, we can say that the customers highly approve of the overall quality of the restaurants in the city.
  • 6. SUMMARY STATS + The summary statistics just validates our intuition from the graphs above. + The mean rating across restaurants in New York City is 4.17 and the mean review count is around ~472. + This indicates there is high number of people rating Indian restaurants more than 4 (on a scale of 5.)
  • 7. Data Analysis: Los Angeles We see similar a trend in Los Angeles as we did in New York. Both the average review count (~595) and the ratings (4.18) are very high. With the same line of argument that we discussed for New York City, the quality and popularity of Indian restaurants in Los Angeles are also impressive.
  • 8. Data Analysis: Chicago • We can see that the mean review count is approximately 295 which is much lower than that of New York City and LA. In terms of ratings, the mean rating of Chicago Indian restaurants is 4.06, which is a slight decrease compared to that of LA and New York City. • We can conclude that Indian restaurants in Chicago aren’t as popular as in the other two cities, and there is also a slight decrease in quality based on the average rating.
  • 9. + The correlation is slightly negative. But the small magnitude of the correlation and the low variance hardly gives us any room for interpretation of these statistics. + Therefore, we will compare the average ratings and review counts across three cities by using bar graphs. Correlation between rating and the number of review per restaurants: -0.1328
  • 10. Visual representation of combined data. • The high average rating across all cities gives us reason to believe that, in general, customers highly approve of Indian restaurants. • It is hard to say what is causing this lower number of average reviews in Chicago despite having high ratings. • But this also gives us further scope of exploration and research.
  • 11. + We can further evaluate the difference between the low number of reviews in NY and Chicago despite having almost similar average ratings as LA. + Perhaps the restaurant business is marketed more in one region than the others, in that case we can explore data relevant to marketing in each cities. + Maybe, it has to do with the relative distribution of Indian diaspora and the income distribution in different cities, where we can explore income and demographic data. + We can also always investigate more cities to see if any of these cities is an outliers for their respective regions. Further scope of exploration.
  • 12. Conclusion: • We don’t have sufficient data to comment anything on the discrepancy in the number of reviews in different cities compared to their relatively same ratings. But it also gives us further scope to explore. • However, the combined the average review count and average rating is approximately 445 and 4.13 respectively, which are both good numbers in the context of restaurant business. • These two statistics show that Indian restaurants are very popular across three cities. • Although generalizing this result across all cities would be a big jump, nevertheless, this analysis does give us an idea about the general vibe surrounding the Indian restaurant business in the US.
  • 13. Thank You! + Manzil Mudbari + manzil@mudbari.com + San Jose, California