New York City Restaurant Inspection Analysis

Should You Eat There?
An Analysis of NYC Restaurant Inspection Data
BusinessIntelligence
&DataAnalytics

Samantha Grant
Jingshu Sun
Akash Dhruv
Candice Brown
Leeyat Slyper
Meet Our Team
Group 2

Agenda
The Data
Data Exploration
Unsupervised Learning
Supervised Learning
Recommendations
1
2
3
4
5

Business
Objectives
IDENTIFYING
VIOLATION
TRENDS
1
PREDICTING
VIOLATIONS
2
REDUCING
VIOLATIONS
3
Help NYC restaurant,
and restaurant-goers
by...

So there won’t
be any more of
this...

Data Attributes
● Inspection Date
● Inspection Type
● Violation Code
● Critical Flag
● Grade (A,B,C)
● Scores
● ID
● Restaurant Name
● Cuisine Description
● New York Boro
● Zip Code
RESTAURANT DETAILS VIOLATION DETAILS
477,000 rows

Data Cleaning
1
2
3
4
Removed rows with
inspection dates in the
future.
REMOVED
BAD DATA
Reduced number
of rows
SHRANK
DATA SET
FIXED SPELLING &
INCONSISTENCIES
REPLACEMENT
& FLAG CREATION
Fixed spelling errors.
Replaced ‘Not Yet Graded’
with ‘N’.
Broke ‘Inspection Type’
into 2 columns.
Violation Categories
Inspection Categories
Seasonal Flags
Landmark Flags

● Allergies/Safety
● Animals
● Certification
● Documentation
Replacement
110 Violation Codes → 13 Violation Categories
● Facility Amenities
● Tobacco
● Facility Cleanliness
● Hazardous
Chemicals
● Food Temperature
● Food Contamination
● Tobacco
● Worker Cleanliness
● Other

TOP 3
Violation Categories
#1
Facility
Amenities
#2
Animals
#3
Facility
Cleanliness
? Violation Trends:
What are the most common violation types?

?Violation Density:
Which borough has the most violations?
Staten Island
Queens
Brooklyn
Bronx
Manhattan

1,438,159 population
13,221 persons/sq. km
9.48%
2,321,580 population
8,237 persons/sq. km
24.07%
39%
Violations
Manhattan
3%
Violations
Staten Island
24%
Violations
Brooklyn
9%
Violations
Bronx
24%
Violations
Queens
Restaurant Density vs. Percent Violations

These articles confirm our findings...

Insight:
There are not major
differences in average
restaurant scores
despite differing
borough wealth and
popularity.
Do inspection scores differ
across borough? ?

Recommendation:
Re-opening average
scores are lowest
scores. A separate
process could be in
place for re-openings to
ensure good scores.
Inspection Type:
How Do Scores Differ for Inspection Types ?

Restaurant Grade Distribution:Takeaways:
● Hamburgers,
Cafes and
American
food have the
highest % of
A grades.
● Indian food
has the
largest share
of C grades
Grade A
Grade B
Grade C
Source: What’s the safest food in New York City? - Data Diversions - tumblr.com [NYC Open Data]

Association Rules
Animals
Facility Amenities
Worker Cleanliness
Facility Cleanliness
Food Temperature
Food Contamination
1.06 Lift
1.01 Lift

Violations per
Season
Winter
~2k
Spring
>2K
Summer
<1.5k
Fall
~1.5k
Seasonal Trends:
Which season has the most violations?
Spring has the most violations & American, Chinese and Italian Food had the most violations.
?
Winter Spring Summer Fall

Clustering Results:
Segment Size

Takeaway:
Seasonal Dummy Variable was the most influential across the boardVariable Worth

Cluster Findings:
What are the prevalence of violations by season?
Takeaways:
Cluster 1: Spring
Cluster 2: Summer
Cluster 3: Winter.
highest Manhattan
incidence
Cluster 4: Spring
All Clusters:
American & Chinese
food violations,
Manhattan &
Brooklyn, Score
impactful on all
clusters, especially
1 & 4
Other Findings:
Staten Island is not
impactful on any
cluster

Cluster Findings:
What are the prevalence of violations by grade?
Takeaways:
Cluster 1: C Grade,
Food Temp,
Flies/Food Refuse
Violation, Mice
Cluster 2: A Grade
Cluster 3: A Grade,
highest Manhattan
incidence
Cluster 4: B Grade
All Clusters:
Manhattan
impactful on all
clusters

Focus Point: Chipotle
Answer:
Yes...in STATEN
ISLAND - No
violations were
detected in any
Chipotle outlets there
Top Borough for
violations at Chipotle
outlets:
MANHATTAN

Focus Point: Chipotle
Takeaways:
Most common
violations category:
1. Animals: 04N
2. Food Temperature:
02B, 02G
3. Worker
Cleanliness: 06A, 06B

Do landmark NY
restaurants perform
better?
?

Focus Point: Landmark Restaurants
Landmark
Restaurants:
- Famous
- Oldest
- Movie Scenes
- Favorites

Hypothesis
Confirmed:
Not Critical violations
are more common for
Landmark
restaurants than
others.

Hypothesis
Confirmed:
Landmark
restaurants have
higher percentage of
A’s.

Finding:
Second most
common violation for
landmark restaurants
due to not cleaning
surfaces after each
use
Recommendation:
Hire employee who
cleans while chefs
cook

Hypothesis Not
Supported:
Violations, or lack
thereof are not
indicators of
Landmark
restaurants.

What factors lead to
a judgement of
critical violation?
?

Part One: Decision Tree Model
VIOLATION PREDICTION --- Interpreting the Inspection Result
What kind of restaurants are more likely to be judged critical violation?
Key: Create a CRITICAL_DUMMY according to CRITICAL_FLAG; Assign Role “Target” and Level “Binary”
Not Critical Critical
Critical_Dummy = 0
VS Critical_Dummy = 1

Unsupervised
Learning
SCORE
CRITICAL_
FLAG
Cheating
Splitting
?
Variable Selection

Unsupervised
Learning
Two-Way
&
Three -
Way
?
Running Model:
Data Partition--70% Training Data & 30% Validation Data

Findings (Two-Way):
Grade
1.0000
Inspection
_Type
0.4314
BORO
0.1675
Restaurants who get a
score under B are 68.17%
likely to be judged critical
violation, compared to
48% likely to be critical
violation with Grade A.
Restaurants with an initial
low grade are more likely
to be judged a critical
violation during re-
inspection, with a
possibility to nearly 70%.
“BORO” does not appear
to affect much on Critical
Violation. The probability for
critical judging is around
52% for re-inspection with
initial high grades in all
regions.

Part Two: Logistic Regression
Outcome: Critical_Dummy
Variable Selection: Stepwise

Findings (Similar to Decision Tree):
Score GRADE BInspection
Type
GRADE C
0.0983 0.0948 0.06100.1596

● Dine after Spring, since restaurants have been issued the most violations
by that time.
● Be wary of Indian and Chinese restaurants in New York City.
● Don’t pay Manhattan prices; it does not have cleaner restaurants.
● If you want to eat at Chipotle, go to Staten Island.
FOR THE HUNGRY CONSUMER

● Hire a dedicated cleaner in high-volume landmark restaurants.
● Since Facility Amenities violations are the most common, construction
is a critical stage -- do extensive research before contracting.
● Focus on cleanliness for the Spring season.
● Be sure to do well for re-inspection, you’ll either pass with flying colors
or be severely penalized.
● Set a benchmark to be met before allowing re-opening.
FOR RESTAURANTS

Questions?
Thanks for listening!

New York City Restaurant Inspection Analysis

Recommended

Recommended

More Related Content

What's hot

What's hot (7)

Similar to New York City Restaurant Inspection Analysis

Similar to New York City Restaurant Inspection Analysis (9)

Recently uploaded

Recently uploaded (20)

New York City Restaurant Inspection Analysis