SlideShare a Scribd company logo
NYC Taxi Data Analysis
Sona Manukyan
This presentation will cover the following questions:
• The most pick up locations on hourly basis in a day
• How does hot weather affect to the number of taxi trips?
• How did the demand change after green taxi appearance?
• How does the fare differ of “rush hour” and “other hours”?
The most pick up locations on hourly basis in a day
(borough)
0
500000
1000000
1500000
2000000
2500000
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23
Bronx
Brooklyn
EWR
Manhattan
Queens
Staten Island
Unknown
(blank)
Most of hours in a day, especially after 17, Manhattan have the biggest share, though there a huge amount of
unknown and blank.
Script:
SELECT HOUR(pickup_datetime) As pickup_hour, borough, count(borough) as qty FROM [bigquery-public-
data:new_york.tlc_fhv_trips_2015] group by pickup_hour, borough
Hot weather vs number of taxi trips?
There is a slight difference between hot and normal
weather conditions taxi trips quantity (the graph show the
number of Yellow taxi trips in 2016).
Script for How hot weather affects to number of taxi trips:
SELECT date(pickup_datetime) as pickup_date, count(
pickup_datetime) as qty
FROM [bigquery-public-
data:new_york.tlc_yellow_trips_2016] group by
pickup_date
SELECT CONCAT( year, '-', mo, '-', da) as temp_date, case
when AVG(temp)>=40 then "hot" else "normal" end as
temp_hot
FROM [bigquery-public-data:noaa_gsod.gsod2016] group
by temp_date
358803
353901
351000
352000
353000
354000
355000
356000
357000
358000
359000
360000
hot normal
How did the demand change after green taxi
appearance on yearly basis
1% 9%
12%
11%
0
50000000
100000000
150000000
200000000
2009 2010 2011 2012 2013 2014 2015 2016
yellow_yearly_qty green_yearly_qty
After green taxi appearance in market the yearly quantity of yellow taxi trips decreased. The green
taxi yearly pick ups share in total amount of yellow and green taxi together in 2013 was 1% which
reached to 12% in 2015.
Script for How the demand was changed after green taxi appearance on yearly basis:
SELECT year( pickup_datetime) as green_year, count( pickup_datetime) as qty
FROM [bigquery-public-data:new_york.tlc_green_trips_2013] group by green_year
SELECT year( pickup_datetime) as green_year, count( pickup_datetime) as qty
FROM [bigquery-public-data:new_york.tlc_green_trips_2014] group by green_year
SELECT year( pickup_datetime) as green_year, count( pickup_datetime) as qty
FROM [bigquery-public-data:new_york.tlc_green_trips_2015] group by green_year
SELECT year( pickup_datetime) as green_year, count( pickup_datetime) as qty
FROM [bigquery-public-data:new_york.tlc_green_trips_2016] group by green_year
SELECT year( pickup_datetime) as yellow_year, count( pickup_datetime) FROM [nyc-tlc:yellow.trips] group by
yellow_year
SELECT year( pickup_datetime) as yellow_year, count( pickup_datetime) as qty
FROM [bigquery-public-data:new_york.tlc_yellow_trips_2016] group by yellow_year
“rush hour” and “other hours” fare difference
-
2,000,000
4,000,000
6,000,000
8,000,000
10,000,000
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23
Hourly taxi pick ups quantity
0.00
5.00
10.00
15.00
20.00
25.00
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23
Hourly average fare
Script:
SELECT
Hour(pickup_datetime) AS
pickup_hour,
AVG( total_amount ) AS avg_fare,
SUM( total_amount ) AS sum_fare,
COUNT( pickup_datetime) AS qty
FROM [bigquery-public-
data:new_york.tlc_yellow_trips_2016
]
GROUP BY
pickup_hour
The previous graph shows
that, hourly average fare
didn’t change dramatically, so
the sum of fare changes is
due to the hourly changes of
pick up quantity.
What is surprising, the
average fare for rush hours is
less then for other hours
(Rush hour is defined as “if
pick up quantity greater than
the average quantity, then
the hour considers as rush
hour”).
16.3
17.1
15.8
16.0
16.2
16.4
16.6
16.8
17.0
17.2
rush hour other hours
average of fare
109907757
48720367
0
20000000
40000000
60000000
80000000
100000000
120000000
rush hour other hours
sum of fare
Thank You

More Related Content

What's hot

Telecom Churn Analysis
Telecom Churn AnalysisTelecom Churn Analysis
Telecom Churn Analysis
Vasudev pendyala
 
Deep learning
Deep learningDeep learning
Deep learning
AnimaSinghDhabal
 
Data warehousing
Data warehousingData warehousing
Data warehousing
Shruti Dalela
 
Blablacar - innovation nouvelle génération
Blablacar  - innovation nouvelle générationBlablacar  - innovation nouvelle génération
Blablacar - innovation nouvelle génération
Bpifrance
 
Clustering - Machine Learning Techniques
Clustering - Machine Learning TechniquesClustering - Machine Learning Techniques
Clustering - Machine Learning Techniques
Kush Kulshrestha
 
Telecom Churn Prediction Presentation
Telecom Churn Prediction PresentationTelecom Churn Prediction Presentation
Telecom Churn Prediction Presentation
PinintiHarishReddy
 
Big-Data in HealthCare _ Overview
Big-Data in HealthCare _ OverviewBig-Data in HealthCare _ Overview
Big-Data in HealthCare _ Overview
Hamdaoui Younes
 
Churn prediction
Churn predictionChurn prediction
Churn prediction
Gigi Lino
 
Application Of Graph Data Structure
Application Of Graph Data StructureApplication Of Graph Data Structure
Application Of Graph Data Structure
Gaurang Dobariya
 
Uber presentation
Uber presentationUber presentation
Uber presentation
Brett Miller
 
MetroCluster in Clustered Data Ontap
MetroCluster in Clustered Data OntapMetroCluster in Clustered Data Ontap
MetroCluster in Clustered Data Ontap
Fast Lane Consulting and Education, Inc.
 
Module 5: Decision Trees
Module 5: Decision TreesModule 5: Decision Trees
Module 5: Decision Trees
Sara Hooker
 
HEALTH PREDICTION ANALYSIS USING DATA MINING
HEALTH PREDICTION ANALYSIS USING DATA  MININGHEALTH PREDICTION ANALYSIS USING DATA  MINING
HEALTH PREDICTION ANALYSIS USING DATA MINING
Ashish Salve
 
Customer churn prediction for telecom data set.
Customer churn prediction for telecom data set.Customer churn prediction for telecom data set.
Customer churn prediction for telecom data set.
Kuldeep Mahani
 
Association Rule.ppt
Association Rule.pptAssociation Rule.ppt
Association Rule.ppt
SowmyaJyothi3
 
Statistics vs machine learning
Statistics vs machine learningStatistics vs machine learning
Statistics vs machine learning
Tom Dierickx
 
OLAP
OLAPOLAP
OLAP
Ashir Ali
 
Detecting Fraud Using Data Mining Techniques
Detecting Fraud Using Data Mining TechniquesDetecting Fraud Using Data Mining Techniques
Detecting Fraud Using Data Mining TechniquesDecosimoCPAs
 
Data mining
Data miningData mining
Data mining
Birju Tank
 

What's hot (20)

Telecom Churn Analysis
Telecom Churn AnalysisTelecom Churn Analysis
Telecom Churn Analysis
 
Deep learning
Deep learningDeep learning
Deep learning
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Blablacar - innovation nouvelle génération
Blablacar  - innovation nouvelle générationBlablacar  - innovation nouvelle génération
Blablacar - innovation nouvelle génération
 
Clustering - Machine Learning Techniques
Clustering - Machine Learning TechniquesClustering - Machine Learning Techniques
Clustering - Machine Learning Techniques
 
Unit 4 Advanced Data Analytics
Unit 4 Advanced Data AnalyticsUnit 4 Advanced Data Analytics
Unit 4 Advanced Data Analytics
 
Telecom Churn Prediction Presentation
Telecom Churn Prediction PresentationTelecom Churn Prediction Presentation
Telecom Churn Prediction Presentation
 
Big-Data in HealthCare _ Overview
Big-Data in HealthCare _ OverviewBig-Data in HealthCare _ Overview
Big-Data in HealthCare _ Overview
 
Churn prediction
Churn predictionChurn prediction
Churn prediction
 
Application Of Graph Data Structure
Application Of Graph Data StructureApplication Of Graph Data Structure
Application Of Graph Data Structure
 
Uber presentation
Uber presentationUber presentation
Uber presentation
 
MetroCluster in Clustered Data Ontap
MetroCluster in Clustered Data OntapMetroCluster in Clustered Data Ontap
MetroCluster in Clustered Data Ontap
 
Module 5: Decision Trees
Module 5: Decision TreesModule 5: Decision Trees
Module 5: Decision Trees
 
HEALTH PREDICTION ANALYSIS USING DATA MINING
HEALTH PREDICTION ANALYSIS USING DATA  MININGHEALTH PREDICTION ANALYSIS USING DATA  MINING
HEALTH PREDICTION ANALYSIS USING DATA MINING
 
Customer churn prediction for telecom data set.
Customer churn prediction for telecom data set.Customer churn prediction for telecom data set.
Customer churn prediction for telecom data set.
 
Association Rule.ppt
Association Rule.pptAssociation Rule.ppt
Association Rule.ppt
 
Statistics vs machine learning
Statistics vs machine learningStatistics vs machine learning
Statistics vs machine learning
 
OLAP
OLAPOLAP
OLAP
 
Detecting Fraud Using Data Mining Techniques
Detecting Fraud Using Data Mining TechniquesDetecting Fraud Using Data Mining Techniques
Detecting Fraud Using Data Mining Techniques
 
Data mining
Data miningData mining
Data mining
 

Similar to NYC Taxi

鄭世昐/未來城市的任意門 (Mobility on Demand for Future Cities)
鄭世昐/未來城市的任意門 (Mobility on Demand for Future Cities)鄭世昐/未來城市的任意門 (Mobility on Demand for Future Cities)
鄭世昐/未來城市的任意門 (Mobility on Demand for Future Cities)
台灣資料科學年會
 
HAOLI-UBPL756 Desoto Travel Demand Models
HAOLI-UBPL756 Desoto Travel Demand ModelsHAOLI-UBPL756 Desoto Travel Demand Models
HAOLI-UBPL756 Desoto Travel Demand ModelsHao Li
 
2016_tlc_factbook
2016_tlc_factbook2016_tlc_factbook
2016_tlc_factbookJeff Roth
 
San Francisco Transportation Plan - May 30, 2013 Board Workshop Presentations
San Francisco Transportation Plan - May 30, 2013 Board Workshop PresentationsSan Francisco Transportation Plan - May 30, 2013 Board Workshop Presentations
San Francisco Transportation Plan - May 30, 2013 Board Workshop Presentations
SFCTA
 
A travel behaviour change framework for the city of Cape Town.pdf
A travel behaviour change framework for the city of Cape Town.pdfA travel behaviour change framework for the city of Cape Town.pdf
A travel behaviour change framework for the city of Cape Town.pdf
Samantha Martinez
 
foresight-47-congestion-charging-north-america
foresight-47-congestion-charging-north-americaforesight-47-congestion-charging-north-america
foresight-47-congestion-charging-north-americaClark Savolaine
 
Next Generation Intelligent Transportation: Solutions for Smart Cities
Next Generation Intelligent Transportation: Solutions for Smart CitiesNext Generation Intelligent Transportation: Solutions for Smart Cities
Next Generation Intelligent Transportation: Solutions for Smart Cities
UGPTI
 
Case Studies in Managing Traffic in a Developing Country with Privacy-Preserv...
Case Studies in Managing Traffic in a Developing Country with Privacy-Preserv...Case Studies in Managing Traffic in a Developing Country with Privacy-Preserv...
Case Studies in Managing Traffic in a Developing Country with Privacy-Preserv...
Biplav Srivastava
 
San Francisco Smart City Challenge
San Francisco Smart City Challenge San Francisco Smart City Challenge
San Francisco Smart City Challenge
Urban SDK
 
A Wish Called $quander
A Wish Called $quanderA Wish Called $quander
A Wish Called $quander
Catalystian
 
Global traffic scorecard di Inrix
Global traffic scorecard di InrixGlobal traffic scorecard di Inrix
Global traffic scorecard di Inrix
Filippo Bernardi
 
Chapter 1 IntroductionIn The High Cost of Free Parking, Dr. Danie.docx
Chapter 1 IntroductionIn The High Cost of Free Parking, Dr. Danie.docxChapter 1 IntroductionIn The High Cost of Free Parking, Dr. Danie.docx
Chapter 1 IntroductionIn The High Cost of Free Parking, Dr. Danie.docx
sleeperharwell
 
Government Policies – Transit - Canada - February 2018
Government Policies – Transit -  Canada - February 2018Government Policies – Transit -  Canada - February 2018
Government Policies – Transit - Canada - February 2018
paul young cpa, cga
 
Cahier_Traffic_volume.pptx
Cahier_Traffic_volume.pptxCahier_Traffic_volume.pptx
Cahier_Traffic_volume.pptx
MohamedMohamed216790
 
Transit - Ridership and Transit Revenue - Canada - May 2018
Transit - Ridership and Transit Revenue - Canada - May 2018Transit - Ridership and Transit Revenue - Canada - May 2018
Transit - Ridership and Transit Revenue - Canada - May 2018
paul young cpa, cga
 
Marketing Analytics Final Project
Marketing Analytics Final ProjectMarketing Analytics Final Project
Marketing Analytics Final Project
AlexandraBlom1
 
San Francisco Transportation Plan Overview - Fall 2010
San Francisco Transportation Plan Overview - Fall 2010San Francisco Transportation Plan Overview - Fall 2010
San Francisco Transportation Plan Overview - Fall 2010SanFranciscoTA
 
Effect of Weather on Uber Ridership_rev1 (1)
Effect of Weather on Uber Ridership_rev1 (1)Effect of Weather on Uber Ridership_rev1 (1)
Effect of Weather on Uber Ridership_rev1 (1)Anusha Mamillapalli
 

Similar to NYC Taxi (20)

鄭世昐/未來城市的任意門 (Mobility on Demand for Future Cities)
鄭世昐/未來城市的任意門 (Mobility on Demand for Future Cities)鄭世昐/未來城市的任意門 (Mobility on Demand for Future Cities)
鄭世昐/未來城市的任意門 (Mobility on Demand for Future Cities)
 
HAOLI-UBPL756 Desoto Travel Demand Models
HAOLI-UBPL756 Desoto Travel Demand ModelsHAOLI-UBPL756 Desoto Travel Demand Models
HAOLI-UBPL756 Desoto Travel Demand Models
 
2016_tlc_factbook
2016_tlc_factbook2016_tlc_factbook
2016_tlc_factbook
 
San Francisco Transportation Plan - May 30, 2013 Board Workshop Presentations
San Francisco Transportation Plan - May 30, 2013 Board Workshop PresentationsSan Francisco Transportation Plan - May 30, 2013 Board Workshop Presentations
San Francisco Transportation Plan - May 30, 2013 Board Workshop Presentations
 
A travel behaviour change framework for the city of Cape Town.pdf
A travel behaviour change framework for the city of Cape Town.pdfA travel behaviour change framework for the city of Cape Town.pdf
A travel behaviour change framework for the city of Cape Town.pdf
 
foresight-47-congestion-charging-north-america
foresight-47-congestion-charging-north-americaforesight-47-congestion-charging-north-america
foresight-47-congestion-charging-north-america
 
Next Generation Intelligent Transportation: Solutions for Smart Cities
Next Generation Intelligent Transportation: Solutions for Smart CitiesNext Generation Intelligent Transportation: Solutions for Smart Cities
Next Generation Intelligent Transportation: Solutions for Smart Cities
 
Case Studies in Managing Traffic in a Developing Country with Privacy-Preserv...
Case Studies in Managing Traffic in a Developing Country with Privacy-Preserv...Case Studies in Managing Traffic in a Developing Country with Privacy-Preserv...
Case Studies in Managing Traffic in a Developing Country with Privacy-Preserv...
 
San Francisco Smart City Challenge
San Francisco Smart City Challenge San Francisco Smart City Challenge
San Francisco Smart City Challenge
 
A Wish Called $quander
A Wish Called $quanderA Wish Called $quander
A Wish Called $quander
 
Global traffic scorecard di Inrix
Global traffic scorecard di InrixGlobal traffic scorecard di Inrix
Global traffic scorecard di Inrix
 
Chapter 1 IntroductionIn The High Cost of Free Parking, Dr. Danie.docx
Chapter 1 IntroductionIn The High Cost of Free Parking, Dr. Danie.docxChapter 1 IntroductionIn The High Cost of Free Parking, Dr. Danie.docx
Chapter 1 IntroductionIn The High Cost of Free Parking, Dr. Danie.docx
 
Government Policies – Transit - Canada - February 2018
Government Policies – Transit -  Canada - February 2018Government Policies – Transit -  Canada - February 2018
Government Policies – Transit - Canada - February 2018
 
Cahier_Traffic_volume.pptx
Cahier_Traffic_volume.pptxCahier_Traffic_volume.pptx
Cahier_Traffic_volume.pptx
 
Transit - Ridership and Transit Revenue - Canada - May 2018
Transit - Ridership and Transit Revenue - Canada - May 2018Transit - Ridership and Transit Revenue - Canada - May 2018
Transit - Ridership and Transit Revenue - Canada - May 2018
 
Marketing Analytics Final Project
Marketing Analytics Final ProjectMarketing Analytics Final Project
Marketing Analytics Final Project
 
San Francisco Transportation Plan Overview - Fall 2010
San Francisco Transportation Plan Overview - Fall 2010San Francisco Transportation Plan Overview - Fall 2010
San Francisco Transportation Plan Overview - Fall 2010
 
A Tale of Two Forecasts
A Tale of Two ForecastsA Tale of Two Forecasts
A Tale of Two Forecasts
 
Effect of Weather on Uber Ridership_rev1 (1)
Effect of Weather on Uber Ridership_rev1 (1)Effect of Weather on Uber Ridership_rev1 (1)
Effect of Weather on Uber Ridership_rev1 (1)
 
Transportasi kota malang
Transportasi kota malangTransportasi kota malang
Transportasi kota malang
 

Recently uploaded

原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
u86oixdj
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Subhajit Sahu
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
Oppotus
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
John Andrews
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 

Recently uploaded (20)

原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 

NYC Taxi

  • 1. NYC Taxi Data Analysis Sona Manukyan
  • 2. This presentation will cover the following questions: • The most pick up locations on hourly basis in a day • How does hot weather affect to the number of taxi trips? • How did the demand change after green taxi appearance? • How does the fare differ of “rush hour” and “other hours”?
  • 3. The most pick up locations on hourly basis in a day (borough) 0 500000 1000000 1500000 2000000 2500000 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 Bronx Brooklyn EWR Manhattan Queens Staten Island Unknown (blank) Most of hours in a day, especially after 17, Manhattan have the biggest share, though there a huge amount of unknown and blank. Script: SELECT HOUR(pickup_datetime) As pickup_hour, borough, count(borough) as qty FROM [bigquery-public- data:new_york.tlc_fhv_trips_2015] group by pickup_hour, borough
  • 4. Hot weather vs number of taxi trips? There is a slight difference between hot and normal weather conditions taxi trips quantity (the graph show the number of Yellow taxi trips in 2016). Script for How hot weather affects to number of taxi trips: SELECT date(pickup_datetime) as pickup_date, count( pickup_datetime) as qty FROM [bigquery-public- data:new_york.tlc_yellow_trips_2016] group by pickup_date SELECT CONCAT( year, '-', mo, '-', da) as temp_date, case when AVG(temp)>=40 then "hot" else "normal" end as temp_hot FROM [bigquery-public-data:noaa_gsod.gsod2016] group by temp_date 358803 353901 351000 352000 353000 354000 355000 356000 357000 358000 359000 360000 hot normal
  • 5. How did the demand change after green taxi appearance on yearly basis 1% 9% 12% 11% 0 50000000 100000000 150000000 200000000 2009 2010 2011 2012 2013 2014 2015 2016 yellow_yearly_qty green_yearly_qty After green taxi appearance in market the yearly quantity of yellow taxi trips decreased. The green taxi yearly pick ups share in total amount of yellow and green taxi together in 2013 was 1% which reached to 12% in 2015.
  • 6. Script for How the demand was changed after green taxi appearance on yearly basis: SELECT year( pickup_datetime) as green_year, count( pickup_datetime) as qty FROM [bigquery-public-data:new_york.tlc_green_trips_2013] group by green_year SELECT year( pickup_datetime) as green_year, count( pickup_datetime) as qty FROM [bigquery-public-data:new_york.tlc_green_trips_2014] group by green_year SELECT year( pickup_datetime) as green_year, count( pickup_datetime) as qty FROM [bigquery-public-data:new_york.tlc_green_trips_2015] group by green_year SELECT year( pickup_datetime) as green_year, count( pickup_datetime) as qty FROM [bigquery-public-data:new_york.tlc_green_trips_2016] group by green_year SELECT year( pickup_datetime) as yellow_year, count( pickup_datetime) FROM [nyc-tlc:yellow.trips] group by yellow_year SELECT year( pickup_datetime) as yellow_year, count( pickup_datetime) as qty FROM [bigquery-public-data:new_york.tlc_yellow_trips_2016] group by yellow_year
  • 7. “rush hour” and “other hours” fare difference - 2,000,000 4,000,000 6,000,000 8,000,000 10,000,000 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 Hourly taxi pick ups quantity 0.00 5.00 10.00 15.00 20.00 25.00 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 Hourly average fare Script: SELECT Hour(pickup_datetime) AS pickup_hour, AVG( total_amount ) AS avg_fare, SUM( total_amount ) AS sum_fare, COUNT( pickup_datetime) AS qty FROM [bigquery-public- data:new_york.tlc_yellow_trips_2016 ] GROUP BY pickup_hour
  • 8. The previous graph shows that, hourly average fare didn’t change dramatically, so the sum of fare changes is due to the hourly changes of pick up quantity. What is surprising, the average fare for rush hours is less then for other hours (Rush hour is defined as “if pick up quantity greater than the average quantity, then the hour considers as rush hour”). 16.3 17.1 15.8 16.0 16.2 16.4 16.6 16.8 17.0 17.2 rush hour other hours average of fare 109907757 48720367 0 20000000 40000000 60000000 80000000 100000000 120000000 rush hour other hours sum of fare