SlideShare a Scribd company logo
1 of 18
Data Challenge
TeamDC19036
Wei Chen
MSBA Student
Liya Zhang
MSBA Student
Project Team1
Yunjia Jiang
MSBA Student
Tariq Haque
MSBA Student
Agenda
• Introduction
• Exploratory Analysis of Data
• Manual Anomaly Detection
• Using Machine to Detect Anomaly
• Recommendations
• Future Studies
2 Challenge for Taxicabs
Dynamic
Fare
Fixed Fare
3
11M
2017 Trip Data from
DFHV
Time, fare amount, origin,
destination, mileage, duration,
etc.
42 Variables
53
Zip codes
Data Description
150
Taxi companies
4 Neighborhoods
• Divided DC into 217
neighborhoods
• These are small regions
considered to possess
uniform characters
• Added neighborhood
information for each trip
5 Hourly Trip Frequency
11 AM : 10 PM
6 Average Trip Distance by Hour
Longest distance
happens at
3 to 7 am
7
Customers take taxis
most frequently on
Weekly Trip Frequency
8 Weekly Trip Frequency by Hour
9 How do we decide price?
• Match supply with ridership/demand
• Study pattern in ridership and identify anomalies
○ Hourly and Weekly demand pattern in each neighborhood
○ Identify anomalies in demand pattern by comparing with overall pattern
10
0
Anomaly 1
11 Anomaly 2
12 Anomaly 3
13 Anomaly Detection Code
Comparing 36,456 with
168 data points
14 Recommendations
Anomalies for the Union Station Neighborhood
● High Ridership: Experiment by increasing fares and assess
the impact on ridership
● Low Ridership: Experiment by providing discounts and assess
the impact on ridership
15 What to do in future?
➔Conduct our analysis on complete dataset
➔Apply machine learning to cluster
neighborhoods based on their characteristics
(hospitals, train station etc)
◆ Match this information with areas showing
anomalies to efficiently identify reasons for
anomalies
➔Collect drivers’ real-time geolocation
information for supply and analysis
15

More Related Content

Similar to Data challenge - Department of For-Hire Vehicles

Theme 3 The costumer experience
Theme 3 The costumer experienceTheme 3 The costumer experience
Theme 3 The costumer experience
BRTCoE
 
Performance and challenges of zonal taxi transport system
Performance and challenges of zonal taxi transport systemPerformance and challenges of zonal taxi transport system
Performance and challenges of zonal taxi transport system
wondem
 

Similar to Data challenge - Department of For-Hire Vehicles (20)

Final project_Hop On
Final project_Hop OnFinal project_Hop On
Final project_Hop On
 
Liberty Mobility Now
Liberty Mobility NowLiberty Mobility Now
Liberty Mobility Now
 
Remix Conference 2015—Robert del Rosario, "Shaking Up Service and Standardizi...
Remix Conference 2015—Robert del Rosario, "Shaking Up Service and Standardizi...Remix Conference 2015—Robert del Rosario, "Shaking Up Service and Standardizi...
Remix Conference 2015—Robert del Rosario, "Shaking Up Service and Standardizi...
 
Itm 2016
Itm 2016Itm 2016
Itm 2016
 
Theme 3 The costumer experience
Theme 3 The costumer experienceTheme 3 The costumer experience
Theme 3 The costumer experience
 
Dept. of aging 2012
Dept. of aging 2012Dept. of aging 2012
Dept. of aging 2012
 
Predict Traffic flow with KNN and LSTM
Predict Traffic flow with KNN and LSTMPredict Traffic flow with KNN and LSTM
Predict Traffic flow with KNN and LSTM
 
Performance and challenges of zonal taxi transport system
Performance and challenges of zonal taxi transport systemPerformance and challenges of zonal taxi transport system
Performance and challenges of zonal taxi transport system
 
BRT Workshop - The Customer Experience
BRT Workshop - The Customer ExperienceBRT Workshop - The Customer Experience
BRT Workshop - The Customer Experience
 
Developer day-2013-01-17
Developer day-2013-01-17Developer day-2013-01-17
Developer day-2013-01-17
 
Survey on BEST buses Mumbai
Survey on BEST  buses MumbaiSurvey on BEST  buses Mumbai
Survey on BEST buses Mumbai
 
Microtransit Planning & Implementation Lessons from the North Carolina Experi...
Microtransit Planning & Implementation Lessons from the North Carolina Experi...Microtransit Planning & Implementation Lessons from the North Carolina Experi...
Microtransit Planning & Implementation Lessons from the North Carolina Experi...
 
Origin and destination survey
Origin and destination surveyOrigin and destination survey
Origin and destination survey
 
Proactive Planning for Autonomous Vehicles - American Planning Assn Annual Co...
Proactive Planning for Autonomous Vehicles - American Planning Assn Annual Co...Proactive Planning for Autonomous Vehicles - American Planning Assn Annual Co...
Proactive Planning for Autonomous Vehicles - American Planning Assn Annual Co...
 
Analyzing Streets, Traffic, and Drivetimes in MapInfo Pro
Analyzing Streets, Traffic, and Drivetimes in MapInfo ProAnalyzing Streets, Traffic, and Drivetimes in MapInfo Pro
Analyzing Streets, Traffic, and Drivetimes in MapInfo Pro
 
Redbus
RedbusRedbus
Redbus
 
Starting Bicycle and Pedestrian Counting Programs
Starting Bicycle and Pedestrian Counting ProgramsStarting Bicycle and Pedestrian Counting Programs
Starting Bicycle and Pedestrian Counting Programs
 
Greater Brisbane transport demand model
Greater Brisbane transport demand modelGreater Brisbane transport demand model
Greater Brisbane transport demand model
 
Traffic studies ORIGIN AND DESTINATION STUDIES
Traffic studies ORIGIN AND DESTINATION STUDIESTraffic studies ORIGIN AND DESTINATION STUDIES
Traffic studies ORIGIN AND DESTINATION STUDIES
 
Use of IT in Data Collection and its Implications in Improving Service Operat...
Use of IT in Data Collection and its Implications in Improving Service Operat...Use of IT in Data Collection and its Implications in Improving Service Operat...
Use of IT in Data Collection and its Implications in Improving Service Operat...
 

Recently uploaded

一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理
cyebo
 
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotecAbortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
pyhepag
 
Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptx
DilipVasan
 
Fuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertaintyFuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertainty
RafigAliyev2
 
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
pyhepag
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理
pyhepag
 

Recently uploaded (20)

2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting
 
一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理
 
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotecAbortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
 
AI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdfAI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdf
 
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
 
Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptx
 
Generative AI for Trailblazers_ Unlock the Future of AI.pdf
Generative AI for Trailblazers_ Unlock the Future of AI.pdfGenerative AI for Trailblazers_ Unlock the Future of AI.pdf
Generative AI for Trailblazers_ Unlock the Future of AI.pdf
 
Slip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp ClaimsSlip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp Claims
 
Pre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxPre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptx
 
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
 
How I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prisonHow I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prison
 
社内勉強会資料  Mamba - A new era or ephemeral
社内勉強会資料   Mamba - A new era or ephemeral社内勉強会資料   Mamba - A new era or ephemeral
社内勉強会資料  Mamba - A new era or ephemeral
 
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
 
Fuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertaintyFuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertainty
 
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
 
Machine Learning for Accident Severity Prediction
Machine Learning for Accident Severity PredictionMachine Learning for Accident Severity Prediction
Machine Learning for Accident Severity Prediction
 
Artificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdfArtificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdf
 
basics of data science with application areas.pdf
basics of data science with application areas.pdfbasics of data science with application areas.pdf
basics of data science with application areas.pdf
 
Easy and simple project file on mp online
Easy and simple project file on mp onlineEasy and simple project file on mp online
Easy and simple project file on mp online
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理
 

Data challenge - Department of For-Hire Vehicles

  • 2. Wei Chen MSBA Student Liya Zhang MSBA Student Project Team1 Yunjia Jiang MSBA Student Tariq Haque MSBA Student
  • 3. Agenda • Introduction • Exploratory Analysis of Data • Manual Anomaly Detection • Using Machine to Detect Anomaly • Recommendations • Future Studies
  • 4. 2 Challenge for Taxicabs Dynamic Fare Fixed Fare
  • 5. 3 11M 2017 Trip Data from DFHV Time, fare amount, origin, destination, mileage, duration, etc. 42 Variables 53 Zip codes Data Description 150 Taxi companies
  • 6. 4 Neighborhoods • Divided DC into 217 neighborhoods • These are small regions considered to possess uniform characters • Added neighborhood information for each trip
  • 7. 5 Hourly Trip Frequency 11 AM : 10 PM
  • 8. 6 Average Trip Distance by Hour Longest distance happens at 3 to 7 am
  • 9. 7 Customers take taxis most frequently on Weekly Trip Frequency
  • 10. 8 Weekly Trip Frequency by Hour
  • 11. 9 How do we decide price? • Match supply with ridership/demand • Study pattern in ridership and identify anomalies ○ Hourly and Weekly demand pattern in each neighborhood ○ Identify anomalies in demand pattern by comparing with overall pattern
  • 15. 13 Anomaly Detection Code Comparing 36,456 with 168 data points
  • 16. 14 Recommendations Anomalies for the Union Station Neighborhood ● High Ridership: Experiment by increasing fares and assess the impact on ridership ● Low Ridership: Experiment by providing discounts and assess the impact on ridership
  • 17. 15 What to do in future? ➔Conduct our analysis on complete dataset ➔Apply machine learning to cluster neighborhoods based on their characteristics (hospitals, train station etc) ◆ Match this information with areas showing anomalies to efficiently identify reasons for anomalies ➔Collect drivers’ real-time geolocation information for supply and analysis
  • 18. 15