SlideShare a Scribd company logo
MBTA Route Performance Analysis
Rider ExperienceSchedule Analysis Vehicle Optimization
Agenda
1. Problem Statement: Transportation Issues
2. Data, What it can/cannot tell us
3. Platform Overview
4. Early Observations
5. Early Insights
6. Early Recommendations to MBTA
1. Problem Statement: Transportation Issues
• Waits can be long
• Buses not always showing up on schedule
• Buses sometimes stacked one-after-another
which is wasteful
• How much, how often, and why?
2. Data, What it can tell us
"route_id": "71",
"mode_name": "Bus",
"direction_name": "Inbound",
"trip_id": "28346992",
"trip_name": "9:30 pm from Watertown Sq …",
"vehicle_id": "y4128",
"vehicle_lat": "42.3747940063477",
"vehicle_lon": "-71.1326065063477",
"vehicle_bearing": "85",
"vehicle_timestamp": "1447988172"
11/19/2015
9:56:12 PM
Every 60 seconds, each bus reports its position
These dots represent the 20+ pings of a typical trip of the Route 71 bus
2. Data, What it can tell us
route_id: 71
INDEX: 42
trip_id: 28346775
stop_id:
stop_sequence:
vehicle_id: y4121
vehicle_lat: 42.3740653991699
vehicle_lon: -71.1249313354492
vehicle_timestamp: 1444948220
route_id: 71
INDEX: 41
trip_id: 28346775
stop_id: 2073
stop_sequence: 21
vehicle_id: y4121
vehicle_lat: 42.373956
vehicle_lon: -71.124752
vehicle_timestamp: 1444948214
route_id: 71
INDEX: 43
trip_id: 28346775
stop_id: 2074
stop_sequence: 22
vehicle_id: y4121
vehicle_lat: 42.373356
vehicle_lon: -71.123226
vehicle_timestamp: 1444948311
By portioning time according to
distance, we can estimate when
the bus was at the stop.
2. Data, What it can tell us
StopsStopsStopsStopsStopsBus DataBus DataBus DataBus Data
2. Data, What it cannot tell us
We don’t know how many people are on the bus:
- Can’t tell whether riders are getting left at the curb (full bus)
- Can’t tell the busy stops from the light stops
- Can’t understand utilization by segment
3. Platform Overview
Web-based “SaaS” solution
- Add/Del Tracked Routes
- Summary Scorecard
- Detailed analysis
- Schedule Variance
- Schedule Viability
- Wait time Distribution
- Vehicle Deployment Policy
3. Platform Overview
Scorecard, comprised of metrics, so that
• Best performing routes can be learned from,
• Worst performing routes can be addressed
Heat map to show Schedule Variance Distribution to show Wait Times
Maps to show Bus Locations Logs to validate charts
DONE
3. Platform Overview
http://realtime.mbta.com/portal
current_location raw_location
Fetcher()
trip_actuals
raw_to_trip()
trip_variance
compute_trip_variance()
DONE
3. Platform Overview
http://realtime.mbta.com/portal
current_location raw_location
Fetcher()
trip_actuals
raw_to_trip()
trip_variance
compute_trip_variance()
GTIF
Schedules
4. Early Observations
5:13 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
5:33 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
5:53 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
6:06 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
6:14 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
6:20 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
6:27 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
6:35 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
6:42 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
6:49 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
6:56 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
7:11 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
7:20 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
7:29 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
7:38 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
7:45 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
7:52 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
7:59 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
8:10 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
8:23 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
8:33 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
8:43 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
8:48 am from Watertown Sq Terminal to Harvard Upper Busway @ Red Line
8:54 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
8:58 am from Watertown Sq Terminal to Harvard Upper Busway @ Red Line
9:09 am from Watertown Sq Terminal to Harvard Upper Busway @ Red Line
9:15 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
9:24 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
9:34 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
9:44 am from Watertown Sq Terminal to Harvard Upper Busway @ Red Line
9:55 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
10:10 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
10:25 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
10:40 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
10:55 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
11:10 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
11:25 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave
Printed Schedule Published GTIF Schedule (Database)

Printed schedule not the same as the operations schedule.
4. Early Observations
Frequency
0
earlier 0
15 to 20 min early 1
10 to 15 min early 4
5 to 10 min early 165
2 to 5 min early 1202
+/- 2 min 699
2 to 5 min late 617
5 to 10 min late 397
10 to 15 min late 354
15 to 20 min late 374
later 0
Early Late
According to Schedule …
Route 71 is frequently early or late, too-often it is very late.
4. Early Observations
Route 71 wait time (time between buses) can be significant.
Frequency
no wait 4
0-5 mins 1099
5-10 mins 809
10-15 mins 761
15-20 mins 466
20-25 mins 313
25-30 mins 169
30-35 mins 53
35-40 mins 51
40-45 mins 27
45-50 mins 5
50 mins or more 8
Min 0:00:00
Max 1:56:45
AVG 0:11:24
Median 0:09:50
29% > 15 min wait
4. Early ObservationsMorningRushEveningRush
10/19/2015 5:37
10/19/2015 5:57
10/19/2015 6:05
10/19/2015 6:11
10/19/2015 6:17
10/19/2015 6:31
10/19/2015 6:48
10/19/2015 6:41
10/19/2015 6:55
10/19/2015 7:02
10/19/2015 7:09
10/19/2015 7:16
10/19/2015 7:24
10/19/2015 7:30 <10 min
10/19/2015 7:42
10/19/2015 7:56
10/19/2015 8:12
10/19/2015 8:04
10/19/2015 8:26
10/19/2015 8:19
10/19/2015 8:33
10/19/2015 8:43
10/19/2015 8:48
10/19/2015 8:58
10/19/2015 9:18
10/19/2015 9:08
10/19/2015 9:30
10/19/2015 9:42
10/19/2015 9:55 <15 min
10/19/2015 10:25
10/19/2015 10:40
10/19/2015 11:10
10/19/2015 11:25
10/19/2015 11:40
10/19/2015 11:55
10/19/2015 12:08
10/19/2015 12:23
10/19/2015 12:38
10/19/2015 12:53
10/19/2015 13:08
10/19/2015 13:23
10/19/2015 13:38
10/19/2015 13:53
10/19/2015 14:06
10/19/2015 14:14
10/19/2015 14:24
10/19/2015 14:43
10/19/2015 14:51
10/19/2015 14:59
10/19/2015 15:10
10/19/2015 15:17
10/19/2015 15:28
10/19/2015 15:37 <10 min
10/19/2015 15:55
10/19/2015 16:04
10/19/2015 16:12
10/19/2015 16:33
10/19/2015 16:24
10/19/2015 16:43
10/19/2015 17:00
10/19/2015 17:10
10/19/2015 17:37
10/19/2015 17:19
10/19/2015 17:46
10/19/2015 18:04
10/19/2015 17:55
10/19/2015 18:13
10/19/2015 18:22
10/19/2015 18:40
10/19/2015 18:31
10/19/2015 18:49
10/19/2015 19:00
10/19/2015 19:22
10/19/2015 19:36
10/19/2015 19:50
Worst waits are
at peak periods!
4. Early Observations
TRIP TIMES
min 0:15:37
max 2:50:06
avg 0:27:12
median 0:25:19
End-to-end trip times vary widely, but
averages are not sensitive to changing
conditions of the day … we need to
observe more closely
Outbound, Total Time
Inbound, Total Time
4. Early Insights
End-to-end trip times vary widely,
depending on time of day. Schedules
could be aligned more closely to the
reality of traffic on the route throughout
the day.
4. Early Observations
End-to-end trip times vary widely,
depending on time of day. Schedules
could be aligned more closely to the
reality of traffic on the route throughout
the day.
4. Early Observations
A second look at the day reveals that
schedule is overly aggressive and can’t be
met.
Either schedule should be reduced to
meet equipment or equipment should be
increased to meet schedule.
>
Late more often than Early
Not enough equipment to meet Schedule
4. Summary of Early Observations
• Insufficient buses to meet 71 schedule
– Fall behind during peek periods
– Almost catch up during lull periods
– Missed Trips, 17 of 630 on 10/19/15 = 2.7%
• Propensity for Buses to cluster
– Un-uniform Wait Times
• Very short waits or very long waits
• Route 71 Schedule is a fallacy
– Printed schedule inconsistent with GTIF
– Buses are simply looping as fast as they can
– Number of vehicles governs availability
5. Early Insights
Optimizing for equipment utilization, looping as fast as possible, results in
vehicle clustering, short waits followed by long waits (time between buses).
Thesis:
Equilibrium
Uniform Wait Time
Uniform Occupancy
Uniform Velocity
5. Early Insights
Optimizing for equipment utilization, looping as fast as possible, results in
vehicle clustering, short waits followed by long waits (time between buses).
Thesis:
Out of
Equilibrium
Short Wait Time
Lower Bus Occupancy
Bus speeds up with fewer riders
5. Early Insights
Optimizing for equipment utilization, looping as fast as possible, results in
vehicle clustering, short waits followed by long waits (time between buses).
Thesis:
Out of
Equilibrium
Longer Wait Time
Higher Bus Occupancy
Bus Slows down with increased riders
5. Early Insights
Optimizing for equipment utilization, looping as fast as possible, results in
vehicle clustering, short waits followed by long waits (time between buses).
Thesis:
Out of
Equilibrium
Un-Uniform Wait Times irritating riders
Un-Uniform Occupancy reducing utilization
Speed impacted, exacerbates clustering
6. Early Recommendations to MBTA
• Align schedules, printed and GTIF/internal
• Set realistic schedules based on analysis
• Space vehicles more uniformly to reduce maximum rider wait
times
– Requires thought to balance schedule with rider experience – should
be “tuned” through A/B testing
• Send location update when bus at stop and door open
– Improve accuracy of future analyses
• Add Ridership data to feed
– # people (or weight of vehicle for approximation)
– Anonymized load/unload so individual rider trips can be analyzed
6. Early Recommendations to MBTA
• Can’t Accelerate Slow Bus
– Road Speed can’t be influenced
– Skipping stops will anger waiting riders
• Only options is to decelerate Fast Bus
– Bus waits at stops to maintain spacing
Maintaining equilibrium
6. Early Recommendations to MBTA
“Uber-fication” of buses – a connected app managing network of vehicles
Wait at Stop
Go when able
6. Early Recommendations to MBTA
• Slowing bus
– Will reduce number of feasible trips and
scheduled time between buses
– Will improve uniformity of wait time
– Likely to improve utilization rate (fewer over-
crowded buses, few people left at curb)
– Likely to improve overall rider satisfaction
Thank you

More Related Content

Viewers also liked

動物園公廁管理與清潔分享
動物園公廁管理與清潔分享動物園公廁管理與清潔分享
動物園公廁管理與清潔分享epaslideshare
 
Enterprise 2.0 (fixed)
Enterprise 2.0 (fixed)Enterprise 2.0 (fixed)
Enterprise 2.0 (fixed)
Dipock Das
 
Pascua
PascuaPascua
09 Higher Revision Notes
09  Higher Revision Notes09  Higher Revision Notes
09 Higher Revision Notes
s2francetrip
 
700.09 protocol for treatment of dogs and cats on surgery log
700.09 protocol for treatment of dogs and cats on surgery log700.09 protocol for treatment of dogs and cats on surgery log
700.09 protocol for treatment of dogs and cats on surgery log
No Kill Shelter Alliance
 
paper 4
paper 4paper 4
Noticia sociales (1)
Noticia sociales (1)Noticia sociales (1)
Noticia sociales (1)
Jose Ramon Macias Ballestero
 
IT cv
IT cvIT cv
Investigación de mercados "SMARTPHONE"
Investigación de mercados "SMARTPHONE"Investigación de mercados "SMARTPHONE"
Investigación de mercados "SMARTPHONE"
Sssalazar
 
El teatro 1920-1940
El teatro 1920-1940El teatro 1920-1940
El teatro 1920-1940
kikapu8
 
Dimensional Tolerances
Dimensional TolerancesDimensional Tolerances
Dimensional Tolerances
s2francetrip
 
Elastic impression materials/cosmetic dentistry courses
Elastic impression materials/cosmetic dentistry coursesElastic impression materials/cosmetic dentistry courses
Elastic impression materials/cosmetic dentistry courses
Indian dental academy
 
Downhill consultants
Downhill   consultantsDownhill   consultants
Downhill consultants
jackson4545
 

Viewers also liked (14)

動物園公廁管理與清潔分享
動物園公廁管理與清潔分享動物園公廁管理與清潔分享
動物園公廁管理與清潔分享
 
Enterprise 2.0 (fixed)
Enterprise 2.0 (fixed)Enterprise 2.0 (fixed)
Enterprise 2.0 (fixed)
 
Pascua
PascuaPascua
Pascua
 
09 Higher Revision Notes
09  Higher Revision Notes09  Higher Revision Notes
09 Higher Revision Notes
 
700.09 protocol for treatment of dogs and cats on surgery log
700.09 protocol for treatment of dogs and cats on surgery log700.09 protocol for treatment of dogs and cats on surgery log
700.09 protocol for treatment of dogs and cats on surgery log
 
paper 4
paper 4paper 4
paper 4
 
Noticia sociales (1)
Noticia sociales (1)Noticia sociales (1)
Noticia sociales (1)
 
4) Era Khulafa & Dinasti Umayya
4) Era Khulafa & Dinasti Umayya4) Era Khulafa & Dinasti Umayya
4) Era Khulafa & Dinasti Umayya
 
IT cv
IT cvIT cv
IT cv
 
Investigación de mercados "SMARTPHONE"
Investigación de mercados "SMARTPHONE"Investigación de mercados "SMARTPHONE"
Investigación de mercados "SMARTPHONE"
 
El teatro 1920-1940
El teatro 1920-1940El teatro 1920-1940
El teatro 1920-1940
 
Dimensional Tolerances
Dimensional TolerancesDimensional Tolerances
Dimensional Tolerances
 
Elastic impression materials/cosmetic dentistry courses
Elastic impression materials/cosmetic dentistry coursesElastic impression materials/cosmetic dentistry courses
Elastic impression materials/cosmetic dentistry courses
 
Downhill consultants
Downhill   consultantsDownhill   consultants
Downhill consultants
 

Similar to 2015-12-04 Overview & Requirements Gathering

Innovation in London’s transport: Big Data for a better customer experience
Innovation in London’s transport: Big Data for a better customer experienceInnovation in London’s transport: Big Data for a better customer experience
Innovation in London’s transport: Big Data for a better customer experience
BCS Data Management Specialist Group
 
January service change final page spread
January service change final page spreadJanuary service change final page spread
January service change final page spread
cotabus
 
The possibility to redesign 255 bus route in order reduce dead journey
The possibility to redesign 255 bus route in order reduce dead journeyThe possibility to redesign 255 bus route in order reduce dead journey
The possibility to redesign 255 bus route in order reduce dead journey
Rafhan Rifan
 
2017 07-05 Transportation Needs assessment
2017 07-05 Transportation Needs assessment2017 07-05 Transportation Needs assessment
2017 07-05 Transportation Needs assessment
Rishi Kumar
 
Apr8 10 2014_public_mtg_draft3
Apr8 10 2014_public_mtg_draft3Apr8 10 2014_public_mtg_draft3
Apr8 10 2014_public_mtg_draft3
cotabus
 
Presentation To Shaw Group Final Nov 6
Presentation To Shaw Group Final Nov 6Presentation To Shaw Group Final Nov 6
Presentation To Shaw Group Final Nov 6
André Darmanin
 
Full cost reliability by Juan Carlos Muñoz
Full cost reliability by Juan Carlos MuñozFull cost reliability by Juan Carlos Muñoz
Full cost reliability by Juan Carlos Muñoz
BRTCoE
 
COTA's September 2013 Service Changes
COTA's September 2013 Service Changes COTA's September 2013 Service Changes
COTA's September 2013 Service Changes
COTA BUS
 
2016 0505 st3_draftplanpi_report
2016 0505 st3_draftplanpi_report2016 0505 st3_draftplanpi_report
2016 0505 st3_draftplanpi_report
Zach Shaner
 
Travel pattern in the near future
Travel pattern in the near futureTravel pattern in the near future
Travel pattern in the near future
Heemanish Midde
 
TRB-762_KJackson_SiteSelectionMethods-Notes
TRB-762_KJackson_SiteSelectionMethods-NotesTRB-762_KJackson_SiteSelectionMethods-Notes
TRB-762_KJackson_SiteSelectionMethods-Notes
Kristy Jackson
 
Together B.R._Transit Presentation 2011
Together B.R._Transit Presentation 2011Together B.R._Transit Presentation 2011
Together B.R._Transit Presentation 2011
Together Baton Rouge
 
Public Transportation in Winnipeg
Public Transportation in WinnipegPublic Transportation in Winnipeg
Public Transportation in Winnipeg
English Online Inc.
 
Summary Report 020913
Summary Report 020913Summary Report 020913
Summary Report 020913
Shaz Umer
 
Traffic Control Center Award
Traffic Control Center AwardTraffic Control Center Award
Traffic Control Center Award
City of College Station
 
Caltrain_Equity_522
Caltrain_Equity_522Caltrain_Equity_522
Caltrain_Equity_522
Adina Levin
 
Transit services in norfolk for dnc meeting
Transit services in norfolk for dnc meetingTransit services in norfolk for dnc meeting
Transit services in norfolk for dnc meeting
Russell Manning
 
640
640640
Dec 3 10_2013_public_mtg_projects_update final
Dec 3 10_2013_public_mtg_projects_update finalDec 3 10_2013_public_mtg_projects_update final
Dec 3 10_2013_public_mtg_projects_update final
cotabus
 
Kyle Field Game Day Transportation Plan
Kyle Field Game Day Transportation PlanKyle Field Game Day Transportation Plan
Kyle Field Game Day Transportation Plan
City of College Station
 

Similar to 2015-12-04 Overview & Requirements Gathering (20)

Innovation in London’s transport: Big Data for a better customer experience
Innovation in London’s transport: Big Data for a better customer experienceInnovation in London’s transport: Big Data for a better customer experience
Innovation in London’s transport: Big Data for a better customer experience
 
January service change final page spread
January service change final page spreadJanuary service change final page spread
January service change final page spread
 
The possibility to redesign 255 bus route in order reduce dead journey
The possibility to redesign 255 bus route in order reduce dead journeyThe possibility to redesign 255 bus route in order reduce dead journey
The possibility to redesign 255 bus route in order reduce dead journey
 
2017 07-05 Transportation Needs assessment
2017 07-05 Transportation Needs assessment2017 07-05 Transportation Needs assessment
2017 07-05 Transportation Needs assessment
 
Apr8 10 2014_public_mtg_draft3
Apr8 10 2014_public_mtg_draft3Apr8 10 2014_public_mtg_draft3
Apr8 10 2014_public_mtg_draft3
 
Presentation To Shaw Group Final Nov 6
Presentation To Shaw Group Final Nov 6Presentation To Shaw Group Final Nov 6
Presentation To Shaw Group Final Nov 6
 
Full cost reliability by Juan Carlos Muñoz
Full cost reliability by Juan Carlos MuñozFull cost reliability by Juan Carlos Muñoz
Full cost reliability by Juan Carlos Muñoz
 
COTA's September 2013 Service Changes
COTA's September 2013 Service Changes COTA's September 2013 Service Changes
COTA's September 2013 Service Changes
 
2016 0505 st3_draftplanpi_report
2016 0505 st3_draftplanpi_report2016 0505 st3_draftplanpi_report
2016 0505 st3_draftplanpi_report
 
Travel pattern in the near future
Travel pattern in the near futureTravel pattern in the near future
Travel pattern in the near future
 
TRB-762_KJackson_SiteSelectionMethods-Notes
TRB-762_KJackson_SiteSelectionMethods-NotesTRB-762_KJackson_SiteSelectionMethods-Notes
TRB-762_KJackson_SiteSelectionMethods-Notes
 
Together B.R._Transit Presentation 2011
Together B.R._Transit Presentation 2011Together B.R._Transit Presentation 2011
Together B.R._Transit Presentation 2011
 
Public Transportation in Winnipeg
Public Transportation in WinnipegPublic Transportation in Winnipeg
Public Transportation in Winnipeg
 
Summary Report 020913
Summary Report 020913Summary Report 020913
Summary Report 020913
 
Traffic Control Center Award
Traffic Control Center AwardTraffic Control Center Award
Traffic Control Center Award
 
Caltrain_Equity_522
Caltrain_Equity_522Caltrain_Equity_522
Caltrain_Equity_522
 
Transit services in norfolk for dnc meeting
Transit services in norfolk for dnc meetingTransit services in norfolk for dnc meeting
Transit services in norfolk for dnc meeting
 
640
640640
640
 
Dec 3 10_2013_public_mtg_projects_update final
Dec 3 10_2013_public_mtg_projects_update finalDec 3 10_2013_public_mtg_projects_update final
Dec 3 10_2013_public_mtg_projects_update final
 
Kyle Field Game Day Transportation Plan
Kyle Field Game Day Transportation PlanKyle Field Game Day Transportation Plan
Kyle Field Game Day Transportation Plan
 

2015-12-04 Overview & Requirements Gathering

  • 1. MBTA Route Performance Analysis Rider ExperienceSchedule Analysis Vehicle Optimization
  • 2. Agenda 1. Problem Statement: Transportation Issues 2. Data, What it can/cannot tell us 3. Platform Overview 4. Early Observations 5. Early Insights 6. Early Recommendations to MBTA
  • 3. 1. Problem Statement: Transportation Issues • Waits can be long • Buses not always showing up on schedule • Buses sometimes stacked one-after-another which is wasteful • How much, how often, and why?
  • 4. 2. Data, What it can tell us "route_id": "71", "mode_name": "Bus", "direction_name": "Inbound", "trip_id": "28346992", "trip_name": "9:30 pm from Watertown Sq …", "vehicle_id": "y4128", "vehicle_lat": "42.3747940063477", "vehicle_lon": "-71.1326065063477", "vehicle_bearing": "85", "vehicle_timestamp": "1447988172" 11/19/2015 9:56:12 PM Every 60 seconds, each bus reports its position These dots represent the 20+ pings of a typical trip of the Route 71 bus
  • 5. 2. Data, What it can tell us route_id: 71 INDEX: 42 trip_id: 28346775 stop_id: stop_sequence: vehicle_id: y4121 vehicle_lat: 42.3740653991699 vehicle_lon: -71.1249313354492 vehicle_timestamp: 1444948220 route_id: 71 INDEX: 41 trip_id: 28346775 stop_id: 2073 stop_sequence: 21 vehicle_id: y4121 vehicle_lat: 42.373956 vehicle_lon: -71.124752 vehicle_timestamp: 1444948214 route_id: 71 INDEX: 43 trip_id: 28346775 stop_id: 2074 stop_sequence: 22 vehicle_id: y4121 vehicle_lat: 42.373356 vehicle_lon: -71.123226 vehicle_timestamp: 1444948311 By portioning time according to distance, we can estimate when the bus was at the stop.
  • 6. 2. Data, What it can tell us StopsStopsStopsStopsStopsBus DataBus DataBus DataBus Data
  • 7. 2. Data, What it cannot tell us We don’t know how many people are on the bus: - Can’t tell whether riders are getting left at the curb (full bus) - Can’t tell the busy stops from the light stops - Can’t understand utilization by segment
  • 8. 3. Platform Overview Web-based “SaaS” solution - Add/Del Tracked Routes - Summary Scorecard - Detailed analysis - Schedule Variance - Schedule Viability - Wait time Distribution - Vehicle Deployment Policy
  • 9. 3. Platform Overview Scorecard, comprised of metrics, so that • Best performing routes can be learned from, • Worst performing routes can be addressed
  • 10. Heat map to show Schedule Variance Distribution to show Wait Times Maps to show Bus Locations Logs to validate charts
  • 11. DONE 3. Platform Overview http://realtime.mbta.com/portal current_location raw_location Fetcher() trip_actuals raw_to_trip() trip_variance compute_trip_variance()
  • 12. DONE 3. Platform Overview http://realtime.mbta.com/portal current_location raw_location Fetcher() trip_actuals raw_to_trip() trip_variance compute_trip_variance()
  • 13. GTIF Schedules 4. Early Observations 5:13 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 5:33 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 5:53 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 6:06 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 6:14 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 6:20 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 6:27 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 6:35 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 6:42 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 6:49 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 6:56 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 7:11 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 7:20 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 7:29 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 7:38 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 7:45 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 7:52 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 7:59 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 8:10 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 8:23 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 8:33 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 8:43 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 8:48 am from Watertown Sq Terminal to Harvard Upper Busway @ Red Line 8:54 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 8:58 am from Watertown Sq Terminal to Harvard Upper Busway @ Red Line 9:09 am from Watertown Sq Terminal to Harvard Upper Busway @ Red Line 9:15 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 9:24 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 9:34 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 9:44 am from Watertown Sq Terminal to Harvard Upper Busway @ Red Line 9:55 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 10:10 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 10:25 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 10:40 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 10:55 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 11:10 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave 11:25 am from Watertown Sq Terminal to Waterhouse St @ Massachusetts Ave Printed Schedule Published GTIF Schedule (Database)  Printed schedule not the same as the operations schedule.
  • 14. 4. Early Observations Frequency 0 earlier 0 15 to 20 min early 1 10 to 15 min early 4 5 to 10 min early 165 2 to 5 min early 1202 +/- 2 min 699 2 to 5 min late 617 5 to 10 min late 397 10 to 15 min late 354 15 to 20 min late 374 later 0 Early Late According to Schedule … Route 71 is frequently early or late, too-often it is very late.
  • 15. 4. Early Observations Route 71 wait time (time between buses) can be significant. Frequency no wait 4 0-5 mins 1099 5-10 mins 809 10-15 mins 761 15-20 mins 466 20-25 mins 313 25-30 mins 169 30-35 mins 53 35-40 mins 51 40-45 mins 27 45-50 mins 5 50 mins or more 8 Min 0:00:00 Max 1:56:45 AVG 0:11:24 Median 0:09:50 29% > 15 min wait
  • 16. 4. Early ObservationsMorningRushEveningRush 10/19/2015 5:37 10/19/2015 5:57 10/19/2015 6:05 10/19/2015 6:11 10/19/2015 6:17 10/19/2015 6:31 10/19/2015 6:48 10/19/2015 6:41 10/19/2015 6:55 10/19/2015 7:02 10/19/2015 7:09 10/19/2015 7:16 10/19/2015 7:24 10/19/2015 7:30 <10 min 10/19/2015 7:42 10/19/2015 7:56 10/19/2015 8:12 10/19/2015 8:04 10/19/2015 8:26 10/19/2015 8:19 10/19/2015 8:33 10/19/2015 8:43 10/19/2015 8:48 10/19/2015 8:58 10/19/2015 9:18 10/19/2015 9:08 10/19/2015 9:30 10/19/2015 9:42 10/19/2015 9:55 <15 min 10/19/2015 10:25 10/19/2015 10:40 10/19/2015 11:10 10/19/2015 11:25 10/19/2015 11:40 10/19/2015 11:55 10/19/2015 12:08 10/19/2015 12:23 10/19/2015 12:38 10/19/2015 12:53 10/19/2015 13:08 10/19/2015 13:23 10/19/2015 13:38 10/19/2015 13:53 10/19/2015 14:06 10/19/2015 14:14 10/19/2015 14:24 10/19/2015 14:43 10/19/2015 14:51 10/19/2015 14:59 10/19/2015 15:10 10/19/2015 15:17 10/19/2015 15:28 10/19/2015 15:37 <10 min 10/19/2015 15:55 10/19/2015 16:04 10/19/2015 16:12 10/19/2015 16:33 10/19/2015 16:24 10/19/2015 16:43 10/19/2015 17:00 10/19/2015 17:10 10/19/2015 17:37 10/19/2015 17:19 10/19/2015 17:46 10/19/2015 18:04 10/19/2015 17:55 10/19/2015 18:13 10/19/2015 18:22 10/19/2015 18:40 10/19/2015 18:31 10/19/2015 18:49 10/19/2015 19:00 10/19/2015 19:22 10/19/2015 19:36 10/19/2015 19:50 Worst waits are at peak periods!
  • 17. 4. Early Observations TRIP TIMES min 0:15:37 max 2:50:06 avg 0:27:12 median 0:25:19 End-to-end trip times vary widely, but averages are not sensitive to changing conditions of the day … we need to observe more closely Outbound, Total Time Inbound, Total Time
  • 18. 4. Early Insights End-to-end trip times vary widely, depending on time of day. Schedules could be aligned more closely to the reality of traffic on the route throughout the day.
  • 19. 4. Early Observations End-to-end trip times vary widely, depending on time of day. Schedules could be aligned more closely to the reality of traffic on the route throughout the day.
  • 20. 4. Early Observations A second look at the day reveals that schedule is overly aggressive and can’t be met. Either schedule should be reduced to meet equipment or equipment should be increased to meet schedule. > Late more often than Early Not enough equipment to meet Schedule
  • 21. 4. Summary of Early Observations • Insufficient buses to meet 71 schedule – Fall behind during peek periods – Almost catch up during lull periods – Missed Trips, 17 of 630 on 10/19/15 = 2.7% • Propensity for Buses to cluster – Un-uniform Wait Times • Very short waits or very long waits • Route 71 Schedule is a fallacy – Printed schedule inconsistent with GTIF – Buses are simply looping as fast as they can – Number of vehicles governs availability
  • 22. 5. Early Insights Optimizing for equipment utilization, looping as fast as possible, results in vehicle clustering, short waits followed by long waits (time between buses). Thesis: Equilibrium Uniform Wait Time Uniform Occupancy Uniform Velocity
  • 23. 5. Early Insights Optimizing for equipment utilization, looping as fast as possible, results in vehicle clustering, short waits followed by long waits (time between buses). Thesis: Out of Equilibrium Short Wait Time Lower Bus Occupancy Bus speeds up with fewer riders
  • 24. 5. Early Insights Optimizing for equipment utilization, looping as fast as possible, results in vehicle clustering, short waits followed by long waits (time between buses). Thesis: Out of Equilibrium Longer Wait Time Higher Bus Occupancy Bus Slows down with increased riders
  • 25. 5. Early Insights Optimizing for equipment utilization, looping as fast as possible, results in vehicle clustering, short waits followed by long waits (time between buses). Thesis: Out of Equilibrium Un-Uniform Wait Times irritating riders Un-Uniform Occupancy reducing utilization Speed impacted, exacerbates clustering
  • 26. 6. Early Recommendations to MBTA • Align schedules, printed and GTIF/internal • Set realistic schedules based on analysis • Space vehicles more uniformly to reduce maximum rider wait times – Requires thought to balance schedule with rider experience – should be “tuned” through A/B testing • Send location update when bus at stop and door open – Improve accuracy of future analyses • Add Ridership data to feed – # people (or weight of vehicle for approximation) – Anonymized load/unload so individual rider trips can be analyzed
  • 27. 6. Early Recommendations to MBTA • Can’t Accelerate Slow Bus – Road Speed can’t be influenced – Skipping stops will anger waiting riders • Only options is to decelerate Fast Bus – Bus waits at stops to maintain spacing Maintaining equilibrium
  • 28. 6. Early Recommendations to MBTA “Uber-fication” of buses – a connected app managing network of vehicles Wait at Stop Go when able
  • 29. 6. Early Recommendations to MBTA • Slowing bus – Will reduce number of feasible trips and scheduled time between buses – Will improve uniformity of wait time – Likely to improve utilization rate (fewer over- crowded buses, few people left at curb) – Likely to improve overall rider satisfaction

Editor's Notes

  1. Here is the raw data interleaved with computed stop data. This view was not only useful for verifying the computed stop data … but also for trouble-shooting anomalies.
  2. We know the MBTA has ridership data because it would be a natural extention to the Charlie Card system. We can also assume that the bus has weight sensors to protect against dangerous overloading – weight could be used to estimate the occupancy of the bus. - This technique is commonly used in relationship to elevators.
  3. My primary objective was to get back into programming. Therefore, developing a web application to collect, transform and present the data was an obvious choice.
  4. The platform would be a “dashboard” where MBTA performance could be understood at a glance. Performance scores would be comprised of metrics such as Schedule Variance, Wait Times, and other possible factors that I would observe once I got into the data.
  5. Here is a closer look. For each metric, I would present the data with a sensible visualization technique, heatmaps for schedule variance, gousian distributions for Wait times and geographical representation to convey location.
  6. After three months of ramping up, testing different programming approaches and getting settled with the coding/deployment environments, I was able to complete the data collection, management, and transformation layers. I now have a system that collects the data from the MBTA systems and prepares it for analysis. But, before I was to begin coding the visualizations, I really needed to get to know more about the data.
  7. There are few tools better than Excel for kicking data around. I spent about a week studying one route – the 71 bus that runs from Harvard Sq to Watertown Yard. What I learned would change my direction…
  8. First and foremost, I observed that the printed schedule are not the same as operational schedules. Perhaps the MBTA feels that no one reads the printed schedules or that the schedules are similar enough that – what the heck … However, riders look a schedules when vehicles are more than 10 minutes apart. Schedules matter.
  9. This chart plots incidents of timeliness Said another way, the X axis shows whether the bus was ON TIME, EARLY or LATE The Y axis is how often. Keep in mind that EARLY is NOT GOOD – especially if you time your arrival at the bus stop according to schedule (assume the schedule is correct). This chart shows that there are as many early buses as late buses – BAD. But there’s another way to look at this … rather than measure early or late according to schedule, let’s just look at the time between buses – how long are riders waiting at the stop?
  10. This chart shows wait times. At first it is surprising to see so many short wait times (0-5 minutes) – wow, that’s great! Then, you realize that when buses get close together, they widen the gap on the other side. For every short wait, there is a compensatory long wait. Routes are closed systems with a fixed number of vehicles – ideally, they should be spaced uniformly. But this is not the whole picture. These averages don’t tell us what’s going on at peak usage periods.
  11. These charts show the same data bucketed by peak and off-peak periods. The worst periods, as one would expect are the peak periods when riders depend most on the service. Simply stated, the buses fall behind schedule during peak periods then catch up through the off-peak periods Averaging wait times on a daily basis nets the early with the late and doesn’t looks so bad. Bucketing shows how bad it really is. … but WHY IS IT SO BAD?
  12. To get a better sense of why it’s bad, let’s look at the feasability of the trips. How long does it REALLY take for the bus to run the trips? To answer this question we subtract the start time from the finish time – Here it is presented as a distribution across the day. We see that the longest trip times are 3X the shortest – traffic conditions and rider load/unload times really impact overall trip time. But, like the wait time analysis, we would expect this to vary by time of day – let’s take a closer look…
  13. Here are the outbound trips of a single day – statistically, not a large data set but perhaps telling. The orange bars are the actual trip times, the blue line is the scheduled trips. We expect variance, but it seems that the morning commute schedule is unrealistic – the bus never meets the expect trip time and falls behind. Next, you can see that the bus beats the scheduled trip time through the off-peak period – the bus is catching up to schedule. The evening commute does not look all that bad.
  14. Here is the Inbound side. Whoa! Why are there so many more Inbound Trips than Outbound Trips?!?!? We expect the number of Inbound trips to be equal to the number of outbound trips. Simply put, I threw out more Outbound trip data because of anomalies – it deserves further investigation, but I think I know why – the 71 bus starts in the tunnel and it takes time for the GPS to get accurate once the bus comes out of the tunnel. As a result, I don’t get location data or the data is deemed inaccurate at the start of the trip. I throw it out because I can’t estimate time accurately across multiple stops. Anyway … Here we see the morning schedule is no better than the Inbound side – as expected. Strangely, however, we see something weird happen in the evening. Why would trip times oscillate as they do? This deserves further investigation.
  15. Area above the curve is greater than the area under the curve. Buses are more often late than early - the 2.7% we observed.
  16. … The number of missed trips, 2.7%, does not look that bad – however, it’s not the whole picture as the data will show. Bottom line, with initial insights like these, it is difficult to justify building out the system further. The MBTA must be aware and yet there is no evidence of efforts to improve. Said another way, the MBTA should clean up the big items before they consider using a system to refine the little ones. Let’s take a closer look …