SlideShare a Scribd company logo
Predicting Post-Safetrack Metro Reliability
GU SCS Data Science Capstone Project September 10, 2016
Micah Melling
Drew Wheatley
Patrick McGrady
over 250m riders annually
118 miles of track
Facts
over 13 disruptions per day
Problem Statement
Problem Statement
highly publicized safety lapses
& deferred maintenance
1 Year timeframe
estimated $60,000,000 price tag
improved safety & reliability?
Hypothesis
The DC Metro System is a pivotal transportation asset for Washington DC and the surrounding
regions. The SafeTrack project is meant to increase system safety and reliability. While technical
and operational disruptions are inevitable, we believe that available data can provide insight into
how frequently Metro riders will experience post-SafeTrack disruptions and ultimately improve their
Metro commute expectations.
Scenario #1
Improvement
Scenario #2
Improvement
Scenario #3
Improvement
To quantify the outcome, we will explore several scenarios to provide riders with a
clearer picture of their post-safetrack commute.
Scenario #4
Improvement
Scenario #5
Improvement
Data Ingestion & Wrangling
System Operations Data: used to determine system
behavior under optimal conditions
Disruption Data: historical data used to analyze the frequency
and effect of technical and operational
disruptions (ie: delays)
Ridership Data: in conjunction with operational datasets,
ridership data used to quantify and extrapolate
the scope of Metro delays.
The Data
ON TIME
ON TIME
ON TIME DELAYED
DELAYED
DELAYED
Planned Operating
Schedule
Disruption Data
Data_Source: wmata.com
Data_Scope:
Provided operating data
under a perfectly
efficient system with no
delays or disruptions
Data_Scope:
Provided 5 years of daily
disruption logs,
including; cause of
disruption and minutes
delayed
Data_Source: opendatadc
Planned
Operating
Schedule and
Disruption Data
provided
a basis for
comparing pre
and post-
safetrack system
behavior
LN CAR DEST MINLN CAR DEST MIN
RD 6
RD 6
RD 6
RD 6
RD 6
RD 6
The Data
24,335
records
between April
2012 - July
2016
All Metro lines
represented in
the dataset
Description of
disruption
cause.
Translated as
technical or
operational
Delay, in
minutes
Computation & Analysis: Limitations
AccuracyLocation
Station - To -
Station
‘Garbage in -
Garbage out’
concept
Opted to take a two-pronged approach:
1.) Build data product
2.) Develop simulation based on available data
Completeness
Compounding
Delays
Computation & Analysis: Methodology
1
Calculated the number of minutes of trips per day on
each line.
Broke daily delays into five tiers based on severity.
Scenario:1 Scenario:2 Scenario:3 Scenario:5
Tier 2
Tier 3
Tier 4
Tier 5
Tier 1
Tier 2
Tier 3
Tier 4
Tier 5
Tier 1
Tier 2
Tier 3
Tier 4
Tier 5
Tier 1
Tier 2
Tier 3
Tier 4
Tier 5
Tier 1
Tier 2
Tier 3
Tier 4
Tier 1
Scenario:4
Built in compounding delays based on expected train
departures.
Injected random noise into the system.
2
3
4
Results of Simulated Scenario
A Look Under The Hood
[software system demo]
Results
Created visualizations of
the various simulations
Analyzed results to
determine the shape of the
data
Results
Current 9861.402 102.51522
Scenario #1 9868.713 97.10936
Scenario #2 9854.400 108.57028
Scenario #3 9852.256 102.1384
Scenario #4 9850.429 101.7149
Scenario #5 9848.057 104.1241
Current 8121.386 95.954
Scenario #1 8117.496 97.341
Scenario #2 8115.761 99.953
Scenario #3 8114.653 104.407
Scenario #4 8104.47 99.702
Scenario #5 8093.36 98.429
Current 5280.572 100.5566
Scenario #1 5261.651 92.5748
Scenario #2 5262.043 114.093
Scenario #3 5020.293 41.431
Scenario #4 5014.868 41.251
Scenario #5 5013.92 40.980
Current 6762.053 97.839
Scenario #1 6765.053 97.839
Scenario #2 6759.09 103.266
Scenario #3 6562.22 52.973
Scenario #4 6552.85 53.316
Scenario #5 6540.79 48.947
Current 6811.311 108.8495
Scenario #1 6815.311 108.8495
Scenario #2 6816.787 105.2023
Scenario #3 6809.531 108.5713
Scenario #4 6810.966 97.1970
Scenario #5 6809.322 98.0109
Current 11149.5 97.3886
Scenario #1 11159.6 98.4512
Scenario #2 11146.33 99.5911
Scenario #3 11138.77 112.8393
Scenario #4 11132.07 97.0613
Scenario #5 11123.83 101.226
Conclusions
Scenario #1 Scenario #2 Scenario #3 Scenario #4 Scenario #5
Noticeable improvements in time and probability of delay was not realized until
higher scenario parameters were introduced.
Analysis of the results indicates that SafeTrack repairs
must reduce disruption severity and probability by
roughly 30% - 50% for Metro riders to experience
improved trip safety and reliability.
Conclusions
Improvements in
Stochastic System
Biases &
Assumptions
Data Quality
Springboard for
Future Work
SafeTrack’s improvements
may not be noticed if they
do not overcome the
system’s random noise
Recognizing biases and
stating assumptions is
key to data science
The importance of
accurate data cannot be
overstated
Our software can be
generalized and adapted
Questions?
?

More Related Content

Viewers also liked

Ta3alo ta3alo ya ta3aba
Ta3alo ta3alo ya ta3abaTa3alo ta3alo ya ta3aba
Ta3alo ta3alo ya ta3abaAt Minacenter
 
Valeska Mello Ama Nx
Valeska Mello Ama NxValeska Mello Ama Nx
Valeska Mello Ama Nxguestab4225
 
Presentación
Presentación Presentación
Presentación
Wilmer Cutacan
 
αναστασης
αναστασηςαναστασης
αναστασης
3dimhol
 
Liderazgo femenino
Liderazgo femenino Liderazgo femenino
Liderazgo femenino
Voces Vitales Argentina
 
Escuela de liderazgo
Escuela de liderazgoEscuela de liderazgo
Escuela de liderazgo
Bruno Moioli Montenegro
 
My daily routine
My daily routineMy daily routine
My daily routine
yiliyo
 
Arch Ahmed Ashraf Design,Fit-Out & Marble Works Samples
Arch Ahmed Ashraf Design,Fit-Out & Marble Works SamplesArch Ahmed Ashraf Design,Fit-Out & Marble Works Samples
Arch Ahmed Ashraf Design,Fit-Out & Marble Works Samplesahmed ashraf
 
Geopolítica da ásia central
Geopolítica da ásia centralGeopolítica da ásia central
Geopolítica da ásia central
je1981
 
Aνθρώπινα Δικαιώματα
Aνθρώπινα ΔικαιώματαAνθρώπινα Δικαιώματα
Aνθρώπινα Δικαιώματα
xrysa123
 
Presentación taller visualizando lo mejor de ti.
Presentación taller visualizando lo mejor de ti.Presentación taller visualizando lo mejor de ti.
Presentación taller visualizando lo mejor de ti.
Computer Learning Centers
 

Viewers also liked (14)

Ta3alo ta3alo ya ta3aba
Ta3alo ta3alo ya ta3abaTa3alo ta3alo ya ta3aba
Ta3alo ta3alo ya ta3aba
 
Valeska Mello Ama Nx
Valeska Mello Ama NxValeska Mello Ama Nx
Valeska Mello Ama Nx
 
Presentación
Presentación Presentación
Presentación
 
αναστασης
αναστασηςαναστασης
αναστασης
 
Liderazgo femenino
Liderazgo femenino Liderazgo femenino
Liderazgo femenino
 
Escuela de liderazgo
Escuela de liderazgoEscuela de liderazgo
Escuela de liderazgo
 
My daily routine
My daily routineMy daily routine
My daily routine
 
Arch Ahmed Ashraf Design,Fit-Out & Marble Works Samples
Arch Ahmed Ashraf Design,Fit-Out & Marble Works SamplesArch Ahmed Ashraf Design,Fit-Out & Marble Works Samples
Arch Ahmed Ashraf Design,Fit-Out & Marble Works Samples
 
Meen ra7 yefselny
Meen ra7 yefselnyMeen ra7 yefselny
Meen ra7 yefselny
 
Rab el 7asad
Rab el 7asadRab el 7asad
Rab el 7asad
 
Geopolítica da ásia central
Geopolítica da ásia centralGeopolítica da ásia central
Geopolítica da ásia central
 
Aνθρώπινα Δικαιώματα
Aνθρώπινα ΔικαιώματαAνθρώπινα Δικαιώματα
Aνθρώπινα Δικαιώματα
 
Presentación taller visualizando lo mejor de ti.
Presentación taller visualizando lo mejor de ti.Presentación taller visualizando lo mejor de ti.
Presentación taller visualizando lo mejor de ti.
 
Vulva
VulvaVulva
Vulva
 

Similar to Gu scs team2 a_metro project_pdf

[Tutorial] building machine learning models for predictive maintenance applic...
[Tutorial] building machine learning models for predictive maintenance applic...[Tutorial] building machine learning models for predictive maintenance applic...
[Tutorial] building machine learning models for predictive maintenance applic...
PAPIs.io
 
Urban flood prediction digital ocean august edition
Urban flood prediction   digital ocean august editionUrban flood prediction   digital ocean august edition
Urban flood prediction digital ocean august edition
transight
 
Data-Driven Security Assessment of Power Grids Based on Machine Learning Appr...
Data-Driven Security Assessment of Power Grids Based on Machine Learning Appr...Data-Driven Security Assessment of Power Grids Based on Machine Learning Appr...
Data-Driven Security Assessment of Power Grids Based on Machine Learning Appr...
Power System Operation
 
Data-Driven Security Assessment of Power Grids Based on Machine Learning Appr...
Data-Driven Security Assessment of Power Grids Based on Machine Learning Appr...Data-Driven Security Assessment of Power Grids Based on Machine Learning Appr...
Data-Driven Security Assessment of Power Grids Based on Machine Learning Appr...
Power System Operation
 
IRJET- Online Failure Prediction for Railway Transportation System based ...
IRJET-  	  Online Failure Prediction for Railway Transportation System based ...IRJET-  	  Online Failure Prediction for Railway Transportation System based ...
IRJET- Online Failure Prediction for Railway Transportation System based ...
IRJET Journal
 
IRJET - Steering Wheel Angle Prediction for Self-Driving Cars
IRJET - Steering Wheel Angle Prediction for Self-Driving CarsIRJET - Steering Wheel Angle Prediction for Self-Driving Cars
IRJET - Steering Wheel Angle Prediction for Self-Driving Cars
IRJET Journal
 
Real-Time Automated Overspeeding Detection and Identification System
Real-Time Automated Overspeeding Detection and Identification SystemReal-Time Automated Overspeeding Detection and Identification System
Real-Time Automated Overspeeding Detection and Identification System
IRJET Journal
 
Predicting Post-SafeTrack Metro Reliability
Predicting Post-SafeTrack Metro ReliabilityPredicting Post-SafeTrack Metro Reliability
Predicting Post-SafeTrack Metro ReliabilityMicah Melling
 
Study of Reliability Analysis to the Iraqi South Region Network
Study of Reliability Analysis to the Iraqi South Region NetworkStudy of Reliability Analysis to the Iraqi South Region Network
Study of Reliability Analysis to the Iraqi South Region Network
IRJET Journal
 
Data mining & predictive analytics for US Airlines' performance
Data mining & predictive analytics for US Airlines' performanceData mining & predictive analytics for US Airlines' performance
Data mining & predictive analytics for US Airlines' performanceAkiso Yadav
 
Reliability analysis of wireless automotive applications with transceiver red...
Reliability analysis of wireless automotive applications with transceiver red...Reliability analysis of wireless automotive applications with transceiver red...
Reliability analysis of wireless automotive applications with transceiver red...
rchulyada
 
Thesis_Final_Afnan_27072016_EngD (1)
Thesis_Final_Afnan_27072016_EngD (1)Thesis_Final_Afnan_27072016_EngD (1)
Thesis_Final_Afnan_27072016_EngD (1)Dr. Afnan Ullah Khan
 
Machine Learning Impact on IoT - Part 2
Machine Learning Impact on IoT - Part 2Machine Learning Impact on IoT - Part 2
Machine Learning Impact on IoT - Part 2
Value Amplify Consulting
 
Distributed Traffic management framework
Distributed Traffic management frameworkDistributed Traffic management framework
Distributed Traffic management framework
Saurabh Nambiar
 
Safety Helmet Detection in Engineering and Management
Safety Helmet Detection in Engineering and ManagementSafety Helmet Detection in Engineering and Management
Safety Helmet Detection in Engineering and Management
IRJET Journal
 
IRJET- Accident Detection and Vehicle Safety using Zigbee
IRJET-  	  Accident Detection and Vehicle Safety using ZigbeeIRJET-  	  Accident Detection and Vehicle Safety using Zigbee
IRJET- Accident Detection and Vehicle Safety using Zigbee
IRJET Journal
 
Machine Learning Model for M.S admissions
Machine Learning Model for M.S admissionsMachine Learning Model for M.S admissions
Machine Learning Model for M.S admissions
Omkar Rane
 
Surface profile detection for laser beam auto focusing
Surface profile detection for laser beam auto focusingSurface profile detection for laser beam auto focusing
Surface profile detection for laser beam auto focusing
Dana Lee Church
 

Similar to Gu scs team2 a_metro project_pdf (20)

[Tutorial] building machine learning models for predictive maintenance applic...
[Tutorial] building machine learning models for predictive maintenance applic...[Tutorial] building machine learning models for predictive maintenance applic...
[Tutorial] building machine learning models for predictive maintenance applic...
 
Urban flood prediction digital ocean august edition
Urban flood prediction   digital ocean august editionUrban flood prediction   digital ocean august edition
Urban flood prediction digital ocean august edition
 
Data-Driven Security Assessment of Power Grids Based on Machine Learning Appr...
Data-Driven Security Assessment of Power Grids Based on Machine Learning Appr...Data-Driven Security Assessment of Power Grids Based on Machine Learning Appr...
Data-Driven Security Assessment of Power Grids Based on Machine Learning Appr...
 
Data-Driven Security Assessment of Power Grids Based on Machine Learning Appr...
Data-Driven Security Assessment of Power Grids Based on Machine Learning Appr...Data-Driven Security Assessment of Power Grids Based on Machine Learning Appr...
Data-Driven Security Assessment of Power Grids Based on Machine Learning Appr...
 
IRJET- Online Failure Prediction for Railway Transportation System based ...
IRJET-  	  Online Failure Prediction for Railway Transportation System based ...IRJET-  	  Online Failure Prediction for Railway Transportation System based ...
IRJET- Online Failure Prediction for Railway Transportation System based ...
 
IRJET - Steering Wheel Angle Prediction for Self-Driving Cars
IRJET - Steering Wheel Angle Prediction for Self-Driving CarsIRJET - Steering Wheel Angle Prediction for Self-Driving Cars
IRJET - Steering Wheel Angle Prediction for Self-Driving Cars
 
Real-Time Automated Overspeeding Detection and Identification System
Real-Time Automated Overspeeding Detection and Identification SystemReal-Time Automated Overspeeding Detection and Identification System
Real-Time Automated Overspeeding Detection and Identification System
 
Predicting Post-SafeTrack Metro Reliability
Predicting Post-SafeTrack Metro ReliabilityPredicting Post-SafeTrack Metro Reliability
Predicting Post-SafeTrack Metro Reliability
 
Study of Reliability Analysis to the Iraqi South Region Network
Study of Reliability Analysis to the Iraqi South Region NetworkStudy of Reliability Analysis to the Iraqi South Region Network
Study of Reliability Analysis to the Iraqi South Region Network
 
Data mining & predictive analytics for US Airlines' performance
Data mining & predictive analytics for US Airlines' performanceData mining & predictive analytics for US Airlines' performance
Data mining & predictive analytics for US Airlines' performance
 
Reliability analysis of wireless automotive applications with transceiver red...
Reliability analysis of wireless automotive applications with transceiver red...Reliability analysis of wireless automotive applications with transceiver red...
Reliability analysis of wireless automotive applications with transceiver red...
 
Thesis_Final_Afnan_27072016_EngD (1)
Thesis_Final_Afnan_27072016_EngD (1)Thesis_Final_Afnan_27072016_EngD (1)
Thesis_Final_Afnan_27072016_EngD (1)
 
finalReport
finalReportfinalReport
finalReport
 
Machine Learning Impact on IoT - Part 2
Machine Learning Impact on IoT - Part 2Machine Learning Impact on IoT - Part 2
Machine Learning Impact on IoT - Part 2
 
Report
ReportReport
Report
 
Distributed Traffic management framework
Distributed Traffic management frameworkDistributed Traffic management framework
Distributed Traffic management framework
 
Safety Helmet Detection in Engineering and Management
Safety Helmet Detection in Engineering and ManagementSafety Helmet Detection in Engineering and Management
Safety Helmet Detection in Engineering and Management
 
IRJET- Accident Detection and Vehicle Safety using Zigbee
IRJET-  	  Accident Detection and Vehicle Safety using ZigbeeIRJET-  	  Accident Detection and Vehicle Safety using Zigbee
IRJET- Accident Detection and Vehicle Safety using Zigbee
 
Machine Learning Model for M.S admissions
Machine Learning Model for M.S admissionsMachine Learning Model for M.S admissions
Machine Learning Model for M.S admissions
 
Surface profile detection for laser beam auto focusing
Surface profile detection for laser beam auto focusingSurface profile detection for laser beam auto focusing
Surface profile detection for laser beam auto focusing
 

Recently uploaded

一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Subhajit Sahu
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 

Recently uploaded (20)

一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 

Gu scs team2 a_metro project_pdf

  • 1. Predicting Post-Safetrack Metro Reliability GU SCS Data Science Capstone Project September 10, 2016 Micah Melling Drew Wheatley Patrick McGrady
  • 2. over 250m riders annually 118 miles of track Facts over 13 disruptions per day Problem Statement
  • 3. Problem Statement highly publicized safety lapses & deferred maintenance 1 Year timeframe estimated $60,000,000 price tag improved safety & reliability?
  • 4. Hypothesis The DC Metro System is a pivotal transportation asset for Washington DC and the surrounding regions. The SafeTrack project is meant to increase system safety and reliability. While technical and operational disruptions are inevitable, we believe that available data can provide insight into how frequently Metro riders will experience post-SafeTrack disruptions and ultimately improve their Metro commute expectations. Scenario #1 Improvement Scenario #2 Improvement Scenario #3 Improvement To quantify the outcome, we will explore several scenarios to provide riders with a clearer picture of their post-safetrack commute. Scenario #4 Improvement Scenario #5 Improvement
  • 5. Data Ingestion & Wrangling System Operations Data: used to determine system behavior under optimal conditions Disruption Data: historical data used to analyze the frequency and effect of technical and operational disruptions (ie: delays) Ridership Data: in conjunction with operational datasets, ridership data used to quantify and extrapolate the scope of Metro delays.
  • 6. The Data ON TIME ON TIME ON TIME DELAYED DELAYED DELAYED Planned Operating Schedule Disruption Data Data_Source: wmata.com Data_Scope: Provided operating data under a perfectly efficient system with no delays or disruptions Data_Scope: Provided 5 years of daily disruption logs, including; cause of disruption and minutes delayed Data_Source: opendatadc Planned Operating Schedule and Disruption Data provided a basis for comparing pre and post- safetrack system behavior LN CAR DEST MINLN CAR DEST MIN RD 6 RD 6 RD 6 RD 6 RD 6 RD 6
  • 7. The Data 24,335 records between April 2012 - July 2016 All Metro lines represented in the dataset Description of disruption cause. Translated as technical or operational Delay, in minutes
  • 8. Computation & Analysis: Limitations AccuracyLocation Station - To - Station ‘Garbage in - Garbage out’ concept Opted to take a two-pronged approach: 1.) Build data product 2.) Develop simulation based on available data Completeness Compounding Delays
  • 9. Computation & Analysis: Methodology 1 Calculated the number of minutes of trips per day on each line. Broke daily delays into five tiers based on severity. Scenario:1 Scenario:2 Scenario:3 Scenario:5 Tier 2 Tier 3 Tier 4 Tier 5 Tier 1 Tier 2 Tier 3 Tier 4 Tier 5 Tier 1 Tier 2 Tier 3 Tier 4 Tier 5 Tier 1 Tier 2 Tier 3 Tier 4 Tier 5 Tier 1 Tier 2 Tier 3 Tier 4 Tier 1 Scenario:4 Built in compounding delays based on expected train departures. Injected random noise into the system. 2 3 4
  • 11. A Look Under The Hood [software system demo]
  • 12. Results Created visualizations of the various simulations Analyzed results to determine the shape of the data
  • 13. Results Current 9861.402 102.51522 Scenario #1 9868.713 97.10936 Scenario #2 9854.400 108.57028 Scenario #3 9852.256 102.1384 Scenario #4 9850.429 101.7149 Scenario #5 9848.057 104.1241 Current 8121.386 95.954 Scenario #1 8117.496 97.341 Scenario #2 8115.761 99.953 Scenario #3 8114.653 104.407 Scenario #4 8104.47 99.702 Scenario #5 8093.36 98.429 Current 5280.572 100.5566 Scenario #1 5261.651 92.5748 Scenario #2 5262.043 114.093 Scenario #3 5020.293 41.431 Scenario #4 5014.868 41.251 Scenario #5 5013.92 40.980 Current 6762.053 97.839 Scenario #1 6765.053 97.839 Scenario #2 6759.09 103.266 Scenario #3 6562.22 52.973 Scenario #4 6552.85 53.316 Scenario #5 6540.79 48.947 Current 6811.311 108.8495 Scenario #1 6815.311 108.8495 Scenario #2 6816.787 105.2023 Scenario #3 6809.531 108.5713 Scenario #4 6810.966 97.1970 Scenario #5 6809.322 98.0109 Current 11149.5 97.3886 Scenario #1 11159.6 98.4512 Scenario #2 11146.33 99.5911 Scenario #3 11138.77 112.8393 Scenario #4 11132.07 97.0613 Scenario #5 11123.83 101.226
  • 14. Conclusions Scenario #1 Scenario #2 Scenario #3 Scenario #4 Scenario #5 Noticeable improvements in time and probability of delay was not realized until higher scenario parameters were introduced. Analysis of the results indicates that SafeTrack repairs must reduce disruption severity and probability by roughly 30% - 50% for Metro riders to experience improved trip safety and reliability.
  • 15. Conclusions Improvements in Stochastic System Biases & Assumptions Data Quality Springboard for Future Work SafeTrack’s improvements may not be noticed if they do not overcome the system’s random noise Recognizing biases and stating assumptions is key to data science The importance of accurate data cannot be overstated Our software can be generalized and adapted