SlideShare a Scribd company logo
Deep Reinforcement Learning based
Dynamic Optimization of Bus
Timetable
Ankit Sharma
Bus Timetable Optimization
 Bus timetable optimization is a key issue to reduce operational cost of bus
companies and improve the service quality.
 Heuristic algorithms work in offline and does not account for people flow change.
https://doi.org/10.48550/arXiv.2107.07066
Bus Timetable Optimization
 The optimization of the bus timetable aims to consider the interests of both
passengers and the bus company, and set the departure time of buses to meet
the demand of passenger flow.
 The main quantitative indicators are bus congestion and the waiting time of
passengers, while the interests of bus companies are mainly affected by the
number of departures (departure intervals) in the timetable.
https://doi.org/10.48550/arXiv.2107.07066
Deep Reinforcement Learning Formulation
 Actions – No Departure(0), Departure(1)
 States – Time(hr-𝑿𝟏𝒕,),Time(min-𝑿𝟐𝒕,),Load Rate(𝑿𝟑𝒕), Waiting Time(𝑿𝟒𝒕),
Carrying Capability(𝑿𝟓𝒕), Stranded Passengers(𝑿𝟔𝒕).
 Reward(from reference paper):
1 − (𝑿𝟓𝒕) − 𝛼 ∗ 𝑿𝟒𝒕 − 𝛽 ∗ 𝑿𝟔𝒕, (𝑎𝑐𝑡𝑖𝑜𝑛 = 0)
(𝑿𝟓𝒕) − 𝛽 ∗ 𝑿𝟔𝒕, (𝑎𝑐𝑡𝑖𝑜𝑛 = 1)
Alternate reward functions were also tested.
https://doi.org/10.48550/arXiv.2107.07066
, , ,
Bus Environment
 Bus timetable considered as episodic task (star-end schedule) with 6 states changing w.r.t
time.
𝑺𝒕 = [𝑿𝟏𝒕, 𝑿𝟐𝒕, 𝑿𝟑𝒕, 𝑿𝟒𝒕, 𝑿𝟓𝒕, 𝑿𝟔𝒕]
 𝑋1𝑡 − 𝑡ℎ/24, 𝑋2𝑡 −
𝑡𝑚
60
 𝑋3𝑡 −
𝑀𝑎𝑥 𝑃𝑎𝑠𝑠𝑒𝑛𝑔𝑒𝑟𝑠
𝑀𝑎𝑥 𝐵𝑢𝑠 𝐶𝑎𝑝𝑎𝑐𝑖𝑡𝑦
 𝑋4𝑡 − Normalized Waiting time of all passengers
 𝑋5𝑡 −
𝑁𝑒𝑒𝑑 𝑜𝑓 𝐶𝑎𝑟𝑟𝑦𝑖𝑛𝑔 𝐶𝑎𝑝𝑎𝑐𝑖𝑡𝑦
𝐶𝑎𝑟𝑟𝑦𝑖𝑛𝑔 𝑐𝑎𝑝𝑎𝑐𝑖𝑡𝑦 𝑜𝑓 𝑣𝑒ℎ𝑖𝑐𝑙𝑒
 𝑋6𝑡 − 𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑠𝑡𝑟𝑎𝑛𝑑𝑒𝑑 𝑝𝑎𝑠𝑠𝑒𝑛𝑔𝑒𝑟𝑠
Assumptions*:
• 𝑋6𝑡 was assumed to be exponential distribution after every departure.
• Based on number of stranded passengers awaiting bus or remainder post departure
are used for calculation of 𝑋4𝑡, 𝑋5𝑡
• 𝑋3𝑡 is calculated based on max bus capacity and no. of stranded passengers at the
time of departure
• Episode considered from 06:00-10:00Am with decision point at every 5 mins
* Real data was not available
http://www.muxingyun.com/en/digital-factory
DRL - Algorithms
Deep SARSA
S.No. Parameter Value
1. DNN – Hidden
Layers
2
2. DNN-Hidden
Units
188
3. Activation
Function
ReLU
4. Epsilon 0.2
5. Gamma .99
6. Experience
memory size
10000
7. Batch Size 32
8. Learning rate .001
DQN
S.No. Parameter Value
1. DNN – Hidden
Layers
2
2. DNN-Hidden
Units
188
3. Activation
Function
ReLU
4. Epsilon 0.2
5. Gamma .99
6. Experience
memory size
10000
7. Batch Size 32
8. Learning rate .001
Bus Timetable-Fixed Interval(30 mins)
TEST - 1
Testing with Deep SARSA & DQN
SARSA
Episodes 1500
Epsilon 0.2
DQN
Episodes 1500
Epsilon 0.2
Reward: Same as in paper.
Testing with Deep SARSA & DQN
Load Factor
Fixed Interval(30 mins): 8, SARSA Departures:23,DQN Departures:23
Normalized Waiting Time
Bus Timetable
DQN
Deep SARSA
Testing with Deep SARSA & DQN
Required Carrying Capacity
Fixed Interval(30 mins): 8, SARSA Departures:23,DQN Departures:23
Stranded Passenger
Bus Timetable
DQN
Deep SARSA
TEST - 2
Testing with Deep SARSA & DQN
SARSA
Episodes 1500
Epsilon 0.2
DQN
Episodes 1500
Epsilon 0.2
Reward: Modified to reduce number of departures and increase load rate to account
for bus agency. Condition added to have Load Rate > 0.7
Testing with Deep SARSA & DQN
Load Factor
Fixed Interval(30 mins): 8, SARSA Departures:15,DQN Departures:15
Normalized Waiting Time
Bus Timetable
DQN
Deep SARSA
Testing with Deep SARSA & DQN
Required Carrying Capacity
Fixed Interval(30 mins): 8, SARSA Departures:15,DQN Departures:15
Stranded Passenger
Bus Timetable
DQN
Deep SARSA
TEST - 3
Testing with Deep SARSA & DQN
SARSA
Episodes 1500
Epsilon 0.2
DQN
Episodes 1500
Epsilon 0.2
Reward: Modified to reduce number of departures and increase load rate more to
account for bus agency. Condition added to have Load Rate > 0.8
Testing with Deep SARSA & DQN
Load Factor
Fixed Interval(30 mins): 8, SARSA Departures:11,DQN Departures:11
Normalized Waiting Time
Bus Timetable
DQN
Deep SARSA
Testing with Deep SARSA & DQN
Required Carrying Capacity
Fixed Interval(30 mins): 8, SARSA Departures:11,DQN Departures:11
Stranded Passenger
Bus Timetable
DQN
Deep SARSA
Conclusion
 Timetable created with fixed interval of 30 mins had waiting time and
stranded passenger going up and couldn’t be contained. Load Rate was also
100% which doesn’t go well with travelers due to heavy congestion.
 With right reward function DQN and Deep SARSA were able to understand
stranded passenger rate and reduce waiting time and number of stranded
passengers while keeping Load Rate less than 90% with just 3 more
departures in 4 hours (6:00-10:00AM)
 With Deep SARSA & DQN real time decision based on number of stranded
passengers can be taken without re-computing the whole problem.

More Related Content

Similar to Deep_Reinforcement_Learning_based_Dynamic_Timetable.pptx

Service analytics: A proposal to LTA Singapore for on-demand bus trials
Service analytics: A proposal to LTA Singapore for on-demand bus trialsService analytics: A proposal to LTA Singapore for on-demand bus trials
Service analytics: A proposal to LTA Singapore for on-demand bus trials
Kriti Srivastava
 
Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...
Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...
Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...
ivaderivader
 
A Dynamic Logistic Dispatching System With Set-Based Particle Swarm Optimization
A Dynamic Logistic Dispatching System With Set-Based Particle Swarm OptimizationA Dynamic Logistic Dispatching System With Set-Based Particle Swarm Optimization
A Dynamic Logistic Dispatching System With Set-Based Particle Swarm Optimization
Rajib Roy
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
theijes
 
Cost minimization model
Cost minimization modelCost minimization model
Cost minimization model
rowell balala
 
full report
full reportfull report
full report
Rahul M.R
 
queuingtheory-091005084417-phpapp01 (2).pdf
queuingtheory-091005084417-phpapp01 (2).pdfqueuingtheory-091005084417-phpapp01 (2).pdf
queuingtheory-091005084417-phpapp01 (2).pdf
Aditya Mane
 
A Longitudinal Control Algorithm for Smart Cruise Control with Virtual Parame...
A Longitudinal Control Algorithm for Smart Cruise Control with Virtual Parame...A Longitudinal Control Algorithm for Smart Cruise Control with Virtual Parame...
A Longitudinal Control Algorithm for Smart Cruise Control with Virtual Parame...
ijceronline
 
Collision Warning....SAMRIDDHA SHIL & MY TEAM
Collision Warning....SAMRIDDHA SHIL & MY TEAMCollision Warning....SAMRIDDHA SHIL & MY TEAM
Collision Warning....SAMRIDDHA SHIL & MY TEAM
Samriddha Shil
 
Traffic state estimation with multi-sensor data for large networks with macro...
Traffic state estimation with multi-sensor data for large networks with macro...Traffic state estimation with multi-sensor data for large networks with macro...
Traffic state estimation with multi-sensor data for large networks with macro...
YazanSafadi
 
IRJET- Prediction of Cab Demand using Machine Learning
IRJET- Prediction of Cab Demand using Machine LearningIRJET- Prediction of Cab Demand using Machine Learning
IRJET- Prediction of Cab Demand using Machine Learning
IRJET Journal
 
Automated Parameterization of Performance Models from Measurements
Automated Parameterization of Performance Models from MeasurementsAutomated Parameterization of Performance Models from Measurements
Automated Parameterization of Performance Models from Measurements
Weikun Wang
 
TRAFFIC SIMULATION AT TOLL ROAD SECTION USING VISSIM SOFTWARE
TRAFFIC SIMULATION AT TOLL ROAD SECTION USING VISSIM SOFTWARETRAFFIC SIMULATION AT TOLL ROAD SECTION USING VISSIM SOFTWARE
TRAFFIC SIMULATION AT TOLL ROAD SECTION USING VISSIM SOFTWARE
shrikrishna kesharwani
 
PRESENTATION
PRESENTATIONPRESENTATION
PRESENTATION
Adnan Khan
 
Contoh Desain Slide Presentasi Ilmiah Kreatif dan Menarik #1
Contoh Desain Slide Presentasi Ilmiah Kreatif dan Menarik #1Contoh Desain Slide Presentasi Ilmiah Kreatif dan Menarik #1
Contoh Desain Slide Presentasi Ilmiah Kreatif dan Menarik #1
Arry Rahmawan
 
Traffic signal
Traffic signalTraffic signal
Traffic signal
aloknitb
 
How Analytic Reporting Can Identify and Solve Paratransit Service Shortcomings
How Analytic Reporting Can Identify and Solve Paratransit Service ShortcomingsHow Analytic Reporting Can Identify and Solve Paratransit Service Shortcomings
How Analytic Reporting Can Identify and Solve Paratransit Service Shortcomings
TSSParatransit
 
Deep reinforcement learning for traffic light cycle control
Deep reinforcement learning for traffic light cycle controlDeep reinforcement learning for traffic light cycle control
Deep reinforcement learning for traffic light cycle control
PRITIJHA21
 
PROPOSED INTELLIGENT TRANSPORT SYSTEM DEPLOYMENTS IN KAJANG CITY
PROPOSED INTELLIGENT TRANSPORT SYSTEM DEPLOYMENTS IN KAJANG CITYPROPOSED INTELLIGENT TRANSPORT SYSTEM DEPLOYMENTS IN KAJANG CITY
PROPOSED INTELLIGENT TRANSPORT SYSTEM DEPLOYMENTS IN KAJANG CITY
664601
 
Battery Powered and Hybrid Drive Opportunities in Heavy Duty, Large Capacity,...
Battery Powered and Hybrid Drive Opportunities in Heavy Duty, Large Capacity,...Battery Powered and Hybrid Drive Opportunities in Heavy Duty, Large Capacity,...
Battery Powered and Hybrid Drive Opportunities in Heavy Duty, Large Capacity,...
Newton Montano
 

Similar to Deep_Reinforcement_Learning_based_Dynamic_Timetable.pptx (20)

Service analytics: A proposal to LTA Singapore for on-demand bus trials
Service analytics: A proposal to LTA Singapore for on-demand bus trialsService analytics: A proposal to LTA Singapore for on-demand bus trials
Service analytics: A proposal to LTA Singapore for on-demand bus trials
 
Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...
Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...
Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...
 
A Dynamic Logistic Dispatching System With Set-Based Particle Swarm Optimization
A Dynamic Logistic Dispatching System With Set-Based Particle Swarm OptimizationA Dynamic Logistic Dispatching System With Set-Based Particle Swarm Optimization
A Dynamic Logistic Dispatching System With Set-Based Particle Swarm Optimization
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
 
Cost minimization model
Cost minimization modelCost minimization model
Cost minimization model
 
full report
full reportfull report
full report
 
queuingtheory-091005084417-phpapp01 (2).pdf
queuingtheory-091005084417-phpapp01 (2).pdfqueuingtheory-091005084417-phpapp01 (2).pdf
queuingtheory-091005084417-phpapp01 (2).pdf
 
A Longitudinal Control Algorithm for Smart Cruise Control with Virtual Parame...
A Longitudinal Control Algorithm for Smart Cruise Control with Virtual Parame...A Longitudinal Control Algorithm for Smart Cruise Control with Virtual Parame...
A Longitudinal Control Algorithm for Smart Cruise Control with Virtual Parame...
 
Collision Warning....SAMRIDDHA SHIL & MY TEAM
Collision Warning....SAMRIDDHA SHIL & MY TEAMCollision Warning....SAMRIDDHA SHIL & MY TEAM
Collision Warning....SAMRIDDHA SHIL & MY TEAM
 
Traffic state estimation with multi-sensor data for large networks with macro...
Traffic state estimation with multi-sensor data for large networks with macro...Traffic state estimation with multi-sensor data for large networks with macro...
Traffic state estimation with multi-sensor data for large networks with macro...
 
IRJET- Prediction of Cab Demand using Machine Learning
IRJET- Prediction of Cab Demand using Machine LearningIRJET- Prediction of Cab Demand using Machine Learning
IRJET- Prediction of Cab Demand using Machine Learning
 
Automated Parameterization of Performance Models from Measurements
Automated Parameterization of Performance Models from MeasurementsAutomated Parameterization of Performance Models from Measurements
Automated Parameterization of Performance Models from Measurements
 
TRAFFIC SIMULATION AT TOLL ROAD SECTION USING VISSIM SOFTWARE
TRAFFIC SIMULATION AT TOLL ROAD SECTION USING VISSIM SOFTWARETRAFFIC SIMULATION AT TOLL ROAD SECTION USING VISSIM SOFTWARE
TRAFFIC SIMULATION AT TOLL ROAD SECTION USING VISSIM SOFTWARE
 
PRESENTATION
PRESENTATIONPRESENTATION
PRESENTATION
 
Contoh Desain Slide Presentasi Ilmiah Kreatif dan Menarik #1
Contoh Desain Slide Presentasi Ilmiah Kreatif dan Menarik #1Contoh Desain Slide Presentasi Ilmiah Kreatif dan Menarik #1
Contoh Desain Slide Presentasi Ilmiah Kreatif dan Menarik #1
 
Traffic signal
Traffic signalTraffic signal
Traffic signal
 
How Analytic Reporting Can Identify and Solve Paratransit Service Shortcomings
How Analytic Reporting Can Identify and Solve Paratransit Service ShortcomingsHow Analytic Reporting Can Identify and Solve Paratransit Service Shortcomings
How Analytic Reporting Can Identify and Solve Paratransit Service Shortcomings
 
Deep reinforcement learning for traffic light cycle control
Deep reinforcement learning for traffic light cycle controlDeep reinforcement learning for traffic light cycle control
Deep reinforcement learning for traffic light cycle control
 
PROPOSED INTELLIGENT TRANSPORT SYSTEM DEPLOYMENTS IN KAJANG CITY
PROPOSED INTELLIGENT TRANSPORT SYSTEM DEPLOYMENTS IN KAJANG CITYPROPOSED INTELLIGENT TRANSPORT SYSTEM DEPLOYMENTS IN KAJANG CITY
PROPOSED INTELLIGENT TRANSPORT SYSTEM DEPLOYMENTS IN KAJANG CITY
 
Battery Powered and Hybrid Drive Opportunities in Heavy Duty, Large Capacity,...
Battery Powered and Hybrid Drive Opportunities in Heavy Duty, Large Capacity,...Battery Powered and Hybrid Drive Opportunities in Heavy Duty, Large Capacity,...
Battery Powered and Hybrid Drive Opportunities in Heavy Duty, Large Capacity,...
 

Recently uploaded

Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
vikram sood
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
Bill641377
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
nyfuhyz
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
Lars Albertsson
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
Social Samosa
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
aqzctr7x
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
nuttdpt
 
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
zsjl4mimo
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Kiwi Creative
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
v7oacc3l
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
Intelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicineIntelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicine
AndrzejJarynowski
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
Social Samosa
 
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
g4dpvqap0
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
Sm321
 

Recently uploaded (20)

Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
 
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
Intelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicineIntelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicine
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
 
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
 

Deep_Reinforcement_Learning_based_Dynamic_Timetable.pptx

  • 1. Deep Reinforcement Learning based Dynamic Optimization of Bus Timetable Ankit Sharma
  • 2. Bus Timetable Optimization  Bus timetable optimization is a key issue to reduce operational cost of bus companies and improve the service quality.  Heuristic algorithms work in offline and does not account for people flow change. https://doi.org/10.48550/arXiv.2107.07066
  • 3. Bus Timetable Optimization  The optimization of the bus timetable aims to consider the interests of both passengers and the bus company, and set the departure time of buses to meet the demand of passenger flow.  The main quantitative indicators are bus congestion and the waiting time of passengers, while the interests of bus companies are mainly affected by the number of departures (departure intervals) in the timetable. https://doi.org/10.48550/arXiv.2107.07066
  • 4. Deep Reinforcement Learning Formulation  Actions – No Departure(0), Departure(1)  States – Time(hr-𝑿𝟏𝒕,),Time(min-𝑿𝟐𝒕,),Load Rate(𝑿𝟑𝒕), Waiting Time(𝑿𝟒𝒕), Carrying Capability(𝑿𝟓𝒕), Stranded Passengers(𝑿𝟔𝒕).  Reward(from reference paper): 1 − (𝑿𝟓𝒕) − 𝛼 ∗ 𝑿𝟒𝒕 − 𝛽 ∗ 𝑿𝟔𝒕, (𝑎𝑐𝑡𝑖𝑜𝑛 = 0) (𝑿𝟓𝒕) − 𝛽 ∗ 𝑿𝟔𝒕, (𝑎𝑐𝑡𝑖𝑜𝑛 = 1) Alternate reward functions were also tested. https://doi.org/10.48550/arXiv.2107.07066 , , ,
  • 5. Bus Environment  Bus timetable considered as episodic task (star-end schedule) with 6 states changing w.r.t time. 𝑺𝒕 = [𝑿𝟏𝒕, 𝑿𝟐𝒕, 𝑿𝟑𝒕, 𝑿𝟒𝒕, 𝑿𝟓𝒕, 𝑿𝟔𝒕]  𝑋1𝑡 − 𝑡ℎ/24, 𝑋2𝑡 − 𝑡𝑚 60  𝑋3𝑡 − 𝑀𝑎𝑥 𝑃𝑎𝑠𝑠𝑒𝑛𝑔𝑒𝑟𝑠 𝑀𝑎𝑥 𝐵𝑢𝑠 𝐶𝑎𝑝𝑎𝑐𝑖𝑡𝑦  𝑋4𝑡 − Normalized Waiting time of all passengers  𝑋5𝑡 − 𝑁𝑒𝑒𝑑 𝑜𝑓 𝐶𝑎𝑟𝑟𝑦𝑖𝑛𝑔 𝐶𝑎𝑝𝑎𝑐𝑖𝑡𝑦 𝐶𝑎𝑟𝑟𝑦𝑖𝑛𝑔 𝑐𝑎𝑝𝑎𝑐𝑖𝑡𝑦 𝑜𝑓 𝑣𝑒ℎ𝑖𝑐𝑙𝑒  𝑋6𝑡 − 𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑠𝑡𝑟𝑎𝑛𝑑𝑒𝑑 𝑝𝑎𝑠𝑠𝑒𝑛𝑔𝑒𝑟𝑠 Assumptions*: • 𝑋6𝑡 was assumed to be exponential distribution after every departure. • Based on number of stranded passengers awaiting bus or remainder post departure are used for calculation of 𝑋4𝑡, 𝑋5𝑡 • 𝑋3𝑡 is calculated based on max bus capacity and no. of stranded passengers at the time of departure • Episode considered from 06:00-10:00Am with decision point at every 5 mins * Real data was not available http://www.muxingyun.com/en/digital-factory
  • 6. DRL - Algorithms Deep SARSA S.No. Parameter Value 1. DNN – Hidden Layers 2 2. DNN-Hidden Units 188 3. Activation Function ReLU 4. Epsilon 0.2 5. Gamma .99 6. Experience memory size 10000 7. Batch Size 32 8. Learning rate .001 DQN S.No. Parameter Value 1. DNN – Hidden Layers 2 2. DNN-Hidden Units 188 3. Activation Function ReLU 4. Epsilon 0.2 5. Gamma .99 6. Experience memory size 10000 7. Batch Size 32 8. Learning rate .001
  • 9. Testing with Deep SARSA & DQN SARSA Episodes 1500 Epsilon 0.2 DQN Episodes 1500 Epsilon 0.2 Reward: Same as in paper.
  • 10. Testing with Deep SARSA & DQN Load Factor Fixed Interval(30 mins): 8, SARSA Departures:23,DQN Departures:23 Normalized Waiting Time Bus Timetable DQN Deep SARSA
  • 11. Testing with Deep SARSA & DQN Required Carrying Capacity Fixed Interval(30 mins): 8, SARSA Departures:23,DQN Departures:23 Stranded Passenger Bus Timetable DQN Deep SARSA
  • 13. Testing with Deep SARSA & DQN SARSA Episodes 1500 Epsilon 0.2 DQN Episodes 1500 Epsilon 0.2 Reward: Modified to reduce number of departures and increase load rate to account for bus agency. Condition added to have Load Rate > 0.7
  • 14. Testing with Deep SARSA & DQN Load Factor Fixed Interval(30 mins): 8, SARSA Departures:15,DQN Departures:15 Normalized Waiting Time Bus Timetable DQN Deep SARSA
  • 15. Testing with Deep SARSA & DQN Required Carrying Capacity Fixed Interval(30 mins): 8, SARSA Departures:15,DQN Departures:15 Stranded Passenger Bus Timetable DQN Deep SARSA
  • 17. Testing with Deep SARSA & DQN SARSA Episodes 1500 Epsilon 0.2 DQN Episodes 1500 Epsilon 0.2 Reward: Modified to reduce number of departures and increase load rate more to account for bus agency. Condition added to have Load Rate > 0.8
  • 18. Testing with Deep SARSA & DQN Load Factor Fixed Interval(30 mins): 8, SARSA Departures:11,DQN Departures:11 Normalized Waiting Time Bus Timetable DQN Deep SARSA
  • 19. Testing with Deep SARSA & DQN Required Carrying Capacity Fixed Interval(30 mins): 8, SARSA Departures:11,DQN Departures:11 Stranded Passenger Bus Timetable DQN Deep SARSA
  • 20. Conclusion  Timetable created with fixed interval of 30 mins had waiting time and stranded passenger going up and couldn’t be contained. Load Rate was also 100% which doesn’t go well with travelers due to heavy congestion.  With right reward function DQN and Deep SARSA were able to understand stranded passenger rate and reduce waiting time and number of stranded passengers while keeping Load Rate less than 90% with just 3 more departures in 4 hours (6:00-10:00AM)  With Deep SARSA & DQN real time decision based on number of stranded passengers can be taken without re-computing the whole problem.