SlideShare a Scribd company logo
| Copyright © 2021 Tata Consultancy Services Limited
TCS Internal | Copyright © 2021 Tata Consultancy Services Limited
A Reinforcement Learning Approach for
Electric Vehicle Routing Problem with Vehicle-to-Grid Supply
Ajay Narayanan, Prasant Misra, Ankush Ojha, Vivek Bandhu, Supratim Ghosh, Arunchandar Vasan
Tata Consultancy Services – Research, India
Adaptive and Learning Agents Workshop
| Copyright © 2021 Tata Consultancy Services Limited
TCS Internal
Problem Overview
Objective (What?): Design a scalable vehicle routing algorithm that reduces the trip cost for a fleet of last-mile delivery EVs
Challenges
▪ Delivery demand (i.e., goods/energy) is present at specific time periods and locations; the supply (i.e., delivery EVs) needs to match that demand
(both in space and time) for making a successful transaction
▪ The computationally difficult VRP problem becomes even harder with the introduction of new constraints related to EVs and energy discharge
▪ Existing solutions do not scale well to large problem instances in quick time with reasonable solution accuracy
Motivation (Why?)
▪ Last-mile delivery is the most expensive part of the logistic and e-commerce process; accounts for more than 50% of the overall shipping expenses
▪ EV adoption in last-mile delivery operations is increasing; expected to grow to 8 million by 2030
▪ (Last-mile EVs + Multi-service operation model) can lead to additional cost reduction of the last-mile delivery segment; which is over and above the cost
benefits that come naturally from electrification!
Approach (HOW?): Design a routing algorithm for EV fleets using reinforcement learning (RL) that learns optimal routing policies for a multi-service
(delivery of good + delivery of energy) operation model
2
Proposal: New (multi-service) delivery model using last-mile EVs:
✓ Delivery of goods to customer (MANDATORY) +
✓ Delivery (or selling) of energy to the grid during its period of
peak demand (OPTIONAL)
Electricity Grid
Deliver/Sell Energy
Deliver Goods
Customers
| Copyright © 2021 Tata Consultancy Services Limited
TCS Internal
Problem Description
3
Demand
EV
Customer
Supply
Goods
Time
Time
Goods
EV
Energy
Time
Time
Energy
Supply
Demand
Grid
Fleet Operator Problem:
▪ GIVEN (a priori):
(i) Customer order fulfillment list (from fulfillment center)
(ii) Peak energy demand periods (from grid)
(iii) Starting state-of-charge (SOC) of all EVs in the fleet (Q)
▪ CONSTRAINT:
(i) (Mandatory) EVs must complete all customer deliveries
(ii) (Optional) EVs can sell power to the grid, where possible
(iii) The entire trip (depot-depot) must be managed within Q; without recharging
▪ Assumption:
(i) All EVs are charged to Q at the depot, before trip commencement
▪ DECIDE (offline): Route of all EVs in the fleet
▪ OBJECTIVE: Minimize trip cost of the EV Fleet
Customer
(demand for goods)
Discharging Station
(demand for energy)
| Copyright © 2021 Tata Consultancy Services Limited
TCS Internal
System Model
4
ⅰ
j
ⅰ
Customer Set 𝐾 Discharging Station Set 𝑃
EV Set 𝑋 {𝑣0}
Complete Undirected Graph 𝐺 = (𝑉, 𝐸)
▪ Set of nodes 𝑉 = {𝑣0} ∪ 𝐾 ∪ 𝑃
▪ Set of edges 𝐸 = { (𝑖, 𝑗 ) : 𝑖, 𝑗 ∈ 𝑉, 𝑖 ≠𝑗 }
Parameters
C carrying capacity of each EV
Q starting state-of-charge (SoC) of each EV
d𝑖j
distance between nodes 𝑖 and 𝑗
t𝑖j travel time between nodes 𝑖 and 𝑗
H charge consumption rate (kWh/km)
b𝑖j energy consumed in travelling between 𝑖 and 𝑗
(= H . d𝑖j )
𝑐𝑖 demand (of goods) at node 𝑖
𝑠𝑖 service time at node 𝑖
𝑒𝑖 earliest start of service at node 𝑖
𝑙𝑖 latest start of service at node 𝑖
ⅰ
R discharging rate of each discharging station
𝐺𝑖
1 start time – grid demand at node 𝑖
𝐺𝑖
2 end time – grid demand at node 𝑖
Decision Variables
α𝑖j indicates, if edge 𝑖𝑗 is traversed by an EV (binary)
γ𝑖 service time at discharging station node 𝑖
τ𝑖 time-of-arrival at node 𝑖
Ɵ𝑖 remaining battery capacity on arrival at node 𝑖
λ𝑖 remaining cargo on arrival at node 𝑖
▪ |X| = u
▪ |K| = m
▪ |P| = n
| Copyright © 2021 Tata Consultancy Services Limited
TCS Internal
Optimization Problem
5
M = 𝑚𝑖𝑛 (𝑌1. ෍ 𝑑𝑖𝑗 . 𝛼𝑖𝑗 + 𝑌2. ෍ 𝛼0𝑖 − 𝑌3. ෍ 𝛾𝑖 )
total distance travelled by EVs
total time spend by EVs at discharging stations
total EVs used in a trip
[C1]: Ensure every customer is visited exactly once, while making it optional to visit any of the discharging stations
෍
𝑗 ∈ 𝑉, 𝑖 ≠ 𝑗
𝛼𝑖𝑗 =1 ∀𝑖 ∈ 𝐾
Objective: Minimize the Trip Cost of the EV Fleet
Constraints
෍
𝑗 ∈ 𝑉, 𝑖 ≠ 𝑗
𝛼𝑖𝑗 = ෍
𝑗 ∈ 𝑉, 𝑖 ≠ 𝑗
𝛼𝑗𝑖 ∀𝑖 ∈ 𝑉
[C2]: Ensure flow conservation (at each node: # incoming edges = # outgoing edges)
| Copyright © 2021 Tata Consultancy Services Limited
TCS Internal
Optimization Problem
6
[C3]: Ensure time feasibility of arcs leaving customers and the depot
𝜏𝑖 + (𝑡𝑖𝑗 + 𝑠𝑖) . 𝛼𝑖𝑗 − 𝑙0 . 1 − 𝛼𝑖𝑗 ≤ 𝜏𝑗 ∀𝑖 ∈ 𝐾, ∀𝑖 ∈ 𝑉, 𝑖 ≠ 𝑗
[C4]: Ensure time feasibility of arcs leaving discharging stations and the depot
𝜏𝑖 + (𝑡𝑖𝑗 + 𝑠𝑖) . 𝛼𝑖𝑗 − 𝑙0 . 1 − 𝛼𝑖𝑗 ≤ 𝜏𝑗 ∀𝑖 ∈ 𝑃, ∀𝑗 ∈ 𝑉, 𝑖 ≠ 𝑗
[C5]: Ensure that each customer node is visited within its time window
𝑒𝑖 ≤ 𝜏𝑖 ≤ 𝑙𝑖 ∀𝑖 ∈ 𝐾
[C6]: Ensure remaining charge (energy) feasibility for arcs leaving customers and the depot
0 ≤ 𝜃𝑗 ≤ 𝜃𝑖 − 𝐻 . 𝑑𝑖𝑗 . 𝛼𝑖𝑗 + 𝑄 1 − 𝛼𝑖𝑗 ∀𝑖 ∈ 𝐾, ∀𝑗 ∈ 𝑉, 𝑖 ≠ 𝑗
[C7]: Ensure remaining charge (energy) feasibility for arcs leaving discharging stations and the depot
0 ≤ 𝜃𝑗 ≤ 𝜃𝑖 − 𝐻 . 𝑑𝑖𝑗 + 𝑅 . 𝛾𝑗 . 𝛼𝑖𝑗 + 𝑄 1 − 𝛼𝑖𝑗 ∀𝑖 ∈ 𝑃, ∀𝑗 ∈ 𝑉, 𝑖 ≠ 𝑗
[C8]: Ensure all customer demands are fulfilled
0 ≤ λ𝑗 ≤ λ𝑖 − 𝑐𝑖𝑗. 𝛼𝑖𝑗 + 𝐶 1 − 𝛼𝑖𝑗 ∀𝑖, 𝑗 ∈ 𝑉
[C9 & 10]: Ensure discharge service time aligns with grid peak demand period, if discharging station is visited
𝜏𝑖 ≥ 𝐺𝑖
1
∀𝑖 ∈ 𝑃, 𝜏𝑖 ≠ 0 𝜏𝑖 + 𝛾𝑖 ≤ 𝐺𝑖
2
∀𝑖 ∈ 𝑃, 𝜏𝑖 ≠ 0
| Copyright © 2021 Tata Consultancy Services Limited
TCS Internal
RL Representation
7
ⅰ j
u
Masking scheme for finding FEASIBLE (Vehicle -> Node) pairs
▪ The earliest arrival time at node 𝑗 violates the time window constraint [C5, C9, C10]
▪ Node 𝑗 is a customer; and the current SoC of the vehicle cannot support the complete trip from node 𝑖 to node 𝑗 and back to the depot [C6]
▪ Node 𝑗 is a discharging station; and the current SoC of the vehicle cannot support the complete trip from node 𝑖 to node 𝑗 and back to the
depot; as well as the discharge operation at node 𝑗 [C7]
▪ Node 𝑗 is a customer with unfulfilled demand that is either nil or exceeds the remaining carrying load of vehicle 𝑢 [C8]
System decision := find routes that minimize the trip cost of the fleet, subject to constraints
Routing decision := which vehicle 𝑢 should be assigned to which node 𝑖 ?
Are all FREE
vehicles assigned ?
NO
| Copyright © 2021 Tata Consultancy Services Limited
TCS Internal
RL Representation
8
j i
u
State Variables
𝑏𝑖𝑗
energy consumed in travelling between nodes 𝑖 and 𝑗
(proxy for distance travelled between nodes 𝑖 and 𝑗)
z𝑖 energy spent at node 𝑖 (0, if at customer; else z𝑖)
Idepo flag: indicates if vehicle u 𝑖𝑠 starting from the depot
Icust flag: indicates if node 𝑖 is a customer
𝑤𝑢
𝑖 wait time of vehicle 𝑢 at node 𝑖 before it can start service
Normalizing Factor
𝐸
energy required to travel the
diagonal length of the graph
𝐸
-
-
𝑇 Decision time horizon
State 𝑺𝒕(𝒖, 𝒊)
Action 𝑨𝒕(𝒖, 𝒊)
▪ Assign vehicle 𝑢 → 𝑛𝑜𝑑𝑒 𝑖
▪ If vehicle 𝑢 is BUSY; perform local update
(i) remove assigned 𝑛𝑜𝑑𝑒 𝑖 from the service list;
(ii) update distance and time
▪ If the assignment of all FREE vehicles is done,
perform global update and get reward
Routing decision := from all feasible 𝑢, 𝑖 pairs, which pair is the best choice?
Reward 𝑹𝒕(𝒖, 𝒊)
− 𝐴1 ∗ 𝑏𝑖𝑗 𝐴1 = 0.15 -VE reward for choosing longer route segments
+ 𝐴2 ∗ 𝑧𝑖 𝐴2 = 0.001 +VE reward for visiting discharging station node
+ 𝐴3 ∗ 𝐼𝑐𝑢𝑠𝑡 𝐴3 = 0.15 +VE reward for visiting customer node
− (𝐴4 ∗ 𝑤𝑢
𝑖
) 𝐴1 = 0.15 -VE reward for assignments that lead to waiting time
− (𝐴5 ∗ 𝐼𝑑𝑒𝑝𝑜) 𝐴1 = 0.55 -VE reward for sending new vehicles from the depot
| Copyright © 2021 Tata Consultancy Services Limited
TCS Internal
RL Training Algorithm
9
1) Initialize the neural network with weights 𝜙
2) Initialize batch size 𝛽, replay buffer 𝐵
3) :FOR <Episode> = 1 :TO <Total-Num-Episodes> :DO
i. Randomly choose data instance from training set
ii. Initialize environment
iii. :WHILE t < T :DO
a. Create a copy of the environment for local updates
b. :WHILE <free-vehicle> is unassigned :DO
i. Find feasible (vehicle 𝑢, node 𝑖) pairs ∀ 𝑢 (whether free or busy)
a. :IF no feasible pairs found, then BREAK
ii. Calculate 𝑞𝑡 (𝑢, 𝑖) = 𝜙(𝑆𝑡(𝑢, 𝑖)) ∀ feasible (vehicle 𝑢, node 𝑖) pairs,
regardless of the current state of each vehicle 𝑢 (busy or free)
iii. Choose (𝑢, 𝑖) pair with highest 𝑞𝑡 value (using 𝜖-greedy assignment)
iv. Perform local update on the environment copy
c. Execute new (𝑢, 𝑖) assignments in the global environment and get reward 𝑅𝑡(𝑢, 𝑖)
d. Add [𝑆𝑡(𝑢, 𝑖); 𝑅𝑡(𝑢, 𝑖); 𝑞𝑡(𝑢, 𝑖)] to replay buffer
iv. Delete oldest entries in B if size exceeds buffer capacity
v. Draw 𝛽 samples from B
vi. Update 𝜙 by minimizing MSE loss between 𝑞𝑡(𝑢, 𝑖) and 𝑅𝑡 (𝑢, 𝑖)
| Copyright © 2021 Tata Consultancy Services Limited
TCS Internal
A Representative Example for the RL Approach
10
E=15; t=25
E=25;
t=35
E=05; t=15
E=25;
t=35
Depot
[0,200]
Cus-1
[0,55]
Cus-3
[45,75]
Cus-2
[10,45]
DS-1
[20,75]
Depot
[0,200]
Cus-1
[0,55]
Cus-3
[45,75]
Cus-2
[10,45]
DS-1
[20,75]
Veh-2 (Q=100)
Veh-1 (Q=100)
State(Veh-1, Cus-1)
State(Veh-1, Cus-2)
State(Veh-2, Cus-1)
State(Veh-2, Cus-2)
Argmax
State(Veh-1, Cus-1)
State(Veh-1, DS-1)
State(Veh-2, Cus-1)
Argmax
Val(Veh-1, DS-1)
Environment:
Global Update +
Reward Collection
t=0
Depot
[0,200]
Cus-1
[0,55]
Cus-3
[45,75]
Cus-2
[10,45]
DS-1
[20,75]
Veh-1 (Q=70)
Veh-2 (Q=100)
t=45
Depot
[0,200]
Cus-1
[0,55]
Cus-3
[45,75]
Cus-2
[10,45]
DS-1
[20,75]
Veh-1 (Q=70)
Veh-2 (Q=75)
(Veh-1, Cus-2)
(update GLOBAL environment IFF all free vehicles are
assigned; else update LOCAL environment copy)
(Veh-2, Cus-1)
| Copyright © 2021 Tata Consultancy Services Limited
TCS Internal
A Representative Example for the RL Approach
11
E=15; t=25
E=25;
t=35
E=05; t=15
E=25;
t=35
Depot
[0,200]
Cus-1
[0,55]
Cus-3
[45,75]
Cus-2
[10,45]
DS-1
[20,75]
State(Veh-1, DS-1)
State(Veh-2, DS-1)
State(Veh-2, Cus-3)
Argmax
State(Veh-2, Cus-3)
State(Veh-2, DS-1)
Argmax
Environment:
Global Update +
Reward Collection
t=80
(Veh-1, DS-1)
(update GLOBAL environment IFF all free vehicles are
assigned; else update LOCAL environment copy)
t=45
Depot
[0,200]
Cus-1
[0,55]
Cus-3
[45,75]
Cus-2
[10,45]
DS-1
[20,75]
Veh-1 (Q=70)
Veh-2 (Q=75)
Cus-3
[45,75]
DS-1
[20,75]
Veh-1 (Q=65)
Depot
[0,200]
Cus-1
[0,55]
Cus-2
[10,45]
Veh-2 (Q=75)
Battery
Constraint
(Veh-2, Cus-3)
Cus-3
[45,75]
DS-1
[20,75]
Veh-1 (Q=65)
Depot
[0,200]
Cus-1
[0,55]
Cus-2
[10,45]
Veh-2 (Q=60)
DS-1
[20,75]
Cus-3
[45,75]
| Copyright © 2021 Tata Consultancy Services Limited
TCS Internal
A Representative Example for the RL Approach
12
E=15; t=25
E=25;
t=35
E=05; t=15
E=25;
t=35
Depot
[0,200]
Cus-1
[0,55]
Cus-3
[45,75]
Cus-2
[10,45]
DS-1
[20,75]
State(Veh-1, Depot)
State(Veh-2, Depot)
Argmax
State(Veh-1, Depot)
Argmax
Val(Veh-2, Depot)
Environment:
Global Update +
Reward Collection
t=155
(Veh-2, Depot)
(update GLOBAL environment IFF all free vehicles are
assigned; else update LOCAL environment copy)
t=80
t=80
(Veh-1, Depot)
Depot
[0,200]
Cus-1
[0,55]
Cus-2
[10,45]
Veh-2 (Q=60)
Veh-1 (Q=65)
DS-1
[20,75]
Cus-3
[45,75]
Depot
[0,200]
Cus-1
[0,55]
Cus-2
[10,45]
Veh-2 (Q=60)
DS-1
[20,75]
Cus-3
[45,75]
Veh-1 (Q=65)
Depot
[0,200]
Cus-1
[0,55]
Cus-2
[10,45]
Veh-2 (Q=60)
DS-1
[20,75]
Cus-3
[45,75]
Veh-1 (Q=65)
| Copyright © 2021 Tata Consultancy Services Limited
TCS Internal
Evaluation
13
Neural Network Architecture
Layer # Description Neurons Justification Activation Func
1 Input Layer 5 5 state variables -
2 FC 12 ReLU
3 FC 6 ReLU
4 FC 3 ReLU
4 Output Layer 1 -
Hyperparameters
Parameter Value Parameter Value
Optimizer Adam Exploration factor
Linear decay from 1 to 0
over 75 episodes
Batch Size (β) 16 Exploration policy ε-greedy
Replay Buffer Size
(B)
5000
Learning Rate 0.001
Training Epochs per
Episode
Min -> # customers
▪ Implementation framework: PyTorch
▪ Training: 200 episodes (random combination of customers;
discharge stations, vehicles)
▪ Testing: Solomon Dataset [6-10 instances per dataset]
▪ Clustered (CL)
▪ Random (RA)
▪ Random Clustered (RC)
RL Training
| Copyright © 2021 Tata Consultancy Services Limited
TCS Internal
Mean Optimality Gap between GA and RL: 17.23% (std. dev. 7.98%)
Mean Optimality Gap between GA and RL: 22.38% (std. dev. 8.35%)
Mean Optimality Gap between GA and RL: 20.02% (std. dev. 6.58%)
1. Accuracy: GA is better than RL by an average of 19.8% (range 8.3% - 36.52%)
2. Execution Time: RL is faster than GA by an average of 24 times
| Copyright © 2021 Tata Consultancy Services Limited
TCS Internal
Table shows specific instances where the MILP formulation converges within a reasonable amount of time
GA accuracy is as a good as MILP
Evaluation
| Copyright © 2021 Tata Consultancy Services Limited
TCS Internal
Final Remarks
Summary
▪ We model the electric vehicle routing problem with constraints on loading capacity; time window; vehicle-to-
grid energy supply (CEVRPTW-D) ; and formulate the multi-objective optimization problem to minimize the
trip cost of the fleet.
▪ We design QuikRouteFinder: a value-based reinforcement learning algorithm by defining the (state, action)
space, and engineer the reward signal for the RL agent to find the cost-effective delivery routes.
▪ We design and implement a genetic algorithm (GA) metaheuristic to derive optimal results for CEVRPTW-D.
▪ Using Solomon datasets, we evaluate and compare the computation speed and solution accuracy of the
proposed RL model against GA and MILP.
Key Finding
QuikRouteFinder (based on RL) is 24 times faster than the GA and MILP baselines in terms of solutioning speed,
but with ≈ 20% decrease in solution quality.
16

More Related Content

What's hot

Train lighting system
Train lighting systemTrain lighting system
Train lighting system
siddhartha shukla
 
Fault diagnosis & fault tolerance in wind turbines
Fault diagnosis & fault tolerance in wind turbinesFault diagnosis & fault tolerance in wind turbines
Fault diagnosis & fault tolerance in wind turbines
Nitin Goyal
 
Solmax Global Business Opportunity
Solmax Global Business OpportunitySolmax Global Business Opportunity
Solmax Global Business Opportunity
Liz Evans
 
Understanding kalman filter for soc estimation.
Understanding kalman filter for soc estimation.Understanding kalman filter for soc estimation.
Understanding kalman filter for soc estimation.
Ratul
 
V2G and G2V power transfer issues
V2G and G2V power transfer issuesV2G and G2V power transfer issues
V2G and G2V power transfer issues
Mohd Javed
 
و كيفية تصحيحها (Power Factor) معامل القدرة
و كيفية تصحيحها (Power Factor)  معامل القدرةو كيفية تصحيحها (Power Factor)  معامل القدرة
و كيفية تصحيحها (Power Factor) معامل القدرة
Islam Hesham
 
Hybrid Electric Vehicles.pptx
Hybrid Electric Vehicles.pptxHybrid Electric Vehicles.pptx
Hybrid Electric Vehicles.pptx
BhaveshBorse7
 
Presentation1.pptx
Presentation1.pptxPresentation1.pptx
Presentation1.pptx
VishalLabde
 
Available transfer capability (atc) sbw ppt
Available transfer capability (atc) sbw pptAvailable transfer capability (atc) sbw ppt
Available transfer capability (atc) sbw ppt
Ravi Sekpure
 
E-mobility | Part 4 - EV charging and the next frontier (English)
E-mobility | Part 4 - EV charging and the next frontier (English)E-mobility | Part 4 - EV charging and the next frontier (English)
E-mobility | Part 4 - EV charging and the next frontier (English)
Vertex Holdings
 
Demand Side Management in Micro Grid
Demand Side Management in Micro GridDemand Side Management in Micro Grid
Demand Side Management in Micro Grid
Krishnakumar R. Vasudevan
 
Matlab simpowersystem
Matlab simpowersystemMatlab simpowersystem
Matlab simpowersystem
Ameen San
 
Soft Starter ماهو الـ
 Soft Starter  ماهو الـ Soft Starter  ماهو الـ
Soft Starter ماهو الـ
Islam Hesham
 
EV Charging Infrastructure: Littelfuse Solutions to Enhance Safety, Efficienc...
EV Charging Infrastructure: Littelfuse Solutions to Enhance Safety, Efficienc...EV Charging Infrastructure: Littelfuse Solutions to Enhance Safety, Efficienc...
EV Charging Infrastructure: Littelfuse Solutions to Enhance Safety, Efficienc...
Littelfuse
 
Hybrid vehicle architecture
Hybrid vehicle architectureHybrid vehicle architecture
Hybrid vehicle architecture
INTAKHAB KHAN
 
induction generator
induction generatorinduction generator
induction generator
vishalgohel12195
 
A seminar on flywheel energy storage system
A seminar on flywheel energy storage systemA seminar on flywheel energy storage system
A seminar on flywheel energy storage system
Karansinh Parmar
 
Optimal load scheduling
Optimal load schedulingOptimal load scheduling
Optimal load scheduling
Mayank Sharma
 
UNIT-4 Power Electronics in HEVs.pdf
UNIT-4  Power Electronics  in HEVs.pdfUNIT-4  Power Electronics  in HEVs.pdf
UNIT-4 Power Electronics in HEVs.pdf
Dr. Gudipudi Nageswara Rao
 
A hybrid vehicle configuration with zero emission
A hybrid vehicle configuration with zero emissionA hybrid vehicle configuration with zero emission
A hybrid vehicle configuration with zero emission
Kichu Joy
 

What's hot (20)

Train lighting system
Train lighting systemTrain lighting system
Train lighting system
 
Fault diagnosis & fault tolerance in wind turbines
Fault diagnosis & fault tolerance in wind turbinesFault diagnosis & fault tolerance in wind turbines
Fault diagnosis & fault tolerance in wind turbines
 
Solmax Global Business Opportunity
Solmax Global Business OpportunitySolmax Global Business Opportunity
Solmax Global Business Opportunity
 
Understanding kalman filter for soc estimation.
Understanding kalman filter for soc estimation.Understanding kalman filter for soc estimation.
Understanding kalman filter for soc estimation.
 
V2G and G2V power transfer issues
V2G and G2V power transfer issuesV2G and G2V power transfer issues
V2G and G2V power transfer issues
 
و كيفية تصحيحها (Power Factor) معامل القدرة
و كيفية تصحيحها (Power Factor)  معامل القدرةو كيفية تصحيحها (Power Factor)  معامل القدرة
و كيفية تصحيحها (Power Factor) معامل القدرة
 
Hybrid Electric Vehicles.pptx
Hybrid Electric Vehicles.pptxHybrid Electric Vehicles.pptx
Hybrid Electric Vehicles.pptx
 
Presentation1.pptx
Presentation1.pptxPresentation1.pptx
Presentation1.pptx
 
Available transfer capability (atc) sbw ppt
Available transfer capability (atc) sbw pptAvailable transfer capability (atc) sbw ppt
Available transfer capability (atc) sbw ppt
 
E-mobility | Part 4 - EV charging and the next frontier (English)
E-mobility | Part 4 - EV charging and the next frontier (English)E-mobility | Part 4 - EV charging and the next frontier (English)
E-mobility | Part 4 - EV charging and the next frontier (English)
 
Demand Side Management in Micro Grid
Demand Side Management in Micro GridDemand Side Management in Micro Grid
Demand Side Management in Micro Grid
 
Matlab simpowersystem
Matlab simpowersystemMatlab simpowersystem
Matlab simpowersystem
 
Soft Starter ماهو الـ
 Soft Starter  ماهو الـ Soft Starter  ماهو الـ
Soft Starter ماهو الـ
 
EV Charging Infrastructure: Littelfuse Solutions to Enhance Safety, Efficienc...
EV Charging Infrastructure: Littelfuse Solutions to Enhance Safety, Efficienc...EV Charging Infrastructure: Littelfuse Solutions to Enhance Safety, Efficienc...
EV Charging Infrastructure: Littelfuse Solutions to Enhance Safety, Efficienc...
 
Hybrid vehicle architecture
Hybrid vehicle architectureHybrid vehicle architecture
Hybrid vehicle architecture
 
induction generator
induction generatorinduction generator
induction generator
 
A seminar on flywheel energy storage system
A seminar on flywheel energy storage systemA seminar on flywheel energy storage system
A seminar on flywheel energy storage system
 
Optimal load scheduling
Optimal load schedulingOptimal load scheduling
Optimal load scheduling
 
UNIT-4 Power Electronics in HEVs.pdf
UNIT-4  Power Electronics  in HEVs.pdfUNIT-4  Power Electronics  in HEVs.pdf
UNIT-4 Power Electronics in HEVs.pdf
 
A hybrid vehicle configuration with zero emission
A hybrid vehicle configuration with zero emissionA hybrid vehicle configuration with zero emission
A hybrid vehicle configuration with zero emission
 

Similar to Reinforcement Learning for EVRP with V2G

Hybrid Planner for Smart Charging of Electric Fleets
Hybrid Planner for Smart Charging of Electric FleetsHybrid Planner for Smart Charging of Electric Fleets
Hybrid Planner for Smart Charging of Electric Fleets
Prasant Misra
 
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
ivaderivader
 
IRJET- Real-Time Forecasting of EV Charging Station Scheduling for Smart Ener...
IRJET- Real-Time Forecasting of EV Charging Station Scheduling for Smart Ener...IRJET- Real-Time Forecasting of EV Charging Station Scheduling for Smart Ener...
IRJET- Real-Time Forecasting of EV Charging Station Scheduling for Smart Ener...
IRJET Journal
 
Simulation of electric bike using Scilab Xcos
Simulation of electric bike using Scilab XcosSimulation of electric bike using Scilab Xcos
Simulation of electric bike using Scilab Xcos
IRJET Journal
 
Supply chain logistics : vehicle routing and scheduling
Supply chain logistics : vehicle  routing and  schedulingSupply chain logistics : vehicle  routing and  scheduling
Supply chain logistics : vehicle routing and scheduling
Retigence Technologies
 
IRJET- Business Scaling and Rebalancing in Shared Bicycle Systems
IRJET- Business Scaling and Rebalancing in Shared Bicycle SystemsIRJET- Business Scaling and Rebalancing in Shared Bicycle Systems
IRJET- Business Scaling and Rebalancing in Shared Bicycle Systems
IRJET Journal
 
Tteng 422 s2021 module 2b: Roundabout Capacity Analysis and Level of Service
Tteng 422  s2021 module 2b: Roundabout Capacity Analysis and Level of ServiceTteng 422  s2021 module 2b: Roundabout Capacity Analysis and Level of Service
Tteng 422 s2021 module 2b: Roundabout Capacity Analysis and Level of Service
Wael ElDessouki
 
Optimizing last mile delivery
Optimizing last mile deliveryOptimizing last mile delivery
Optimizing last mile delivery
YashwanthKumarYelama
 
5-Modal Split & Traffic Assignment-( Transportation and Traffic Engineering D...
5-Modal Split & Traffic Assignment-( Transportation and Traffic Engineering D...5-Modal Split & Traffic Assignment-( Transportation and Traffic Engineering D...
5-Modal Split & Traffic Assignment-( Transportation and Traffic Engineering D...
Hossam Shafiq I
 
Balancing the line by using heuristic method based on cpm in salbp –a case study
Balancing the line by using heuristic method based on cpm in salbp –a case studyBalancing the line by using heuristic method based on cpm in salbp –a case study
Balancing the line by using heuristic method based on cpm in salbp –a case study
eSAT Journals
 
Material Handling System
Material Handling SystemMaterial Handling System
Material Handling System
wombaty
 
Webinar: How to design express services on a bus transit network
Webinar: How to design express services on a bus transit networkWebinar: How to design express services on a bus transit network
Webinar: How to design express services on a bus transit network
BRTCoE
 
A feasible solution algorithm for a primitive vehicle routing problem
A feasible solution algorithm for a primitive vehicle routing problemA feasible solution algorithm for a primitive vehicle routing problem
A feasible solution algorithm for a primitive vehicle routing problem
Cem Recai Çırak
 
UCT Electrical Vehicle Infrastructure
UCT Electrical Vehicle InfrastructureUCT Electrical Vehicle Infrastructure
UCT Electrical Vehicle Infrastructure
The IOT Academy
 
Vocational training Report for ECE department
Vocational training  Report for ECE departmentVocational training  Report for ECE department
Vocational training Report for ECE department
Manoj Verma
 
City specific Cost Minimization for EV charging centers
City specific Cost Minimization for EV charging centersCity specific Cost Minimization for EV charging centers
City specific Cost Minimization for EV charging centers
IRJET Journal
 
A Dynamic Logistic Dispatching System With Set-Based Particle Swarm Optimization
A Dynamic Logistic Dispatching System With Set-Based Particle Swarm OptimizationA Dynamic Logistic Dispatching System With Set-Based Particle Swarm Optimization
A Dynamic Logistic Dispatching System With Set-Based Particle Swarm Optimization
Rajib Roy
 
toyota-Challenges towards New Software Platform for Automated Driving.pdf
toyota-Challenges towards New Software Platform for Automated Driving.pdftoyota-Challenges towards New Software Platform for Automated Driving.pdf
toyota-Challenges towards New Software Platform for Automated Driving.pdf
xmumiao
 
Multiple Vehicle Motion Planning: An Infinite Diminsion Newton Optimization M...
Multiple Vehicle Motion Planning: An Infinite Diminsion Newton Optimization M...Multiple Vehicle Motion Planning: An Infinite Diminsion Newton Optimization M...
Multiple Vehicle Motion Planning: An Infinite Diminsion Newton Optimization M...
AJHaeusler
 
Rail Technical Strategy, Key Capability 6 - More Space on Trains
Rail Technical Strategy, Key Capability 6 - More Space on TrainsRail Technical Strategy, Key Capability 6 - More Space on Trains
Rail Technical Strategy, Key Capability 6 - More Space on Trains
James Brown
 

Similar to Reinforcement Learning for EVRP with V2G (20)

Hybrid Planner for Smart Charging of Electric Fleets
Hybrid Planner for Smart Charging of Electric FleetsHybrid Planner for Smart Charging of Electric Fleets
Hybrid Planner for Smart Charging of Electric Fleets
 
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
 
IRJET- Real-Time Forecasting of EV Charging Station Scheduling for Smart Ener...
IRJET- Real-Time Forecasting of EV Charging Station Scheduling for Smart Ener...IRJET- Real-Time Forecasting of EV Charging Station Scheduling for Smart Ener...
IRJET- Real-Time Forecasting of EV Charging Station Scheduling for Smart Ener...
 
Simulation of electric bike using Scilab Xcos
Simulation of electric bike using Scilab XcosSimulation of electric bike using Scilab Xcos
Simulation of electric bike using Scilab Xcos
 
Supply chain logistics : vehicle routing and scheduling
Supply chain logistics : vehicle  routing and  schedulingSupply chain logistics : vehicle  routing and  scheduling
Supply chain logistics : vehicle routing and scheduling
 
IRJET- Business Scaling and Rebalancing in Shared Bicycle Systems
IRJET- Business Scaling and Rebalancing in Shared Bicycle SystemsIRJET- Business Scaling and Rebalancing in Shared Bicycle Systems
IRJET- Business Scaling and Rebalancing in Shared Bicycle Systems
 
Tteng 422 s2021 module 2b: Roundabout Capacity Analysis and Level of Service
Tteng 422  s2021 module 2b: Roundabout Capacity Analysis and Level of ServiceTteng 422  s2021 module 2b: Roundabout Capacity Analysis and Level of Service
Tteng 422 s2021 module 2b: Roundabout Capacity Analysis and Level of Service
 
Optimizing last mile delivery
Optimizing last mile deliveryOptimizing last mile delivery
Optimizing last mile delivery
 
5-Modal Split & Traffic Assignment-( Transportation and Traffic Engineering D...
5-Modal Split & Traffic Assignment-( Transportation and Traffic Engineering D...5-Modal Split & Traffic Assignment-( Transportation and Traffic Engineering D...
5-Modal Split & Traffic Assignment-( Transportation and Traffic Engineering D...
 
Balancing the line by using heuristic method based on cpm in salbp –a case study
Balancing the line by using heuristic method based on cpm in salbp –a case studyBalancing the line by using heuristic method based on cpm in salbp –a case study
Balancing the line by using heuristic method based on cpm in salbp –a case study
 
Material Handling System
Material Handling SystemMaterial Handling System
Material Handling System
 
Webinar: How to design express services on a bus transit network
Webinar: How to design express services on a bus transit networkWebinar: How to design express services on a bus transit network
Webinar: How to design express services on a bus transit network
 
A feasible solution algorithm for a primitive vehicle routing problem
A feasible solution algorithm for a primitive vehicle routing problemA feasible solution algorithm for a primitive vehicle routing problem
A feasible solution algorithm for a primitive vehicle routing problem
 
UCT Electrical Vehicle Infrastructure
UCT Electrical Vehicle InfrastructureUCT Electrical Vehicle Infrastructure
UCT Electrical Vehicle Infrastructure
 
Vocational training Report for ECE department
Vocational training  Report for ECE departmentVocational training  Report for ECE department
Vocational training Report for ECE department
 
City specific Cost Minimization for EV charging centers
City specific Cost Minimization for EV charging centersCity specific Cost Minimization for EV charging centers
City specific Cost Minimization for EV charging centers
 
A Dynamic Logistic Dispatching System With Set-Based Particle Swarm Optimization
A Dynamic Logistic Dispatching System With Set-Based Particle Swarm OptimizationA Dynamic Logistic Dispatching System With Set-Based Particle Swarm Optimization
A Dynamic Logistic Dispatching System With Set-Based Particle Swarm Optimization
 
toyota-Challenges towards New Software Platform for Automated Driving.pdf
toyota-Challenges towards New Software Platform for Automated Driving.pdftoyota-Challenges towards New Software Platform for Automated Driving.pdf
toyota-Challenges towards New Software Platform for Automated Driving.pdf
 
Multiple Vehicle Motion Planning: An Infinite Diminsion Newton Optimization M...
Multiple Vehicle Motion Planning: An Infinite Diminsion Newton Optimization M...Multiple Vehicle Motion Planning: An Infinite Diminsion Newton Optimization M...
Multiple Vehicle Motion Planning: An Infinite Diminsion Newton Optimization M...
 
Rail Technical Strategy, Key Capability 6 - More Space on Trains
Rail Technical Strategy, Key Capability 6 - More Space on TrainsRail Technical Strategy, Key Capability 6 - More Space on Trains
Rail Technical Strategy, Key Capability 6 - More Space on Trains
 

More from Prasant Misra

How can Mobility be used to solve Social Problems?
How can Mobility be used to solve Social Problems?How can Mobility be used to solve Social Problems?
How can Mobility be used to solve Social Problems?
Prasant Misra
 
A Short Course on the Internet of Things
A Short Course on the Internet of ThingsA Short Course on the Internet of Things
A Short Course on the Internet of Things
Prasant Misra
 
Emerging Networking Technologies for Industrial Applications
Emerging Networking Technologies for Industrial ApplicationsEmerging Networking Technologies for Industrial Applications
Emerging Networking Technologies for Industrial Applications
Prasant Misra
 
Energy Efficient GPS Acquisition with Sparse-GPS
Energy Efficient GPS Acquisition with Sparse-GPSEnergy Efficient GPS Acquisition with Sparse-GPS
Energy Efficient GPS Acquisition with Sparse-GPS
Prasant Misra
 
The NEEDS vs. the WANTS in IoT
The NEEDS vs. the WANTS in IoTThe NEEDS vs. the WANTS in IoT
The NEEDS vs. the WANTS in IoT
Prasant Misra
 
Link Layer Protocols for WSN-based IoT
Link Layer Protocols for WSN-based IoTLink Layer Protocols for WSN-based IoT
Link Layer Protocols for WSN-based IoT
Prasant Misra
 
Challenges in Analytics for BIG Data
Challenges in Analytics for BIG DataChallenges in Analytics for BIG Data
Challenges in Analytics for BIG Data
Prasant Misra
 

More from Prasant Misra (7)

How can Mobility be used to solve Social Problems?
How can Mobility be used to solve Social Problems?How can Mobility be used to solve Social Problems?
How can Mobility be used to solve Social Problems?
 
A Short Course on the Internet of Things
A Short Course on the Internet of ThingsA Short Course on the Internet of Things
A Short Course on the Internet of Things
 
Emerging Networking Technologies for Industrial Applications
Emerging Networking Technologies for Industrial ApplicationsEmerging Networking Technologies for Industrial Applications
Emerging Networking Technologies for Industrial Applications
 
Energy Efficient GPS Acquisition with Sparse-GPS
Energy Efficient GPS Acquisition with Sparse-GPSEnergy Efficient GPS Acquisition with Sparse-GPS
Energy Efficient GPS Acquisition with Sparse-GPS
 
The NEEDS vs. the WANTS in IoT
The NEEDS vs. the WANTS in IoTThe NEEDS vs. the WANTS in IoT
The NEEDS vs. the WANTS in IoT
 
Link Layer Protocols for WSN-based IoT
Link Layer Protocols for WSN-based IoTLink Layer Protocols for WSN-based IoT
Link Layer Protocols for WSN-based IoT
 
Challenges in Analytics for BIG Data
Challenges in Analytics for BIG DataChallenges in Analytics for BIG Data
Challenges in Analytics for BIG Data
 

Recently uploaded

South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)
Academy of Science of South Africa
 
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
Nguyen Thanh Tu Collection
 
How to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRMHow to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRM
Celine George
 
PIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf IslamabadPIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf Islamabad
AyyanKhan40
 
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
Priyankaranawat4
 
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptxChapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
Digital Artefact 1 - Tiny Home Environmental Design
Digital Artefact 1 - Tiny Home Environmental DesignDigital Artefact 1 - Tiny Home Environmental Design
Digital Artefact 1 - Tiny Home Environmental Design
amberjdewit93
 
Pengantar Penggunaan Flutter - Dart programming language1.pptx
Pengantar Penggunaan Flutter - Dart programming language1.pptxPengantar Penggunaan Flutter - Dart programming language1.pptx
Pengantar Penggunaan Flutter - Dart programming language1.pptx
Fajar Baskoro
 
How to deliver Powerpoint Presentations.pptx
How to deliver Powerpoint  Presentations.pptxHow to deliver Powerpoint  Presentations.pptx
How to deliver Powerpoint Presentations.pptx
HajraNaeem15
 
Wound healing PPT
Wound healing PPTWound healing PPT
Wound healing PPT
Jyoti Chand
 
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptxNEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
iammrhaywood
 
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective UpskillingYour Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Excellence Foundation for South Sudan
 
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UPLAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
RAHUL
 
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem studentsRHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
Himanshu Rai
 
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching AptitudeUGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
S. Raj Kumar
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
WaniBasim
 
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
GeorgeMilliken2
 
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdfবাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
eBook.com.bd (প্রয়োজনীয় বাংলা বই)
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
Nguyen Thanh Tu Collection
 
Chapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptxChapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptx
Denish Jangid
 

Recently uploaded (20)

South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)
 
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
 
How to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRMHow to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRM
 
PIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf IslamabadPIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf Islamabad
 
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
 
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptxChapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
 
Digital Artefact 1 - Tiny Home Environmental Design
Digital Artefact 1 - Tiny Home Environmental DesignDigital Artefact 1 - Tiny Home Environmental Design
Digital Artefact 1 - Tiny Home Environmental Design
 
Pengantar Penggunaan Flutter - Dart programming language1.pptx
Pengantar Penggunaan Flutter - Dart programming language1.pptxPengantar Penggunaan Flutter - Dart programming language1.pptx
Pengantar Penggunaan Flutter - Dart programming language1.pptx
 
How to deliver Powerpoint Presentations.pptx
How to deliver Powerpoint  Presentations.pptxHow to deliver Powerpoint  Presentations.pptx
How to deliver Powerpoint Presentations.pptx
 
Wound healing PPT
Wound healing PPTWound healing PPT
Wound healing PPT
 
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptxNEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
 
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective UpskillingYour Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective Upskilling
 
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UPLAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
 
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem studentsRHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
 
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching AptitudeUGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
 
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
 
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdfবাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
 
Chapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptxChapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptx
 

Reinforcement Learning for EVRP with V2G

  • 1. | Copyright © 2021 Tata Consultancy Services Limited TCS Internal | Copyright © 2021 Tata Consultancy Services Limited A Reinforcement Learning Approach for Electric Vehicle Routing Problem with Vehicle-to-Grid Supply Ajay Narayanan, Prasant Misra, Ankush Ojha, Vivek Bandhu, Supratim Ghosh, Arunchandar Vasan Tata Consultancy Services – Research, India Adaptive and Learning Agents Workshop
  • 2. | Copyright © 2021 Tata Consultancy Services Limited TCS Internal Problem Overview Objective (What?): Design a scalable vehicle routing algorithm that reduces the trip cost for a fleet of last-mile delivery EVs Challenges ▪ Delivery demand (i.e., goods/energy) is present at specific time periods and locations; the supply (i.e., delivery EVs) needs to match that demand (both in space and time) for making a successful transaction ▪ The computationally difficult VRP problem becomes even harder with the introduction of new constraints related to EVs and energy discharge ▪ Existing solutions do not scale well to large problem instances in quick time with reasonable solution accuracy Motivation (Why?) ▪ Last-mile delivery is the most expensive part of the logistic and e-commerce process; accounts for more than 50% of the overall shipping expenses ▪ EV adoption in last-mile delivery operations is increasing; expected to grow to 8 million by 2030 ▪ (Last-mile EVs + Multi-service operation model) can lead to additional cost reduction of the last-mile delivery segment; which is over and above the cost benefits that come naturally from electrification! Approach (HOW?): Design a routing algorithm for EV fleets using reinforcement learning (RL) that learns optimal routing policies for a multi-service (delivery of good + delivery of energy) operation model 2 Proposal: New (multi-service) delivery model using last-mile EVs: ✓ Delivery of goods to customer (MANDATORY) + ✓ Delivery (or selling) of energy to the grid during its period of peak demand (OPTIONAL) Electricity Grid Deliver/Sell Energy Deliver Goods Customers
  • 3. | Copyright © 2021 Tata Consultancy Services Limited TCS Internal Problem Description 3 Demand EV Customer Supply Goods Time Time Goods EV Energy Time Time Energy Supply Demand Grid Fleet Operator Problem: ▪ GIVEN (a priori): (i) Customer order fulfillment list (from fulfillment center) (ii) Peak energy demand periods (from grid) (iii) Starting state-of-charge (SOC) of all EVs in the fleet (Q) ▪ CONSTRAINT: (i) (Mandatory) EVs must complete all customer deliveries (ii) (Optional) EVs can sell power to the grid, where possible (iii) The entire trip (depot-depot) must be managed within Q; without recharging ▪ Assumption: (i) All EVs are charged to Q at the depot, before trip commencement ▪ DECIDE (offline): Route of all EVs in the fleet ▪ OBJECTIVE: Minimize trip cost of the EV Fleet Customer (demand for goods) Discharging Station (demand for energy)
  • 4. | Copyright © 2021 Tata Consultancy Services Limited TCS Internal System Model 4 ⅰ j ⅰ Customer Set 𝐾 Discharging Station Set 𝑃 EV Set 𝑋 {𝑣0} Complete Undirected Graph 𝐺 = (𝑉, 𝐸) ▪ Set of nodes 𝑉 = {𝑣0} ∪ 𝐾 ∪ 𝑃 ▪ Set of edges 𝐸 = { (𝑖, 𝑗 ) : 𝑖, 𝑗 ∈ 𝑉, 𝑖 ≠𝑗 } Parameters C carrying capacity of each EV Q starting state-of-charge (SoC) of each EV d𝑖j distance between nodes 𝑖 and 𝑗 t𝑖j travel time between nodes 𝑖 and 𝑗 H charge consumption rate (kWh/km) b𝑖j energy consumed in travelling between 𝑖 and 𝑗 (= H . d𝑖j ) 𝑐𝑖 demand (of goods) at node 𝑖 𝑠𝑖 service time at node 𝑖 𝑒𝑖 earliest start of service at node 𝑖 𝑙𝑖 latest start of service at node 𝑖 ⅰ R discharging rate of each discharging station 𝐺𝑖 1 start time – grid demand at node 𝑖 𝐺𝑖 2 end time – grid demand at node 𝑖 Decision Variables α𝑖j indicates, if edge 𝑖𝑗 is traversed by an EV (binary) γ𝑖 service time at discharging station node 𝑖 τ𝑖 time-of-arrival at node 𝑖 Ɵ𝑖 remaining battery capacity on arrival at node 𝑖 λ𝑖 remaining cargo on arrival at node 𝑖 ▪ |X| = u ▪ |K| = m ▪ |P| = n
  • 5. | Copyright © 2021 Tata Consultancy Services Limited TCS Internal Optimization Problem 5 M = 𝑚𝑖𝑛 (𝑌1. ෍ 𝑑𝑖𝑗 . 𝛼𝑖𝑗 + 𝑌2. ෍ 𝛼0𝑖 − 𝑌3. ෍ 𝛾𝑖 ) total distance travelled by EVs total time spend by EVs at discharging stations total EVs used in a trip [C1]: Ensure every customer is visited exactly once, while making it optional to visit any of the discharging stations ෍ 𝑗 ∈ 𝑉, 𝑖 ≠ 𝑗 𝛼𝑖𝑗 =1 ∀𝑖 ∈ 𝐾 Objective: Minimize the Trip Cost of the EV Fleet Constraints ෍ 𝑗 ∈ 𝑉, 𝑖 ≠ 𝑗 𝛼𝑖𝑗 = ෍ 𝑗 ∈ 𝑉, 𝑖 ≠ 𝑗 𝛼𝑗𝑖 ∀𝑖 ∈ 𝑉 [C2]: Ensure flow conservation (at each node: # incoming edges = # outgoing edges)
  • 6. | Copyright © 2021 Tata Consultancy Services Limited TCS Internal Optimization Problem 6 [C3]: Ensure time feasibility of arcs leaving customers and the depot 𝜏𝑖 + (𝑡𝑖𝑗 + 𝑠𝑖) . 𝛼𝑖𝑗 − 𝑙0 . 1 − 𝛼𝑖𝑗 ≤ 𝜏𝑗 ∀𝑖 ∈ 𝐾, ∀𝑖 ∈ 𝑉, 𝑖 ≠ 𝑗 [C4]: Ensure time feasibility of arcs leaving discharging stations and the depot 𝜏𝑖 + (𝑡𝑖𝑗 + 𝑠𝑖) . 𝛼𝑖𝑗 − 𝑙0 . 1 − 𝛼𝑖𝑗 ≤ 𝜏𝑗 ∀𝑖 ∈ 𝑃, ∀𝑗 ∈ 𝑉, 𝑖 ≠ 𝑗 [C5]: Ensure that each customer node is visited within its time window 𝑒𝑖 ≤ 𝜏𝑖 ≤ 𝑙𝑖 ∀𝑖 ∈ 𝐾 [C6]: Ensure remaining charge (energy) feasibility for arcs leaving customers and the depot 0 ≤ 𝜃𝑗 ≤ 𝜃𝑖 − 𝐻 . 𝑑𝑖𝑗 . 𝛼𝑖𝑗 + 𝑄 1 − 𝛼𝑖𝑗 ∀𝑖 ∈ 𝐾, ∀𝑗 ∈ 𝑉, 𝑖 ≠ 𝑗 [C7]: Ensure remaining charge (energy) feasibility for arcs leaving discharging stations and the depot 0 ≤ 𝜃𝑗 ≤ 𝜃𝑖 − 𝐻 . 𝑑𝑖𝑗 + 𝑅 . 𝛾𝑗 . 𝛼𝑖𝑗 + 𝑄 1 − 𝛼𝑖𝑗 ∀𝑖 ∈ 𝑃, ∀𝑗 ∈ 𝑉, 𝑖 ≠ 𝑗 [C8]: Ensure all customer demands are fulfilled 0 ≤ λ𝑗 ≤ λ𝑖 − 𝑐𝑖𝑗. 𝛼𝑖𝑗 + 𝐶 1 − 𝛼𝑖𝑗 ∀𝑖, 𝑗 ∈ 𝑉 [C9 & 10]: Ensure discharge service time aligns with grid peak demand period, if discharging station is visited 𝜏𝑖 ≥ 𝐺𝑖 1 ∀𝑖 ∈ 𝑃, 𝜏𝑖 ≠ 0 𝜏𝑖 + 𝛾𝑖 ≤ 𝐺𝑖 2 ∀𝑖 ∈ 𝑃, 𝜏𝑖 ≠ 0
  • 7. | Copyright © 2021 Tata Consultancy Services Limited TCS Internal RL Representation 7 ⅰ j u Masking scheme for finding FEASIBLE (Vehicle -> Node) pairs ▪ The earliest arrival time at node 𝑗 violates the time window constraint [C5, C9, C10] ▪ Node 𝑗 is a customer; and the current SoC of the vehicle cannot support the complete trip from node 𝑖 to node 𝑗 and back to the depot [C6] ▪ Node 𝑗 is a discharging station; and the current SoC of the vehicle cannot support the complete trip from node 𝑖 to node 𝑗 and back to the depot; as well as the discharge operation at node 𝑗 [C7] ▪ Node 𝑗 is a customer with unfulfilled demand that is either nil or exceeds the remaining carrying load of vehicle 𝑢 [C8] System decision := find routes that minimize the trip cost of the fleet, subject to constraints Routing decision := which vehicle 𝑢 should be assigned to which node 𝑖 ? Are all FREE vehicles assigned ? NO
  • 8. | Copyright © 2021 Tata Consultancy Services Limited TCS Internal RL Representation 8 j i u State Variables 𝑏𝑖𝑗 energy consumed in travelling between nodes 𝑖 and 𝑗 (proxy for distance travelled between nodes 𝑖 and 𝑗) z𝑖 energy spent at node 𝑖 (0, if at customer; else z𝑖) Idepo flag: indicates if vehicle u 𝑖𝑠 starting from the depot Icust flag: indicates if node 𝑖 is a customer 𝑤𝑢 𝑖 wait time of vehicle 𝑢 at node 𝑖 before it can start service Normalizing Factor 𝐸 energy required to travel the diagonal length of the graph 𝐸 - - 𝑇 Decision time horizon State 𝑺𝒕(𝒖, 𝒊) Action 𝑨𝒕(𝒖, 𝒊) ▪ Assign vehicle 𝑢 → 𝑛𝑜𝑑𝑒 𝑖 ▪ If vehicle 𝑢 is BUSY; perform local update (i) remove assigned 𝑛𝑜𝑑𝑒 𝑖 from the service list; (ii) update distance and time ▪ If the assignment of all FREE vehicles is done, perform global update and get reward Routing decision := from all feasible 𝑢, 𝑖 pairs, which pair is the best choice? Reward 𝑹𝒕(𝒖, 𝒊) − 𝐴1 ∗ 𝑏𝑖𝑗 𝐴1 = 0.15 -VE reward for choosing longer route segments + 𝐴2 ∗ 𝑧𝑖 𝐴2 = 0.001 +VE reward for visiting discharging station node + 𝐴3 ∗ 𝐼𝑐𝑢𝑠𝑡 𝐴3 = 0.15 +VE reward for visiting customer node − (𝐴4 ∗ 𝑤𝑢 𝑖 ) 𝐴1 = 0.15 -VE reward for assignments that lead to waiting time − (𝐴5 ∗ 𝐼𝑑𝑒𝑝𝑜) 𝐴1 = 0.55 -VE reward for sending new vehicles from the depot
  • 9. | Copyright © 2021 Tata Consultancy Services Limited TCS Internal RL Training Algorithm 9 1) Initialize the neural network with weights 𝜙 2) Initialize batch size 𝛽, replay buffer 𝐵 3) :FOR <Episode> = 1 :TO <Total-Num-Episodes> :DO i. Randomly choose data instance from training set ii. Initialize environment iii. :WHILE t < T :DO a. Create a copy of the environment for local updates b. :WHILE <free-vehicle> is unassigned :DO i. Find feasible (vehicle 𝑢, node 𝑖) pairs ∀ 𝑢 (whether free or busy) a. :IF no feasible pairs found, then BREAK ii. Calculate 𝑞𝑡 (𝑢, 𝑖) = 𝜙(𝑆𝑡(𝑢, 𝑖)) ∀ feasible (vehicle 𝑢, node 𝑖) pairs, regardless of the current state of each vehicle 𝑢 (busy or free) iii. Choose (𝑢, 𝑖) pair with highest 𝑞𝑡 value (using 𝜖-greedy assignment) iv. Perform local update on the environment copy c. Execute new (𝑢, 𝑖) assignments in the global environment and get reward 𝑅𝑡(𝑢, 𝑖) d. Add [𝑆𝑡(𝑢, 𝑖); 𝑅𝑡(𝑢, 𝑖); 𝑞𝑡(𝑢, 𝑖)] to replay buffer iv. Delete oldest entries in B if size exceeds buffer capacity v. Draw 𝛽 samples from B vi. Update 𝜙 by minimizing MSE loss between 𝑞𝑡(𝑢, 𝑖) and 𝑅𝑡 (𝑢, 𝑖)
  • 10. | Copyright © 2021 Tata Consultancy Services Limited TCS Internal A Representative Example for the RL Approach 10 E=15; t=25 E=25; t=35 E=05; t=15 E=25; t=35 Depot [0,200] Cus-1 [0,55] Cus-3 [45,75] Cus-2 [10,45] DS-1 [20,75] Depot [0,200] Cus-1 [0,55] Cus-3 [45,75] Cus-2 [10,45] DS-1 [20,75] Veh-2 (Q=100) Veh-1 (Q=100) State(Veh-1, Cus-1) State(Veh-1, Cus-2) State(Veh-2, Cus-1) State(Veh-2, Cus-2) Argmax State(Veh-1, Cus-1) State(Veh-1, DS-1) State(Veh-2, Cus-1) Argmax Val(Veh-1, DS-1) Environment: Global Update + Reward Collection t=0 Depot [0,200] Cus-1 [0,55] Cus-3 [45,75] Cus-2 [10,45] DS-1 [20,75] Veh-1 (Q=70) Veh-2 (Q=100) t=45 Depot [0,200] Cus-1 [0,55] Cus-3 [45,75] Cus-2 [10,45] DS-1 [20,75] Veh-1 (Q=70) Veh-2 (Q=75) (Veh-1, Cus-2) (update GLOBAL environment IFF all free vehicles are assigned; else update LOCAL environment copy) (Veh-2, Cus-1)
  • 11. | Copyright © 2021 Tata Consultancy Services Limited TCS Internal A Representative Example for the RL Approach 11 E=15; t=25 E=25; t=35 E=05; t=15 E=25; t=35 Depot [0,200] Cus-1 [0,55] Cus-3 [45,75] Cus-2 [10,45] DS-1 [20,75] State(Veh-1, DS-1) State(Veh-2, DS-1) State(Veh-2, Cus-3) Argmax State(Veh-2, Cus-3) State(Veh-2, DS-1) Argmax Environment: Global Update + Reward Collection t=80 (Veh-1, DS-1) (update GLOBAL environment IFF all free vehicles are assigned; else update LOCAL environment copy) t=45 Depot [0,200] Cus-1 [0,55] Cus-3 [45,75] Cus-2 [10,45] DS-1 [20,75] Veh-1 (Q=70) Veh-2 (Q=75) Cus-3 [45,75] DS-1 [20,75] Veh-1 (Q=65) Depot [0,200] Cus-1 [0,55] Cus-2 [10,45] Veh-2 (Q=75) Battery Constraint (Veh-2, Cus-3) Cus-3 [45,75] DS-1 [20,75] Veh-1 (Q=65) Depot [0,200] Cus-1 [0,55] Cus-2 [10,45] Veh-2 (Q=60) DS-1 [20,75] Cus-3 [45,75]
  • 12. | Copyright © 2021 Tata Consultancy Services Limited TCS Internal A Representative Example for the RL Approach 12 E=15; t=25 E=25; t=35 E=05; t=15 E=25; t=35 Depot [0,200] Cus-1 [0,55] Cus-3 [45,75] Cus-2 [10,45] DS-1 [20,75] State(Veh-1, Depot) State(Veh-2, Depot) Argmax State(Veh-1, Depot) Argmax Val(Veh-2, Depot) Environment: Global Update + Reward Collection t=155 (Veh-2, Depot) (update GLOBAL environment IFF all free vehicles are assigned; else update LOCAL environment copy) t=80 t=80 (Veh-1, Depot) Depot [0,200] Cus-1 [0,55] Cus-2 [10,45] Veh-2 (Q=60) Veh-1 (Q=65) DS-1 [20,75] Cus-3 [45,75] Depot [0,200] Cus-1 [0,55] Cus-2 [10,45] Veh-2 (Q=60) DS-1 [20,75] Cus-3 [45,75] Veh-1 (Q=65) Depot [0,200] Cus-1 [0,55] Cus-2 [10,45] Veh-2 (Q=60) DS-1 [20,75] Cus-3 [45,75] Veh-1 (Q=65)
  • 13. | Copyright © 2021 Tata Consultancy Services Limited TCS Internal Evaluation 13 Neural Network Architecture Layer # Description Neurons Justification Activation Func 1 Input Layer 5 5 state variables - 2 FC 12 ReLU 3 FC 6 ReLU 4 FC 3 ReLU 4 Output Layer 1 - Hyperparameters Parameter Value Parameter Value Optimizer Adam Exploration factor Linear decay from 1 to 0 over 75 episodes Batch Size (β) 16 Exploration policy ε-greedy Replay Buffer Size (B) 5000 Learning Rate 0.001 Training Epochs per Episode Min -> # customers ▪ Implementation framework: PyTorch ▪ Training: 200 episodes (random combination of customers; discharge stations, vehicles) ▪ Testing: Solomon Dataset [6-10 instances per dataset] ▪ Clustered (CL) ▪ Random (RA) ▪ Random Clustered (RC) RL Training
  • 14. | Copyright © 2021 Tata Consultancy Services Limited TCS Internal Mean Optimality Gap between GA and RL: 17.23% (std. dev. 7.98%) Mean Optimality Gap between GA and RL: 22.38% (std. dev. 8.35%) Mean Optimality Gap between GA and RL: 20.02% (std. dev. 6.58%) 1. Accuracy: GA is better than RL by an average of 19.8% (range 8.3% - 36.52%) 2. Execution Time: RL is faster than GA by an average of 24 times
  • 15. | Copyright © 2021 Tata Consultancy Services Limited TCS Internal Table shows specific instances where the MILP formulation converges within a reasonable amount of time GA accuracy is as a good as MILP Evaluation
  • 16. | Copyright © 2021 Tata Consultancy Services Limited TCS Internal Final Remarks Summary ▪ We model the electric vehicle routing problem with constraints on loading capacity; time window; vehicle-to- grid energy supply (CEVRPTW-D) ; and formulate the multi-objective optimization problem to minimize the trip cost of the fleet. ▪ We design QuikRouteFinder: a value-based reinforcement learning algorithm by defining the (state, action) space, and engineer the reward signal for the RL agent to find the cost-effective delivery routes. ▪ We design and implement a genetic algorithm (GA) metaheuristic to derive optimal results for CEVRPTW-D. ▪ Using Solomon datasets, we evaluate and compare the computation speed and solution accuracy of the proposed RL model against GA and MILP. Key Finding QuikRouteFinder (based on RL) is 24 times faster than the GA and MILP baselines in terms of solutioning speed, but with ≈ 20% decrease in solution quality. 16