SlideShare a Scribd company logo
1 of 29
Download to read offline
Generation
company
Wholesale market/transmission grid
Generation
company
Generation
company
Generation
company
Retailer Retailer Retailer
Large
consumers
Large
prosumers
Consumers Consumers Consumers Prosumers
Dominant model in Europe for
more than ten years.
When you buy electricity, you buy
a quantity of electrical energy
that will be delivered over a
specific time period. In the EU,
this is typically 15 or 30 minutes.
The current market structure is
evolving with new players
appearing (renewable energy
communities, etc.) and
consumer-centric models
becoming popular.
Energy sales
The retail model for electricity markets
Electricity markets – Overview
Forward/Future
market
Day-ahead
market
Intraday
market
Balancing
market
Ancillary
services
Delivery
Year Y
Quarter Q
Month M
Day D
Time t
Y – 6 years D - 1
00:00
D - 1
12:00
D - 1
16:00
t – 15 min
3
Y – a few days
Q - a few days
M - a few days
Regulation market (balancing)
Note: time at which the markets are closing may change from one country to another.
Prices for monthly, quarterly and yearly products on
the forward markets: an example
Why deploy RL for interacting with electricity
markets?
Reinforcement learning (RL) refers to a set of techniques that enable one to solve a
complex stochastic sequential optimal decision-making problem solely from the
information generated through interaction with the environment defining the dynamics of
the problem and its reward function.
Reason #1: RL techniques target the maximization of a numerical signal (monetisation of
the RL agent behaviour) which is convenient when you want to maximize profits.
Reason #2: Retailers, generators and consumers interacting with electricity markets are
interacting with a sequence of markets. In these markets, they must buy/sell the exact
amount of power they consume/produce for every market period or be exposed to the
balancing market. The many loads/generation devices/batteries whose flexibility can be
exploited to maximize gains also lead to a time-coupling of the decisions. Hence the
problem is naturally sequential.
Reason #3: Problems are highly stochastic and it is often very difficult to model the
environment in an appropriate way.
Ambitious research objective: To design an RL agent able to simultaneously interact near-
optimally with all these markets, whatever the player (producer, retailer, prosumer, etc.).
5
RL for interacting with the continuous intraday market
The continuous intraday market (CID) allows market participants to adjust their positions
through bilateral contracts. It is an opportunity for producers and consumers to make last-
minute adjustments and balance their positions closer to real time. This electricity market
authorises continuous trading, meaning that a trade is executed as soon as two orders
match.
We focus on the problem of an RL agent that controls a battery while interacting at the
same time with the intraday market to maximize its profits. The agent has a constraint to
be balanced for every market period. That leads to a decoupling with the balancing market.
Based on the following paper:
A Deep Reinforcement Learning Framework for Continuous Intraday Market Bidding
BOUKAS, IOANNIS; ERNST, DAMIEN; THÉATE, THIBAUT; BOLLAND, ADRIEN et al. In Machine
Learning (2021)
6
CID Market Design
We assume that the
agent can only grab
(partially or totally)
orders in the market or
change the setting of the
battery at the beginning
of every market period.
RL agent
Environment
State
Reward
Action
Numerical Value
The Generic RL Scheme
The learning agent
that controls the battery and
determines the interaction
with the CID market.
State: (i) the battery
state of charge (ii)
everything you know
about the market –
knowledge assumed to
be limited to the
contents of the order
book.
Reward: the money
made during the market
period.
Action: the power
generated by the
battery for the next
market period and the
market orders that
have been grabbed.
+ the energy
market
RL scheme applied to CID trading with a battery
Reducing the complexity of the problem
State-space: comprised of what exists between the space of all possible order
books and the set of possible states for the battery. Solution: reduction of the order
book by representing it through simple features.
Action space: Number of actions at one instant equal to the number of elements in
the order book plus one, and the power setting for the battery. Solution: defining
two high-level actions namely, Idle or Trade. The Trade action grabs orders and
computes all future settings of the battery to maximize profits under the
assumption that no new orders will be posted in the order book and that the
system should be balanced all the time.
Adversarial setting: When you play with markets, you are essentially in an
adversarial setting, but we have used an RL algorithm designed for a non-
adversarial setting for training our agent. This is a reasonable solution if you are not
a too big a player in the market. 10
The way the learning has been done
Trajectories have been generated by an RL agent interacting with the ‘training part’
of the order book and the battery. An Epsilon-greedy policy is used.
The Fitted Q Iteration (FQI) algorithm has been used for extracting high-performing
policies from the trajectories. Due to the finite time setting of the problem, one Q-
function has been learned for each time step t.
Approximation architecture made of one layer of LSTM neurons and five fully
connected feedd-forward layers.
The quality of the policy is assessed on the ‘testing parts’ of the order book. A
comparison is done with the so-called rolling intrinsic (RI) policy that applies the
Trade action at every instant t.
11
FQI policy RI policy
V is the profit per testing day and r is the probitability ratio with respect to the RI policy.
Testing would reflect the exact real-life performances of the FQI policy if its behavior would indeed not
influence the trading behavior of other agents.
Renewable Energy Communities
The new Renewable Energy Directive 2018/2001 introduces the concept of
Renewable Energy Communities (REC)
An REC is a goup of members connected to the main utility grid (often the
distribution network) who consume electricity and/or produce renewable
electricity. Batteries and/or flexible loads may be present in the REC.
Periodically, surplus electricity production is reallocated by the REC manager to the
consumers through a system of repartition keys. These determine the fraction of
the surplus of electricity production to be reallocated to each member. They define
the way electricity is shared among the different members.
13
Each member of the REC is associated with a
consumption meter (C) and a production meter
(P). At each discrete time step t, these meters
are incremented by the consumption and the
production of each member during the time
interval (t; t + 1], respectively. These meters are
monitored at each end of a metering period by
an Energy Management System (EMS) that
controls the flexibility devices. The EMS
computes optimal repartition keys with the
values read from all the meters of the REC
members and sends them to their respective
retailers. Retailers periodically (e.g., monthly)
compute the electricity bills for each REC
member based on the repartition keys and
sends them to their respective customers.
Controlling the REC
The problem of controlling the REC to maximize gains can be formalized as an optimal-
sequential decision-making problem. More information about this rather difficult
formalization:
Optimal Control of Renewable Energy Communities with Controllable Assets
AITTAHAR, SAMY; MANUEL DE VILLENA MILLAN, MIGUEL; CASTRONOVO, MICHAËL; BOUKAS,
IOANNIS et al. In E-print/Working paper (in press)
It is very important to jointly optimize the repartition keys and the controllable assets.
Reinforcement learning techniques combined with load/generation prediction
techniques can be used.
Sizing complex energy systems
Sizing an energy system refers to the process of computation of the investments that
will maximize benefits when the system is operated in a (near-)optimal way.
It has become more and more difficult to size modern
energy systems such as a Renewable Energy
Community because of the complexity of the
environment they are associated with (different types
of markets, dynamic grid fees, new devices such as e.g.
(autonomous) electrical vehicles, power-to-X devices
appearing, etc.).
The solution we propose: the use of newly developed reinforcement learning
algorithms for jointly optimizing the environment and the control policies.
Jointly designing and controlling a system
There are two paradigms for solving such problems:
(i) Mathematical programming → Does not account properly for
stochasticity, non-linearities, etc.
(ii) Sample-based (RL) techniques → Do not scale well to complex
problems. Example the Joint Optimization of Design and Control
(JODC) algorithm.
We propose a hybrid algorithm called Direct Environment and Policy Search
(DEPS). The algorithm lies in between reinforcement learning and model-
based optimization. It combines sample-based techniques with the
computation of derivatives of the dynamics and the reward function for
efficient solution search. More information:
Jointly Learning Environments and Control Policies with Projected
Stochastic Gradient Ascent BOLLAND, ADRIEN; BOUKAS, IOANNIS; BERGER,
MATHIAS; ERNST, DAMIEN. To be published in JAIR (2021)
Environment
Reward
𝑟𝑡 = 𝜌𝜓 𝑠𝑡, 𝑎𝑡, 𝜉𝑡
Make a decision
(action)
Dynamics
𝑠𝑡+1 = 𝑓𝜓 𝑠𝑡, 𝑎𝑡, 𝜉𝑡
Observe the state of the
system and the reward
Control agent Design agent
Direct Environment and Policy Search (DEPS)
1. Start with random environment parameters 𝜓 .
2. Define a control agent whose policy is modelled using a Deep
Neural Network with 𝜃 as parameters.
3. Initialise the parameters 𝜃 at random
4. Repeat iteratively
1. Simulate trajectories in the system with the control agent
2. From the trajectories, the equations of the system dynamics and of the
reward function compute the gradients V(𝜓, 𝜃) = 𝐸 σ 𝑟𝑡 with respect to
𝜃 and 𝜓.
3. Update the parameters:
𝜃 ← 𝜃 + 𝛼 𝛻𝜃V(𝜓, 𝜃)
𝜓 ← 𝜓 + 𝛼 𝛻𝜓V(𝜓, 𝜃)
Design choices: relates to the installed capacity of
PV panels, the battery capacity and the maximal
output of the diesel generator.
Control policy: sets the power output of the
battery and the generator.
Objective: minimizing the sum of the investment
and operational costs.
Joint design and control of a microgrid
Results: DEPS outperforms the baseline JODC.
It gives better overall performance and is a
more stable algorithm.
Joint design and control of a drone
Design choices: the morphology of the drone defined
by the parameters H, W, D and R.
Control policy: sets the speed of the four propellers.
Objective: flying as accurately as possible on an
ellipse.
Results:
Let us not forget the electrical grid!
Up to now we have focused on control/design problems relevant for
agents interacting with the electrical grid and energy markets.
However, grid operators are themselves facing complex control
problems related to energy transition and modifications of the control
structure of the electrical grid!
Reinforcement learning could help grid operators to design efficient
control strategies.
The grid before the quest for renewable energy
Distribution
Network (DN)
Transmission
Network (TN)
DN
DN
Advanced control strategies
implemented
Fit and forget doctrine
Direct Current (HVDC)
DN
TN
DN
DN DN
DN
TN
Backbone of the SuperGrid (or future
Global Grid) comprising HVDC links
Need for advanced
control strategies
Need for advanced
control strategies, often
called active network
management (ANMs)
schemes
Need for more-advanced
control strategies
The grid as it is becoming
Gym-ANM
Observation: Decision-making problems related to the electrical grid have drawn less
attention from the RL community than other fields over the last decade (e.g., video games,
robotics, autonomous driving). We believe this is because:
1. RL research relies heavily on the availability of software-based simulators to model the
real world during training.
2. Power systems are hard to simulate without extensive knowledge in power system
engineering.
3. This makes it difficult for RL researchers to start working on power system decision-
making problems.
Our solution: Provide an open-source software library (Gym-ANM) with which:
1. Power system experts can design new environments (tasks) that model Active Network
Management (ANM) problems.
2. RL researchers can train and evaluate the performance of their agents on existing tasks
(created in 1.).
More information: GYM-ANM: Reinforcement learning environments for active network management tasks in
electricity distribution systems HENRY, ROBIN; ERNST, DAMIEN in Energy and AI (2021), 5
Key Features of Gym-ANM
The environments (tasks) constructed with Gym-ANM follow the OpenAI
Gym framework, which a major portion of the RL community is already
using.
The different customisable components of Gym-ANM makes it suitable for
modelling a wide range of ANM tasks:
➢ Example: simple ones for educational purposes.
➢ Example: complex ones for advanced research.
Once designed, each environment can be used as a “blackbox” → RL
researchers need not be concerned with its implementation details.
Video example
Video also available on Youtube: https://youtu.be/D8kGH94kavY
References
A Deep Reinforcement Learning Framework for Continuous Intraday Market Bidding BOUKAS, IOANNIS; ERNST,
DAMIEN; THÉATE, THIBAUT; BOLLAND, ADRIEN et al. In Machine Learning (2021)
Optimal Control of Renewable Energy Communities with Controllable Assets
AITTAHAR, SAMY; MANUEL DE VILLENA MILLAN, MIGUEL; CASTRONOVO, MICHAËL; BOUKAS, IOANNIS et al. E-print/Working
paper (in press)
Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent
BOLLAND, ADRIEN; BOUKAS, IOANNIS; BERGER, MATHIAS; ERNST, DAMIEN. To be published in JAIR (2021)
GYM-ANM: Reinforcement learning environments for active network management tasks in electricity
distribution systems HENRY, ROBIN; ERNST, DAMIEN. In Energy and AI (2021), 5
GitHub repository (3K+ downloads after 6 months of release) for Gym-ANM:
https://github.com/robinhenry/gym-anm and the online documentation: https://gym-
anm.readthedocs.io/en/latest/index.html
Reinforcement learning for electrical markets and the energy transition

More Related Content

What's hot

Introduction to smart grids
Introduction to smart gridsIntroduction to smart grids
Introduction to smart gridsWaqasai
 
Presentation on Smart Grid
Presentation on Smart GridPresentation on Smart Grid
Presentation on Smart Gridnickitran
 
Smart Grid- By Rahul Mehra
Smart Grid- By Rahul MehraSmart Grid- By Rahul Mehra
Smart Grid- By Rahul MehraRahul Mehra
 
The Benefits of Smart Grid Technology for Buildings, Cities, and Sustainability
The Benefits of Smart Grid Technology for Buildings, Cities, and SustainabilityThe Benefits of Smart Grid Technology for Buildings, Cities, and Sustainability
The Benefits of Smart Grid Technology for Buildings, Cities, and SustainabilityPenn Institute for Urban Research
 
Renewable Energy Developments in IndiaRenewable energy developments
Renewable Energy Developments  in IndiaRenewable energy developmentsRenewable Energy Developments  in IndiaRenewable energy developments
Renewable Energy Developments in IndiaRenewable energy developmentschakri218
 
smart Grid
smart Gridsmart Grid
smart Gridmkanth
 
Microgriid PPT- Smart grid
Microgriid PPT- Smart gridMicrogriid PPT- Smart grid
Microgriid PPT- Smart gridrajatverma369
 
Smart Grid Technology
Smart Grid TechnologySmart Grid Technology
Smart Grid TechnologyAditya Jalan
 
Smart Grids: An Introduction
Smart Grids: An IntroductionSmart Grids: An Introduction
Smart Grids: An IntroductionHarshad Desai
 
Smart grid- Future technology
Smart grid- Future technologySmart grid- Future technology
Smart grid- Future technologyrajatverma369
 
Smart grid ppt seminar topic
Smart grid ppt seminar topic Smart grid ppt seminar topic
Smart grid ppt seminar topic ramesh kumawat
 
Selex ES @ Innovation Lab 2014-Smart Energy Innovation
Selex ES @ Innovation Lab 2014-Smart Energy InnovationSelex ES @ Innovation Lab 2014-Smart Energy Innovation
Selex ES @ Innovation Lab 2014-Smart Energy InnovationLeonardo
 
Short notes on Smart Grid
Short notes on Smart GridShort notes on Smart Grid
Short notes on Smart Gridislam_vip
 

What's hot (20)

Smart Grid Challenges
Smart Grid ChallengesSmart Grid Challenges
Smart Grid Challenges
 
Introduction to smart grids
Introduction to smart gridsIntroduction to smart grids
Introduction to smart grids
 
Smart grids
Smart gridsSmart grids
Smart grids
 
Presentation on Smart Grid
Presentation on Smart GridPresentation on Smart Grid
Presentation on Smart Grid
 
Smart Grid- By Rahul Mehra
Smart Grid- By Rahul MehraSmart Grid- By Rahul Mehra
Smart Grid- By Rahul Mehra
 
The Benefits of Smart Grid Technology for Buildings, Cities, and Sustainability
The Benefits of Smart Grid Technology for Buildings, Cities, and SustainabilityThe Benefits of Smart Grid Technology for Buildings, Cities, and Sustainability
The Benefits of Smart Grid Technology for Buildings, Cities, and Sustainability
 
Renewable Energy Developments in IndiaRenewable energy developments
Renewable Energy Developments  in IndiaRenewable energy developmentsRenewable Energy Developments  in IndiaRenewable energy developments
Renewable Energy Developments in IndiaRenewable energy developments
 
smart Grid
smart Gridsmart Grid
smart Grid
 
Microgriid PPT- Smart grid
Microgriid PPT- Smart gridMicrogriid PPT- Smart grid
Microgriid PPT- Smart grid
 
Smart grid
Smart gridSmart grid
Smart grid
 
Smart Grid Technology
Smart Grid TechnologySmart Grid Technology
Smart Grid Technology
 
Smart Grids: An Introduction
Smart Grids: An IntroductionSmart Grids: An Introduction
Smart Grids: An Introduction
 
Smart grid- Future technology
Smart grid- Future technologySmart grid- Future technology
Smart grid- Future technology
 
Smart grid ppt seminar topic
Smart grid ppt seminar topic Smart grid ppt seminar topic
Smart grid ppt seminar topic
 
Selex ES @ Innovation Lab 2014-Smart Energy Innovation
Selex ES @ Innovation Lab 2014-Smart Energy InnovationSelex ES @ Innovation Lab 2014-Smart Energy Innovation
Selex ES @ Innovation Lab 2014-Smart Energy Innovation
 
Smart grid
Smart gridSmart grid
Smart grid
 
Smart grid
Smart gridSmart grid
Smart grid
 
4.3_Modeling Microgrids with HOMER_Glassmire_EPRI/SNL Microgrid
4.3_Modeling Microgrids with HOMER_Glassmire_EPRI/SNL Microgrid4.3_Modeling Microgrids with HOMER_Glassmire_EPRI/SNL Microgrid
4.3_Modeling Microgrids with HOMER_Glassmire_EPRI/SNL Microgrid
 
Smart Grid
Smart GridSmart Grid
Smart Grid
 
Short notes on Smart Grid
Short notes on Smart GridShort notes on Smart Grid
Short notes on Smart Grid
 

Similar to Reinforcement learning for electrical markets and the energy transition

What are Balancing Services in power system
What are Balancing Services in power system What are Balancing Services in power system
What are Balancing Services in power system Power System Operation
 
PLEXOS applications
PLEXOS applicationsPLEXOS applications
PLEXOS applicationsTarun Reddy
 
Bi-Level Optimization based Coordinated Bidding Strategy of a Supplier in Ele...
Bi-Level Optimization based Coordinated Bidding Strategy of a Supplier in Ele...Bi-Level Optimization based Coordinated Bidding Strategy of a Supplier in Ele...
Bi-Level Optimization based Coordinated Bidding Strategy of a Supplier in Ele...IJERD Editor
 
Bidding strategies in deregulated power market
Bidding strategies in deregulated power marketBidding strategies in deregulated power market
Bidding strategies in deregulated power marketGautham Reddy
 
Market Based Criteria for Congestion Management and Transmission Pricing
Market Based Criteria for Congestion Management and Transmission PricingMarket Based Criteria for Congestion Management and Transmission Pricing
Market Based Criteria for Congestion Management and Transmission PricingIJERA Editor
 
Smart Grid Infrastructure for Efficient Power Consumption Using Real Time Pri...
Smart Grid Infrastructure for Efficient Power Consumption Using Real Time Pri...Smart Grid Infrastructure for Efficient Power Consumption Using Real Time Pri...
Smart Grid Infrastructure for Efficient Power Consumption Using Real Time Pri...ijtsrd
 
Electricity Market Day 2015 - Colas Chabanne, ENTSO-E WG Market Design and ...
Electricity Market Day 2015 -  Colas Chabanne, ENTSO-E  WG Market Design and ...Electricity Market Day 2015 -  Colas Chabanne, ENTSO-E  WG Market Design and ...
Electricity Market Day 2015 - Colas Chabanne, ENTSO-E WG Market Design and ...Fingrid Oyj
 
Executive summary Utilities opportunity and betterment
Executive summary Utilities opportunity and bettermentExecutive summary Utilities opportunity and betterment
Executive summary Utilities opportunity and bettermentSagar Zilpe
 
Real Time Pricing Simulator for Smart Grids
Real Time Pricing Simulator for Smart GridsReal Time Pricing Simulator for Smart Grids
Real Time Pricing Simulator for Smart GridsSwantika Dhundia
 
Capgemini ses - smart meter valuation model (gr)
Capgemini   ses - smart meter valuation model (gr)Capgemini   ses - smart meter valuation model (gr)
Capgemini ses - smart meter valuation model (gr)Gord Reynolds
 
Capacity Remuneration Mechanisms (CRMs)
Capacity Remuneration Mechanisms (CRMs)Capacity Remuneration Mechanisms (CRMs)
Capacity Remuneration Mechanisms (CRMs)Leonardo ENERGY
 
Disruptive Business Models for the Digitised Power Sector
Disruptive Business Models for the Digitised Power SectorDisruptive Business Models for the Digitised Power Sector
Disruptive Business Models for the Digitised Power SectorWorld Bank Infrastructure
 
A Time Series Analysis of Power Markets
A Time Series Analysis of Power MarketsA Time Series Analysis of Power Markets
A Time Series Analysis of Power MarketsIbrahim Khan
 
P20021 enel-x-monetizing-energy-assets-e book
P20021 enel-x-monetizing-energy-assets-e bookP20021 enel-x-monetizing-energy-assets-e book
P20021 enel-x-monetizing-energy-assets-e bookWalterLopez553897
 
Congestion Management Approaches in Deregulated Electricity Market: A Compreh...
Congestion Management Approaches in Deregulated Electricity Market: A Compreh...Congestion Management Approaches in Deregulated Electricity Market: A Compreh...
Congestion Management Approaches in Deregulated Electricity Market: A Compreh...Dr. Sudhir Kumar Srivastava
 
Deregulation in power industry
Deregulation in power industryDeregulation in power industry
Deregulation in power industryjerry patel
 
Market Analysis and Business Plan for Residential Energy Efficiency Services
Market Analysis and Business Plan for Residential Energy Efficiency ServicesMarket Analysis and Business Plan for Residential Energy Efficiency Services
Market Analysis and Business Plan for Residential Energy Efficiency ServicesAlbert Graells Vilella
 

Similar to Reinforcement learning for electrical markets and the energy transition (20)

Future Design of Electricity Markets
 Future Design of Electricity Markets Future Design of Electricity Markets
Future Design of Electricity Markets
 
What are Balancing Services in power system
What are Balancing Services in power system What are Balancing Services in power system
What are Balancing Services in power system
 
PLEXOS applications
PLEXOS applicationsPLEXOS applications
PLEXOS applications
 
What are Balancing Services ?
What are  Balancing Services ?What are  Balancing Services ?
What are Balancing Services ?
 
Bi-Level Optimization based Coordinated Bidding Strategy of a Supplier in Ele...
Bi-Level Optimization based Coordinated Bidding Strategy of a Supplier in Ele...Bi-Level Optimization based Coordinated Bidding Strategy of a Supplier in Ele...
Bi-Level Optimization based Coordinated Bidding Strategy of a Supplier in Ele...
 
IEEE2011
IEEE2011IEEE2011
IEEE2011
 
Bidding strategies in deregulated power market
Bidding strategies in deregulated power marketBidding strategies in deregulated power market
Bidding strategies in deregulated power market
 
Market Based Criteria for Congestion Management and Transmission Pricing
Market Based Criteria for Congestion Management and Transmission PricingMarket Based Criteria for Congestion Management and Transmission Pricing
Market Based Criteria for Congestion Management and Transmission Pricing
 
Smart Grid Infrastructure for Efficient Power Consumption Using Real Time Pri...
Smart Grid Infrastructure for Efficient Power Consumption Using Real Time Pri...Smart Grid Infrastructure for Efficient Power Consumption Using Real Time Pri...
Smart Grid Infrastructure for Efficient Power Consumption Using Real Time Pri...
 
Electricity Market Day 2015 - Colas Chabanne, ENTSO-E WG Market Design and ...
Electricity Market Day 2015 -  Colas Chabanne, ENTSO-E  WG Market Design and ...Electricity Market Day 2015 -  Colas Chabanne, ENTSO-E  WG Market Design and ...
Electricity Market Day 2015 - Colas Chabanne, ENTSO-E WG Market Design and ...
 
Executive summary Utilities opportunity and betterment
Executive summary Utilities opportunity and bettermentExecutive summary Utilities opportunity and betterment
Executive summary Utilities opportunity and betterment
 
Real Time Pricing Simulator for Smart Grids
Real Time Pricing Simulator for Smart GridsReal Time Pricing Simulator for Smart Grids
Real Time Pricing Simulator for Smart Grids
 
Capgemini ses - smart meter valuation model (gr)
Capgemini   ses - smart meter valuation model (gr)Capgemini   ses - smart meter valuation model (gr)
Capgemini ses - smart meter valuation model (gr)
 
Capacity Remuneration Mechanisms (CRMs)
Capacity Remuneration Mechanisms (CRMs)Capacity Remuneration Mechanisms (CRMs)
Capacity Remuneration Mechanisms (CRMs)
 
Disruptive Business Models for the Digitised Power Sector
Disruptive Business Models for the Digitised Power SectorDisruptive Business Models for the Digitised Power Sector
Disruptive Business Models for the Digitised Power Sector
 
A Time Series Analysis of Power Markets
A Time Series Analysis of Power MarketsA Time Series Analysis of Power Markets
A Time Series Analysis of Power Markets
 
P20021 enel-x-monetizing-energy-assets-e book
P20021 enel-x-monetizing-energy-assets-e bookP20021 enel-x-monetizing-energy-assets-e book
P20021 enel-x-monetizing-energy-assets-e book
 
Congestion Management Approaches in Deregulated Electricity Market: A Compreh...
Congestion Management Approaches in Deregulated Electricity Market: A Compreh...Congestion Management Approaches in Deregulated Electricity Market: A Compreh...
Congestion Management Approaches in Deregulated Electricity Market: A Compreh...
 
Deregulation in power industry
Deregulation in power industryDeregulation in power industry
Deregulation in power industry
 
Market Analysis and Business Plan for Residential Energy Efficiency Services
Market Analysis and Business Plan for Residential Energy Efficiency ServicesMarket Analysis and Business Plan for Residential Energy Efficiency Services
Market Analysis and Business Plan for Residential Energy Efficiency Services
 

More from Université de Liège (ULg)

Algorithms for the control and sizing of renewable energy communities
Algorithms for the control and sizing of renewable energy communitiesAlgorithms for the control and sizing of renewable energy communities
Algorithms for the control and sizing of renewable energy communitiesUniversité de Liège (ULg)
 
AI for energy: a bright and uncertain future ahead
AI for energy: a bright and uncertain future aheadAI for energy: a bright and uncertain future ahead
AI for energy: a bright and uncertain future aheadUniversité de Liège (ULg)
 
Extreme engineering for fighting climate change and the Katabata project
Extreme engineering for fighting climate change and the Katabata projectExtreme engineering for fighting climate change and the Katabata project
Extreme engineering for fighting climate change and the Katabata projectUniversité de Liège (ULg)
 
Ex-post allocation of electricity and real-time control strategy for renewabl...
Ex-post allocation of electricity and real-time control strategy for renewabl...Ex-post allocation of electricity and real-time control strategy for renewabl...
Ex-post allocation of electricity and real-time control strategy for renewabl...Université de Liège (ULg)
 
Harvesting wind energy in Greenland: a project for Europe and a huge step tow...
Harvesting wind energy in Greenland: a project for Europe and a huge step tow...Harvesting wind energy in Greenland: a project for Europe and a huge step tow...
Harvesting wind energy in Greenland: a project for Europe and a huge step tow...Université de Liège (ULg)
 
Décret favorisant le développement des communautés d’énergie renouvelable
Décret favorisant le développement des communautés d’énergie renouvelableDécret favorisant le développement des communautés d’énergie renouvelable
Décret favorisant le développement des communautés d’énergie renouvelableUniversité de Liège (ULg)
 
Harnessing the Potential of Power-to-Gas Technologies. Insights from a prelim...
Harnessing the Potential of Power-to-Gas Technologies. Insights from a prelim...Harnessing the Potential of Power-to-Gas Technologies. Insights from a prelim...
Harnessing the Potential of Power-to-Gas Technologies. Insights from a prelim...Université de Liège (ULg)
 
Reinforcement learning, energy systems and deep neural nets
Reinforcement learning, energy systems and deep neural nets Reinforcement learning, energy systems and deep neural nets
Reinforcement learning, energy systems and deep neural nets Université de Liège (ULg)
 
Soirée des Grands Prix SEE - A glimpse at the research work of the laureate o...
Soirée des Grands Prix SEE - A glimpse at the research work of the laureate o...Soirée des Grands Prix SEE - A glimpse at the research work of the laureate o...
Soirée des Grands Prix SEE - A glimpse at the research work of the laureate o...Université de Liège (ULg)
 
Reinforcement learning for data-driven optimisation
Reinforcement learning for data-driven optimisationReinforcement learning for data-driven optimisation
Reinforcement learning for data-driven optimisationUniversité de Liège (ULg)
 
Electricity retailing in Europe: remarkable events (with a special focus on B...
Electricity retailing in Europe: remarkable events (with a special focus on B...Electricity retailing in Europe: remarkable events (with a special focus on B...
Electricity retailing in Europe: remarkable events (with a special focus on B...Université de Liège (ULg)
 
Projet de décret « GRD »: quelques remarques du Prof. Damien ERNST
Projet de décret « GRD »: quelques remarques du Prof. Damien ERNSTProjet de décret « GRD »: quelques remarques du Prof. Damien ERNST
Projet de décret « GRD »: quelques remarques du Prof. Damien ERNSTUniversité de Liège (ULg)
 
Electrification and the Democratic Republic of the Congo
Electrification and the Democratic Republic of the CongoElectrification and the Democratic Republic of the Congo
Electrification and the Democratic Republic of the CongoUniversité de Liège (ULg)
 
Analyse du projet de décret relatif à la méthodologie tarifaire applicable a...
Analyse du projet de décret relatif à  la méthodologie tarifaire applicable a...Analyse du projet de décret relatif à  la méthodologie tarifaire applicable a...
Analyse du projet de décret relatif à la méthodologie tarifaire applicable a...Université de Liège (ULg)
 
Batteries and disrupting business models for the energy sector
Batteries and disrupting business models for the energy sectorBatteries and disrupting business models for the energy sector
Batteries and disrupting business models for the energy sectorUniversité de Liège (ULg)
 
Capacity mechanisms for improving security of supply: quick fixes or thoughtf...
Capacity mechanisms for improving security of supply: quick fixes or thoughtf...Capacity mechanisms for improving security of supply: quick fixes or thoughtf...
Capacity mechanisms for improving security of supply: quick fixes or thoughtf...Université de Liège (ULg)
 

More from Université de Liège (ULg) (20)

Algorithms for the control and sizing of renewable energy communities
Algorithms for the control and sizing of renewable energy communitiesAlgorithms for the control and sizing of renewable energy communities
Algorithms for the control and sizing of renewable energy communities
 
AI for energy: a bright and uncertain future ahead
AI for energy: a bright and uncertain future aheadAI for energy: a bright and uncertain future ahead
AI for energy: a bright and uncertain future ahead
 
Extreme engineering for fighting climate change and the Katabata project
Extreme engineering for fighting climate change and the Katabata projectExtreme engineering for fighting climate change and the Katabata project
Extreme engineering for fighting climate change and the Katabata project
 
Ex-post allocation of electricity and real-time control strategy for renewabl...
Ex-post allocation of electricity and real-time control strategy for renewabl...Ex-post allocation of electricity and real-time control strategy for renewabl...
Ex-post allocation of electricity and real-time control strategy for renewabl...
 
Big infrastructures for fighting climate change
Big infrastructures for fighting climate changeBig infrastructures for fighting climate change
Big infrastructures for fighting climate change
 
Harvesting wind energy in Greenland: a project for Europe and a huge step tow...
Harvesting wind energy in Greenland: a project for Europe and a huge step tow...Harvesting wind energy in Greenland: a project for Europe and a huge step tow...
Harvesting wind energy in Greenland: a project for Europe and a huge step tow...
 
Décret favorisant le développement des communautés d’énergie renouvelable
Décret favorisant le développement des communautés d’énergie renouvelableDécret favorisant le développement des communautés d’énergie renouvelable
Décret favorisant le développement des communautés d’énergie renouvelable
 
Harnessing the Potential of Power-to-Gas Technologies. Insights from a prelim...
Harnessing the Potential of Power-to-Gas Technologies. Insights from a prelim...Harnessing the Potential of Power-to-Gas Technologies. Insights from a prelim...
Harnessing the Potential of Power-to-Gas Technologies. Insights from a prelim...
 
Reinforcement learning, energy systems and deep neural nets
Reinforcement learning, energy systems and deep neural nets Reinforcement learning, energy systems and deep neural nets
Reinforcement learning, energy systems and deep neural nets
 
Soirée des Grands Prix SEE - A glimpse at the research work of the laureate o...
Soirée des Grands Prix SEE - A glimpse at the research work of the laureate o...Soirée des Grands Prix SEE - A glimpse at the research work of the laureate o...
Soirée des Grands Prix SEE - A glimpse at the research work of the laureate o...
 
Reinforcement learning for data-driven optimisation
Reinforcement learning for data-driven optimisationReinforcement learning for data-driven optimisation
Reinforcement learning for data-driven optimisation
 
Electricity retailing in Europe: remarkable events (with a special focus on B...
Electricity retailing in Europe: remarkable events (with a special focus on B...Electricity retailing in Europe: remarkable events (with a special focus on B...
Electricity retailing in Europe: remarkable events (with a special focus on B...
 
Projet de décret « GRD »: quelques remarques du Prof. Damien ERNST
Projet de décret « GRD »: quelques remarques du Prof. Damien ERNSTProjet de décret « GRD »: quelques remarques du Prof. Damien ERNST
Projet de décret « GRD »: quelques remarques du Prof. Damien ERNST
 
Belgian offshore wind potential
Belgian offshore wind potentialBelgian offshore wind potential
Belgian offshore wind potential
 
Electrification and the Democratic Republic of the Congo
Electrification and the Democratic Republic of the CongoElectrification and the Democratic Republic of the Congo
Electrification and the Democratic Republic of the Congo
 
Uber-like Models for the Electrical Industry
Uber-like Models for the Electrical IndustryUber-like Models for the Electrical Industry
Uber-like Models for the Electrical Industry
 
Analyse du projet de décret relatif à la méthodologie tarifaire applicable a...
Analyse du projet de décret relatif à  la méthodologie tarifaire applicable a...Analyse du projet de décret relatif à  la méthodologie tarifaire applicable a...
Analyse du projet de décret relatif à la méthodologie tarifaire applicable a...
 
Batteries and disrupting business models for the energy sector
Batteries and disrupting business models for the energy sectorBatteries and disrupting business models for the energy sector
Batteries and disrupting business models for the energy sector
 
La transition énergétique, l'affaire de tous
La transition énergétique, l'affaire de tousLa transition énergétique, l'affaire de tous
La transition énergétique, l'affaire de tous
 
Capacity mechanisms for improving security of supply: quick fixes or thoughtf...
Capacity mechanisms for improving security of supply: quick fixes or thoughtf...Capacity mechanisms for improving security of supply: quick fixes or thoughtf...
Capacity mechanisms for improving security of supply: quick fixes or thoughtf...
 

Recently uploaded

Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AIabhishek36461
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxPoojaBan
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.eptoze12
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLDeelipZope
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2RajaP95
 
microprocessor 8085 and its interfacing
microprocessor 8085  and its interfacingmicroprocessor 8085  and its interfacing
microprocessor 8085 and its interfacingjaychoudhary37
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSCAESB
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
 

Recently uploaded (20)

Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AI
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptx
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCL
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
 
microprocessor 8085 and its interfacing
microprocessor 8085  and its interfacingmicroprocessor 8085  and its interfacing
microprocessor 8085 and its interfacing
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentation
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
 

Reinforcement learning for electrical markets and the energy transition

  • 1.
  • 2. Generation company Wholesale market/transmission grid Generation company Generation company Generation company Retailer Retailer Retailer Large consumers Large prosumers Consumers Consumers Consumers Prosumers Dominant model in Europe for more than ten years. When you buy electricity, you buy a quantity of electrical energy that will be delivered over a specific time period. In the EU, this is typically 15 or 30 minutes. The current market structure is evolving with new players appearing (renewable energy communities, etc.) and consumer-centric models becoming popular. Energy sales The retail model for electricity markets
  • 3. Electricity markets – Overview Forward/Future market Day-ahead market Intraday market Balancing market Ancillary services Delivery Year Y Quarter Q Month M Day D Time t Y – 6 years D - 1 00:00 D - 1 12:00 D - 1 16:00 t – 15 min 3 Y – a few days Q - a few days M - a few days Regulation market (balancing) Note: time at which the markets are closing may change from one country to another.
  • 4. Prices for monthly, quarterly and yearly products on the forward markets: an example
  • 5. Why deploy RL for interacting with electricity markets? Reinforcement learning (RL) refers to a set of techniques that enable one to solve a complex stochastic sequential optimal decision-making problem solely from the information generated through interaction with the environment defining the dynamics of the problem and its reward function. Reason #1: RL techniques target the maximization of a numerical signal (monetisation of the RL agent behaviour) which is convenient when you want to maximize profits. Reason #2: Retailers, generators and consumers interacting with electricity markets are interacting with a sequence of markets. In these markets, they must buy/sell the exact amount of power they consume/produce for every market period or be exposed to the balancing market. The many loads/generation devices/batteries whose flexibility can be exploited to maximize gains also lead to a time-coupling of the decisions. Hence the problem is naturally sequential. Reason #3: Problems are highly stochastic and it is often very difficult to model the environment in an appropriate way. Ambitious research objective: To design an RL agent able to simultaneously interact near- optimally with all these markets, whatever the player (producer, retailer, prosumer, etc.). 5
  • 6. RL for interacting with the continuous intraday market The continuous intraday market (CID) allows market participants to adjust their positions through bilateral contracts. It is an opportunity for producers and consumers to make last- minute adjustments and balance their positions closer to real time. This electricity market authorises continuous trading, meaning that a trade is executed as soon as two orders match. We focus on the problem of an RL agent that controls a battery while interacting at the same time with the intraday market to maximize its profits. The agent has a constraint to be balanced for every market period. That leads to a decoupling with the balancing market. Based on the following paper: A Deep Reinforcement Learning Framework for Continuous Intraday Market Bidding BOUKAS, IOANNIS; ERNST, DAMIEN; THÉATE, THIBAUT; BOLLAND, ADRIEN et al. In Machine Learning (2021) 6
  • 7. CID Market Design We assume that the agent can only grab (partially or totally) orders in the market or change the setting of the battery at the beginning of every market period.
  • 9. The learning agent that controls the battery and determines the interaction with the CID market. State: (i) the battery state of charge (ii) everything you know about the market – knowledge assumed to be limited to the contents of the order book. Reward: the money made during the market period. Action: the power generated by the battery for the next market period and the market orders that have been grabbed. + the energy market RL scheme applied to CID trading with a battery
  • 10. Reducing the complexity of the problem State-space: comprised of what exists between the space of all possible order books and the set of possible states for the battery. Solution: reduction of the order book by representing it through simple features. Action space: Number of actions at one instant equal to the number of elements in the order book plus one, and the power setting for the battery. Solution: defining two high-level actions namely, Idle or Trade. The Trade action grabs orders and computes all future settings of the battery to maximize profits under the assumption that no new orders will be posted in the order book and that the system should be balanced all the time. Adversarial setting: When you play with markets, you are essentially in an adversarial setting, but we have used an RL algorithm designed for a non- adversarial setting for training our agent. This is a reasonable solution if you are not a too big a player in the market. 10
  • 11. The way the learning has been done Trajectories have been generated by an RL agent interacting with the ‘training part’ of the order book and the battery. An Epsilon-greedy policy is used. The Fitted Q Iteration (FQI) algorithm has been used for extracting high-performing policies from the trajectories. Due to the finite time setting of the problem, one Q- function has been learned for each time step t. Approximation architecture made of one layer of LSTM neurons and five fully connected feedd-forward layers. The quality of the policy is assessed on the ‘testing parts’ of the order book. A comparison is done with the so-called rolling intrinsic (RI) policy that applies the Trade action at every instant t. 11
  • 12. FQI policy RI policy V is the profit per testing day and r is the probitability ratio with respect to the RI policy. Testing would reflect the exact real-life performances of the FQI policy if its behavior would indeed not influence the trading behavior of other agents.
  • 13. Renewable Energy Communities The new Renewable Energy Directive 2018/2001 introduces the concept of Renewable Energy Communities (REC) An REC is a goup of members connected to the main utility grid (often the distribution network) who consume electricity and/or produce renewable electricity. Batteries and/or flexible loads may be present in the REC. Periodically, surplus electricity production is reallocated by the REC manager to the consumers through a system of repartition keys. These determine the fraction of the surplus of electricity production to be reallocated to each member. They define the way electricity is shared among the different members. 13
  • 14. Each member of the REC is associated with a consumption meter (C) and a production meter (P). At each discrete time step t, these meters are incremented by the consumption and the production of each member during the time interval (t; t + 1], respectively. These meters are monitored at each end of a metering period by an Energy Management System (EMS) that controls the flexibility devices. The EMS computes optimal repartition keys with the values read from all the meters of the REC members and sends them to their respective retailers. Retailers periodically (e.g., monthly) compute the electricity bills for each REC member based on the repartition keys and sends them to their respective customers.
  • 15. Controlling the REC The problem of controlling the REC to maximize gains can be formalized as an optimal- sequential decision-making problem. More information about this rather difficult formalization: Optimal Control of Renewable Energy Communities with Controllable Assets AITTAHAR, SAMY; MANUEL DE VILLENA MILLAN, MIGUEL; CASTRONOVO, MICHAËL; BOUKAS, IOANNIS et al. In E-print/Working paper (in press) It is very important to jointly optimize the repartition keys and the controllable assets. Reinforcement learning techniques combined with load/generation prediction techniques can be used.
  • 16. Sizing complex energy systems Sizing an energy system refers to the process of computation of the investments that will maximize benefits when the system is operated in a (near-)optimal way. It has become more and more difficult to size modern energy systems such as a Renewable Energy Community because of the complexity of the environment they are associated with (different types of markets, dynamic grid fees, new devices such as e.g. (autonomous) electrical vehicles, power-to-X devices appearing, etc.). The solution we propose: the use of newly developed reinforcement learning algorithms for jointly optimizing the environment and the control policies.
  • 17. Jointly designing and controlling a system There are two paradigms for solving such problems: (i) Mathematical programming → Does not account properly for stochasticity, non-linearities, etc. (ii) Sample-based (RL) techniques → Do not scale well to complex problems. Example the Joint Optimization of Design and Control (JODC) algorithm. We propose a hybrid algorithm called Direct Environment and Policy Search (DEPS). The algorithm lies in between reinforcement learning and model- based optimization. It combines sample-based techniques with the computation of derivatives of the dynamics and the reward function for efficient solution search. More information: Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent BOLLAND, ADRIEN; BOUKAS, IOANNIS; BERGER, MATHIAS; ERNST, DAMIEN. To be published in JAIR (2021)
  • 18. Environment Reward 𝑟𝑡 = 𝜌𝜓 𝑠𝑡, 𝑎𝑡, 𝜉𝑡 Make a decision (action) Dynamics 𝑠𝑡+1 = 𝑓𝜓 𝑠𝑡, 𝑎𝑡, 𝜉𝑡 Observe the state of the system and the reward Control agent Design agent Direct Environment and Policy Search (DEPS)
  • 19. 1. Start with random environment parameters 𝜓 . 2. Define a control agent whose policy is modelled using a Deep Neural Network with 𝜃 as parameters. 3. Initialise the parameters 𝜃 at random 4. Repeat iteratively 1. Simulate trajectories in the system with the control agent 2. From the trajectories, the equations of the system dynamics and of the reward function compute the gradients V(𝜓, 𝜃) = 𝐸 σ 𝑟𝑡 with respect to 𝜃 and 𝜓. 3. Update the parameters: 𝜃 ← 𝜃 + 𝛼 𝛻𝜃V(𝜓, 𝜃) 𝜓 ← 𝜓 + 𝛼 𝛻𝜓V(𝜓, 𝜃)
  • 20. Design choices: relates to the installed capacity of PV panels, the battery capacity and the maximal output of the diesel generator. Control policy: sets the power output of the battery and the generator. Objective: minimizing the sum of the investment and operational costs. Joint design and control of a microgrid Results: DEPS outperforms the baseline JODC. It gives better overall performance and is a more stable algorithm.
  • 21. Joint design and control of a drone Design choices: the morphology of the drone defined by the parameters H, W, D and R. Control policy: sets the speed of the four propellers. Objective: flying as accurately as possible on an ellipse. Results:
  • 22. Let us not forget the electrical grid! Up to now we have focused on control/design problems relevant for agents interacting with the electrical grid and energy markets. However, grid operators are themselves facing complex control problems related to energy transition and modifications of the control structure of the electrical grid! Reinforcement learning could help grid operators to design efficient control strategies.
  • 23. The grid before the quest for renewable energy Distribution Network (DN) Transmission Network (TN) DN DN Advanced control strategies implemented Fit and forget doctrine
  • 24. Direct Current (HVDC) DN TN DN DN DN DN TN Backbone of the SuperGrid (or future Global Grid) comprising HVDC links Need for advanced control strategies Need for advanced control strategies, often called active network management (ANMs) schemes Need for more-advanced control strategies The grid as it is becoming
  • 25. Gym-ANM Observation: Decision-making problems related to the electrical grid have drawn less attention from the RL community than other fields over the last decade (e.g., video games, robotics, autonomous driving). We believe this is because: 1. RL research relies heavily on the availability of software-based simulators to model the real world during training. 2. Power systems are hard to simulate without extensive knowledge in power system engineering. 3. This makes it difficult for RL researchers to start working on power system decision- making problems. Our solution: Provide an open-source software library (Gym-ANM) with which: 1. Power system experts can design new environments (tasks) that model Active Network Management (ANM) problems. 2. RL researchers can train and evaluate the performance of their agents on existing tasks (created in 1.). More information: GYM-ANM: Reinforcement learning environments for active network management tasks in electricity distribution systems HENRY, ROBIN; ERNST, DAMIEN in Energy and AI (2021), 5
  • 26. Key Features of Gym-ANM The environments (tasks) constructed with Gym-ANM follow the OpenAI Gym framework, which a major portion of the RL community is already using. The different customisable components of Gym-ANM makes it suitable for modelling a wide range of ANM tasks: ➢ Example: simple ones for educational purposes. ➢ Example: complex ones for advanced research. Once designed, each environment can be used as a “blackbox” → RL researchers need not be concerned with its implementation details.
  • 27. Video example Video also available on Youtube: https://youtu.be/D8kGH94kavY
  • 28. References A Deep Reinforcement Learning Framework for Continuous Intraday Market Bidding BOUKAS, IOANNIS; ERNST, DAMIEN; THÉATE, THIBAUT; BOLLAND, ADRIEN et al. In Machine Learning (2021) Optimal Control of Renewable Energy Communities with Controllable Assets AITTAHAR, SAMY; MANUEL DE VILLENA MILLAN, MIGUEL; CASTRONOVO, MICHAËL; BOUKAS, IOANNIS et al. E-print/Working paper (in press) Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent BOLLAND, ADRIEN; BOUKAS, IOANNIS; BERGER, MATHIAS; ERNST, DAMIEN. To be published in JAIR (2021) GYM-ANM: Reinforcement learning environments for active network management tasks in electricity distribution systems HENRY, ROBIN; ERNST, DAMIEN. In Energy and AI (2021), 5 GitHub repository (3K+ downloads after 6 months of release) for Gym-ANM: https://github.com/robinhenry/gym-anm and the online documentation: https://gym- anm.readthedocs.io/en/latest/index.html