Reinforcement learning, energy systems and deep neural nets

This is a summary of my work over the last few years on power systems and reinforcement learning, including my most recent work on neuromodulation in reinforcement learning.

Prof. Damien ERNST

[Figure: the reinforcement learning agent and its environment. The agent receives a state and a reward (a numerical value) from the environment and sends back an action.]

Example: a battery controller
Agent: the battery controller.
Environment: the battery + the energy market.
State: (i) the battery level; (ii) everything you know about the market.
Reward: the money you make during the market period.
Action: the battery setting for the next market period.
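The agent-environment interaction for this battery example can be sketched as a short simulation loop. This is only an illustration under stated assumptions, not the controller from the talk: the class name `BatteryMarketEnv`, the price series, the capacity, the three-level action set, and the threshold policy are all hypothetical.

```python
class BatteryMarketEnv:
    """Toy environment: a battery trading on an energy market (hypothetical)."""

    def __init__(self, prices, capacity=4.0, power=1.0):
        self.prices = list(prices)   # one market price per period (assumed series)
        self.capacity = capacity     # maximum stored energy (assumed value)
        self.power = power           # energy moved per period (assumed value)
        self.reset()

    def reset(self):
        self.t = 0
        self.level = 0.0
        return self._state()

    def _state(self):
        # State: (i) the battery level, (ii) what is known about the market,
        # reduced here to the current price.
        return (self.level, self.prices[self.t])

    def step(self, action):
        """action: -1 = discharge (sell), 0 = idle, +1 = charge (buy)."""
        price = self.prices[self.t]
        # Clip the exchanged energy so the level stays within [0, capacity].
        new_level = min(self.capacity, max(0.0, self.level + action * self.power))
        delta = new_level - self.level
        self.level = new_level
        # Reward: money made during the market period (selling earns, buying costs).
        reward = -delta * price
        self.t += 1
        done = self.t >= len(self.prices)
        next_state = None if done else self._state()
        return next_state, reward, done


def run_episode(env, policy):
    """Run one pass over the price series and return the total reward."""
    state = env.reset()
    total, done = 0.0, False
    while not done:
        action = policy(state)
        state, reward, done = env.step(action)
        total += reward
    return total


def threshold_policy(state, low=30.0, high=50.0):
    # Hand-written baseline: buy when cheap, sell when expensive.
    _, price = state
    if price <= low:
        return 1
    if price >= high:
        return -1
    return 0


env = BatteryMarketEnv(prices=[20.0, 25.0, 60.0, 55.0, 30.0, 70.0])
profit = run_episode(env, threshold_policy)
```

A reinforcement learning agent would replace `threshold_policy` with a policy learned from the observed states and rewards.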

Table taken from: “Reinforcement Learning for Electric Power System Decision and Control: Past Considerations and Perspectives”. M. Glavic, R. Fonteneau and D. Ernst. Proceedings of the 20th IFAC World Congress.

Learning:
• Exploration/exploitation: do not always take the action believed to be optimal, so that exploration remains possible.
• Generalization: generalize the experience gained in some states to other states.
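The exploration/exploitation trade-off can be illustrated with an epsilon-greedy rule inside tabular Q-learning. This is a minimal sketch on a hypothetical five-state chain, not a method taken from the slides; the function names and all hyperparameters are assumptions. A table does not generalize across states, so the generalization point would require a function approximator (for example a neural network) in place of the table.

```python
import random


def epsilon_greedy(q_values, epsilon, rng):
    """With probability epsilon explore at random, otherwise act greedily."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    best = max(q_values)
    # Break ties at random so the untrained agent does not favour one action.
    return rng.choice([a for a, v in enumerate(q_values) if v == best])


def q_learning_chain(n_states=5, episodes=500, alpha=0.1, gamma=0.9,
                     epsilon=0.2, seed=0):
    """Tabular Q-learning on a toy chain: action 1 moves right, action 0 left.

    Reaching the right-most state gives reward 1 and ends the episode.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    for _ in range(episodes):
        s = 0
        for _ in range(100):  # cap on episode length
            a = epsilon_greedy(q[s], epsilon, rng)
            s_next = min(n_states - 1, s + 1) if a == 1 else max(0, s - 1)
            done = s_next == n_states - 1
            r = 1.0 if done else 0.0
            target = r if done else r + gamma * max(q[s_next])
            q[s][a] += alpha * (target - q[s][a])
            s = s_next
            if done:
                break
    return q


q_table = q_learning_chain()
# Greedy action in every non-terminal state after learning.
greedy_policy = [max(range(2), key=lambda a: row[a]) for row in q_table[:-1]]
```

With `epsilon = 0` the rule is purely greedy; a larger `epsilon` spends more time on actions that currently look suboptimal.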

[Figure: learning phase, followed by the effect of the resulting control policy.]
First control law for stabilizing power systems ever computed using reinforcement learning. More at: “Reinforcement Learning Versus Model Predictive Control: A Comparison on a Power System Problem”. D. Ernst, M. Glavic, F. Capitanescu, and L. Wehenkel. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), Vol. 39, No. 2, April 2009.

Reinforcement learning for trading in the intraday market
More: “Intra-day Bidding Strategies for Storage Devices Using Deep Reinforcement Learning”. I. Boukas, D. Ernst, A. Papavasiliou, and B. Cornélusse. Proceedings of the 2018 15th International Conference on the European Energy Market (EEM).
Complex problem:
• Adversarial environment
• High-dimensional
• Partially observable
Best results obtained by optimising strategies on past data and then using supervised learning to learn from the optimised strategies (an imitation-learning type of approach).
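The two-stage idea (optimise on past data, then learn from the optimised strategies) can be sketched as follows. The slides do not say which optimiser or learner was used, so everything here is a stand-in chosen for brevity: dynamic programming over integer battery levels plays the role of the hindsight optimiser, and a 1-nearest-neighbour rule plays the role of the supervised learner. The function names and the price series are hypothetical.

```python
def hindsight_optimal_actions(prices, capacity=2):
    """Dynamic programming with full knowledge of the price series.

    Returns the profit-maximising actions (-1 sell, 0 idle, +1 buy),
    starting from an empty battery, together with the resulting profit.
    """
    n = len(prices)
    # value[t][level]: best profit obtainable from period t onwards.
    value = [[0.0] * (capacity + 1) for _ in range(n + 1)]
    best = [[0] * (capacity + 1) for _ in range(n)]
    for t in range(n - 1, -1, -1):
        for level in range(capacity + 1):
            candidates = []
            for a in (-1, 0, 1):
                nxt = level + a
                if 0 <= nxt <= capacity:
                    candidates.append((-a * prices[t] + value[t + 1][nxt], a))
            value[t][level], best[t][level] = max(candidates)
    actions, level = [], 0
    for t in range(n):
        a = best[t][level]
        actions.append(a)
        level += a
    return actions, value[0][0]


def build_dataset(prices, actions):
    """Pair each visited state (level, price) with the optimised action."""
    examples, level = [], 0
    for price, a in zip(prices, actions):
        examples.append(((level, price), a))
        level += a
    return examples


def fit_nearest_neighbour(examples):
    """Supervised step: copy the action of the closest training state."""
    def policy(state):
        def sq_dist(example):
            (lvl, price), _ = example
            return (lvl - state[0]) ** 2 + (price - state[1]) ** 2
        return min(examples, key=sq_dist)[1]
    return policy


past_prices = [20.0, 25.0, 60.0, 55.0, 30.0, 70.0]
expert_actions, expert_profit = hindsight_optimal_actions(past_prices)
policy = fit_nearest_neighbour(build_dataset(past_prices, expert_actions))
```

The learned `policy` needs only the current state, whereas the optimiser needed the whole price series in advance; that is the point of the imitation step. A realistic version would use far more data and richer market features, and would have to check that imitated actions are feasible.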

“A critical present objective is to develop deep RL methods that can adapt rapidly to new tasks.” DeepMind, “Learning to reinforcement learn” (2016).

Walking: a meta-RL problem solved through synaptic plasticity and neuromodulation.
[Figure panels: synaptic plasticity; neuromodulation.]

Classical architecture for solving meta-RL problems:
Our new architecture:

Rectified Linear Unit (ReLU):
Saturated ReLU (sReLU):
Parametrized sReLU:
Gated Recurrent Unit (GRU):
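The formulas for these four units are not preserved in this text, so the definitions below are a hedged sketch rather than the talk's exact equations. ReLU is the standard definition. The saturation bounds of -1 and 1 and the scale-and-bias form of the parametrized sReLU are assumptions. The GRU step follows one common textbook convention, written for scalar inputs to stay short; all function and parameter names are hypothetical.

```python
import math


def relu(x):
    """Rectified Linear Unit: zero for negative inputs, identity otherwise."""
    return max(0.0, x)


def saturated_relu(x, low=-1.0, high=1.0):
    """Saturated ReLU: identity clipped to [low, high] (bounds are assumed)."""
    return min(high, max(low, x))


def parametrized_srelu(x, scale, bias, low=-1.0, high=1.0):
    """sReLU applied to an affine transform of the input (assumed form).

    In a neuromodulated network, scale and bias would be produced by
    another part of the network rather than fixed by hand.
    """
    return saturated_relu(scale * x + bias, low, high)


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def gru_step(x, h, p):
    """One scalar GRU update; p maps parameter names to floats."""
    z = sigmoid(p["wz"] * x + p["uz"] * h + p["bz"])               # update gate
    r = sigmoid(p["wr"] * x + p["ur"] * h + p["br"])               # reset gate
    h_tilde = math.tanh(p["wh"] * x + p["uh"] * (r * h) + p["bh"])  # candidate
    return (1.0 - z) * h + z * h_tilde


zero_params = dict.fromkeys(
    ["wz", "uz", "bz", "wr", "ur", "br", "wh", "uh", "bh"], 0.0)
```

With all parameters at zero, both gates equal 0.5 and the candidate state is 0, so `gru_step` halves the previous hidden state.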

More: “Introducing neuromodulation in deep neural networks to learn adaptive behaviours”. N. Vecoven, D. Ernst, A. Wehenkel and G. Drion. Download at: https://arxiv.org/abs/1812.09113
