SlideShare a Scribd company logo
Optimal real-time landing
using deep networks
Carlos Sánchez and Dario Izzo
Advanced Concepts Team (ESA)
Daniel Hennes
Robotics Innovation Center (DFKI)
Learning the optimal state-feedback using deep networks
Falcon-9 Landing Simulation
An on-board real-time optimal control system
Goal is to make
• The mathematical model of a system usually leads to a system of equations
describing the nature of the interaction of the system.
• These equations are commonly known as governing laws or model
equations of the system.
• The model equations can be :
time independent  steady-state model equations
time dependent  dynamic model equations
In this course, we are mainly interested in dynamical systems.
 Systems that evolve with time are known as dynamic systems.
Dynamic Systems
Examples of Dynamic models – RLC Circuit
• Linear Differential Equations
• Example RLC circuit (Ohm‘s and Kirchhoff‘s Laws)
𝑥 = 𝐴𝑥 + 𝐵𝑢
• Nonlinear Differential equations : general cases
𝑥 = 𝑓(𝑥 𝑡 , 𝑢 𝑡 , 𝑡)
In mathematical systems theory, simulation is done by solving the
governing equations of the system for various input scenarios.
 This requires algorithms corresponding to the type of
systems model equation.
 Numerical methods for the solution of systems of
equations and differential equations.
Simulation ...
• A system with degrees of freedom can be always
manipulated to display certain useful behavior.
• Manipulation  possibility to control
• Control variables are usually systems degrees of freedom.
We ask:
What is the best control strategy that forces a system to
display required characterstics, output, follow a trajectory,
etc ?
 Optimal Control
 Methods of Numerical Optimization
Optimization of Dynamic Systems
Ex. : Optimal Control of Landing
𝑥1
𝑥1
𝑠
, 𝑥2
𝑠
Model Equations:
Objectives of the optimal control :
• Minimization of the error :
• Minimization of energy : 0
𝑡
𝑢 𝑡 𝑑𝑡
Ex. : Optimal Control of Landing
(𝑥1
𝑠
−𝑥1(𝑡)), (𝑥2
𝑠
−𝑥2(𝑡)) Cost @ terminal
Integration of cost rate
during flight
Problem formulation :
Cost function :
Model (state )
equations :
Initial states:
Desired final states:
How to solve the above optimal control problem in order to achieve
the desired goal ? That is, how to determine the optimal trajectories
𝑥1
∗
(t), 𝑥2
∗
(𝑡) that provide a minimum energy consumption 0
𝑡
𝑢 𝑡 𝑑𝑡 so
that the rocket can be halted at the desired position?
Optimization of Dynamic Systems
Regard simplified optimal control problem:
Cost rate @ t
Optimal feedback control for state x at time t is obtained from
Hamilton-Jacobi-Bellman equationoptimal control policy
a set of extremely challenging partial differential equations

Impossible to implement an Onboard Real Time Controller
Continuous Time Optimal Control
1. Pre-compute many optimal trajectories
2. Train an artificial neural network to approximate the optimal behaviour
3. Use the network to drive the spacecraft
Proposed Approach
Learning the optimal control policy using deep networks
Landing models
Training data generation
• The training data is generated using the Hermite-Simpson transcription and a
non-linear programming (NLP) solver
• The initial state of each trajectory is randomly selected from a training area
• 150,000 trajectories are generated for each one of the problems
(9,000,000 - 15,000,000 data points)
Approximate state-action with a DNN
Approximate state-action with a DNN
Approximate state-action with a DNN
Generalization ?
• Successful landings from states outside of the training initial conditions
• This suggest that an approximation to the solution of the HJB equations is
learned
Watch the video clip
https://youtu.be/8sDldAK4O0E
• Deep networks trained with large datasets successfully approximate
the optimal control even for regions out of the training data
• DNNs can be used as an on-board reactive control system, while
current methods fail to compute optimal trajectories in an efficient way
Conclusions

More Related Content

What's hot

Chap 8. Optimization for training deep models
Chap 8. Optimization for training deep modelsChap 8. Optimization for training deep models
Chap 8. Optimization for training deep models
Young-Geun Choi
 
Firefly exact MCMC for Big Data
Firefly exact MCMC for Big DataFirefly exact MCMC for Big Data
Firefly exact MCMC for Big Data
Gianvito Siciliano
 
Overview on Optimization algorithms in Deep Learning
Overview on Optimization algorithms in Deep LearningOverview on Optimization algorithms in Deep Learning
Overview on Optimization algorithms in Deep Learning
Khang Pham
 
Particle Filters and Applications in Computer Vision
Particle Filters and Applications in Computer VisionParticle Filters and Applications in Computer Vision
Particle Filters and Applications in Computer Visionzukun
 
CS8451 - Design and Analysis of Algorithms
CS8451 - Design and Analysis of AlgorithmsCS8451 - Design and Analysis of Algorithms
CS8451 - Design and Analysis of Algorithms
Krishnan MuthuManickam
 
Distributed Deep Q-Learning
Distributed Deep Q-LearningDistributed Deep Q-Learning
Distributed Deep Q-Learning
Lyft
 
Multilayer perceptron
Multilayer perceptronMultilayer perceptron
Multilayer perceptron
omaraldabash
 
Coordinate Descent method
Coordinate Descent methodCoordinate Descent method
Coordinate Descent method
Sanghyuk Chun
 
A note on word embedding
A note on word embeddingA note on word embedding
A note on word embedding
Khang Pham
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
The Statistical and Applied Mathematical Sciences Institute
 
Paper Study: A learning based iterative method for solving vehicle routing
Paper Study: A learning based iterative method for solving vehicle routingPaper Study: A learning based iterative method for solving vehicle routing
Paper Study: A learning based iterative method for solving vehicle routing
ChenYiHuang5
 
Multi Layer Network
Multi Layer NetworkMulti Layer Network
Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...
Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...
Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...
Universitat Politècnica de Catalunya
 
An Optimal Iterative Algorithm for Extracting MUCs in a Black-box Constraint ...
An Optimal Iterative Algorithm for Extracting MUCs in a Black-box Constraint ...An Optimal Iterative Algorithm for Extracting MUCs in a Black-box Constraint ...
An Optimal Iterative Algorithm for Extracting MUCs in a Black-box Constraint ...
Philippe Laborie
 
ddpg seminar
ddpg seminarddpg seminar
ddpg seminar
민재 정
 
Safety Verification of Deep Neural Networks_.pdf
Safety Verification of Deep Neural Networks_.pdfSafety Verification of Deep Neural Networks_.pdf
Safety Verification of Deep Neural Networks_.pdf
Polytechnique Montréal
 
Gradient descent optimizer
Gradient descent optimizerGradient descent optimizer
Gradient descent optimizer
Hojin Yang
 

What's hot (20)

Chap 8. Optimization for training deep models
Chap 8. Optimization for training deep modelsChap 8. Optimization for training deep models
Chap 8. Optimization for training deep models
 
Firefly exact MCMC for Big Data
Firefly exact MCMC for Big DataFirefly exact MCMC for Big Data
Firefly exact MCMC for Big Data
 
Overview on Optimization algorithms in Deep Learning
Overview on Optimization algorithms in Deep LearningOverview on Optimization algorithms in Deep Learning
Overview on Optimization algorithms in Deep Learning
 
Particle Filters and Applications in Computer Vision
Particle Filters and Applications in Computer VisionParticle Filters and Applications in Computer Vision
Particle Filters and Applications in Computer Vision
 
CS8451 - Design and Analysis of Algorithms
CS8451 - Design and Analysis of AlgorithmsCS8451 - Design and Analysis of Algorithms
CS8451 - Design and Analysis of Algorithms
 
Distributed Deep Q-Learning
Distributed Deep Q-LearningDistributed Deep Q-Learning
Distributed Deep Q-Learning
 
Multilayer perceptron
Multilayer perceptronMultilayer perceptron
Multilayer perceptron
 
Coordinate Descent method
Coordinate Descent methodCoordinate Descent method
Coordinate Descent method
 
A note on word embedding
A note on word embeddingA note on word embedding
A note on word embedding
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
 
Paper Study: A learning based iterative method for solving vehicle routing
Paper Study: A learning based iterative method for solving vehicle routingPaper Study: A learning based iterative method for solving vehicle routing
Paper Study: A learning based iterative method for solving vehicle routing
 
Goprez sg
Goprez  sgGoprez  sg
Goprez sg
 
parallel
parallelparallel
parallel
 
Multi Layer Network
Multi Layer NetworkMulti Layer Network
Multi Layer Network
 
Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...
Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...
Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...
 
An Optimal Iterative Algorithm for Extracting MUCs in a Black-box Constraint ...
An Optimal Iterative Algorithm for Extracting MUCs in a Black-box Constraint ...An Optimal Iterative Algorithm for Extracting MUCs in a Black-box Constraint ...
An Optimal Iterative Algorithm for Extracting MUCs in a Black-box Constraint ...
 
ddpg seminar
ddpg seminarddpg seminar
ddpg seminar
 
Safety Verification of Deep Neural Networks_.pdf
Safety Verification of Deep Neural Networks_.pdfSafety Verification of Deep Neural Networks_.pdf
Safety Verification of Deep Neural Networks_.pdf
 
Gradient descent optimizer
Gradient descent optimizerGradient descent optimizer
Gradient descent optimizer
 
Bioalgo 2012-04-hmm
Bioalgo 2012-04-hmmBioalgo 2012-04-hmm
Bioalgo 2012-04-hmm
 

Similar to Optimal real-time landing using DNN

Modern Control - Lec 02 - Mathematical Modeling of Systems
Modern Control - Lec 02 - Mathematical Modeling of SystemsModern Control - Lec 02 - Mathematical Modeling of Systems
Modern Control - Lec 02 - Mathematical Modeling of Systems
Amr E. Mohamed
 
COEN507 introduction to linear time invariant.pptx
COEN507 introduction to linear time invariant.pptxCOEN507 introduction to linear time invariant.pptx
COEN507 introduction to linear time invariant.pptx
MustaphaAbdullahi35
 
Introduction to mathematical control theory - Dr. Purnima Pandit
Introduction to mathematical control theory - Dr. Purnima PanditIntroduction to mathematical control theory - Dr. Purnima Pandit
Introduction to mathematical control theory - Dr. Purnima Pandit
Purnima Pandit
 
new approach fro reduced ordee modelling of fractional order systems in delts...
new approach fro reduced ordee modelling of fractional order systems in delts...new approach fro reduced ordee modelling of fractional order systems in delts...
new approach fro reduced ordee modelling of fractional order systems in delts...
VivekKumar265461
 
Av 738-Adaptive Filters - Extended Kalman Filter
Av 738-Adaptive Filters - Extended Kalman FilterAv 738-Adaptive Filters - Extended Kalman Filter
Av 738-Adaptive Filters - Extended Kalman Filter
Dr. Bilal Siddiqui, C.Eng., MIMechE, FRAeS
 
State space analysis.pptx
State space analysis.pptxState space analysis.pptx
State space analysis.pptx
RaviMuthamala1
 
CHAC Algorithm ECAL'07 Presentation
CHAC Algorithm ECAL'07 PresentationCHAC Algorithm ECAL'07 Presentation
CHAC Algorithm ECAL'07 Presentation
Antonio Mora
 
linear algebra in control systems
linear algebra in control systemslinear algebra in control systems
linear algebra in control systemsGanesh Bhat
 
Two dimensional-cellular-automata
Two dimensional-cellular-automataTwo dimensional-cellular-automata
Two dimensional-cellular-automata
Mazharul Shaik
 
What is the likely future of real-time transient stability?
What is the likely future of real-time transient stability?What is the likely future of real-time transient stability?
What is the likely future of real-time transient stability?Université de Liège (ULg)
 
01-system_models_slides.pdf
01-system_models_slides.pdf01-system_models_slides.pdf
01-system_models_slides.pdf
MammarSoulimane
 
Tracking and control problem of an aircraft
Tracking and control problem of an aircraftTracking and control problem of an aircraft
Tracking and control problem of an aircraft
ANSUMAN MISHRA
 
Trajectory Transformer.pptx
Trajectory Transformer.pptxTrajectory Transformer.pptx
Trajectory Transformer.pptx
Seungeon Baek
 
02 physical.system.modelling mechanical.systems.
02 physical.system.modelling mechanical.systems.02 physical.system.modelling mechanical.systems.
02 physical.system.modelling mechanical.systems.
Mahmoud Hussein
 
Cracking the Code: Using Quantitative Models in MATLAB to Solve Problems
Cracking the Code: Using Quantitative Models in MATLAB to Solve ProblemsCracking the Code: Using Quantitative Models in MATLAB to Solve Problems
Cracking the Code: Using Quantitative Models in MATLAB to Solve ProblemsSERC at Carleton College
 
hCHAC Lambda (NICSO 2010)
hCHAC Lambda (NICSO 2010)hCHAC Lambda (NICSO 2010)
hCHAC Lambda (NICSO 2010)
Antonio Mora
 
Power System Dynamics & Stability Overview & Electromagnetic Transients
Power System Dynamics & Stability Overview  &  Electromagnetic TransientsPower System Dynamics & Stability Overview  &  Electromagnetic Transients
Power System Dynamics & Stability Overview & Electromagnetic Transients
Power System Operation
 

Similar to Optimal real-time landing using DNN (20)

solver (1)
solver (1)solver (1)
solver (1)
 
Modern Control - Lec 02 - Mathematical Modeling of Systems
Modern Control - Lec 02 - Mathematical Modeling of SystemsModern Control - Lec 02 - Mathematical Modeling of Systems
Modern Control - Lec 02 - Mathematical Modeling of Systems
 
COEN507 introduction to linear time invariant.pptx
COEN507 introduction to linear time invariant.pptxCOEN507 introduction to linear time invariant.pptx
COEN507 introduction to linear time invariant.pptx
 
Introduction to mathematical control theory - Dr. Purnima Pandit
Introduction to mathematical control theory - Dr. Purnima PanditIntroduction to mathematical control theory - Dr. Purnima Pandit
Introduction to mathematical control theory - Dr. Purnima Pandit
 
new approach fro reduced ordee modelling of fractional order systems in delts...
new approach fro reduced ordee modelling of fractional order systems in delts...new approach fro reduced ordee modelling of fractional order systems in delts...
new approach fro reduced ordee modelling of fractional order systems in delts...
 
Av 738-Adaptive Filters - Extended Kalman Filter
Av 738-Adaptive Filters - Extended Kalman FilterAv 738-Adaptive Filters - Extended Kalman Filter
Av 738-Adaptive Filters - Extended Kalman Filter
 
State space analysis.pptx
State space analysis.pptxState space analysis.pptx
State space analysis.pptx
 
Kalman_filtering
Kalman_filteringKalman_filtering
Kalman_filtering
 
CHAC Algorithm ECAL'07 Presentation
CHAC Algorithm ECAL'07 PresentationCHAC Algorithm ECAL'07 Presentation
CHAC Algorithm ECAL'07 Presentation
 
linear algebra in control systems
linear algebra in control systemslinear algebra in control systems
linear algebra in control systems
 
Two dimensional-cellular-automata
Two dimensional-cellular-automataTwo dimensional-cellular-automata
Two dimensional-cellular-automata
 
Unit1 pg math model
Unit1 pg math modelUnit1 pg math model
Unit1 pg math model
 
What is the likely future of real-time transient stability?
What is the likely future of real-time transient stability?What is the likely future of real-time transient stability?
What is the likely future of real-time transient stability?
 
01-system_models_slides.pdf
01-system_models_slides.pdf01-system_models_slides.pdf
01-system_models_slides.pdf
 
Tracking and control problem of an aircraft
Tracking and control problem of an aircraftTracking and control problem of an aircraft
Tracking and control problem of an aircraft
 
Trajectory Transformer.pptx
Trajectory Transformer.pptxTrajectory Transformer.pptx
Trajectory Transformer.pptx
 
02 physical.system.modelling mechanical.systems.
02 physical.system.modelling mechanical.systems.02 physical.system.modelling mechanical.systems.
02 physical.system.modelling mechanical.systems.
 
Cracking the Code: Using Quantitative Models in MATLAB to Solve Problems
Cracking the Code: Using Quantitative Models in MATLAB to Solve ProblemsCracking the Code: Using Quantitative Models in MATLAB to Solve Problems
Cracking the Code: Using Quantitative Models in MATLAB to Solve Problems
 
hCHAC Lambda (NICSO 2010)
hCHAC Lambda (NICSO 2010)hCHAC Lambda (NICSO 2010)
hCHAC Lambda (NICSO 2010)
 
Power System Dynamics & Stability Overview & Electromagnetic Transients
Power System Dynamics & Stability Overview  &  Electromagnetic TransientsPower System Dynamics & Stability Overview  &  Electromagnetic Transients
Power System Dynamics & Stability Overview & Electromagnetic Transients
 

More from 홍배 김

Automatic Gain Tuning based on Gaussian Process Global Optimization (= Bayesi...
Automatic Gain Tuning based on Gaussian Process Global Optimization (= Bayesi...Automatic Gain Tuning based on Gaussian Process Global Optimization (= Bayesi...
Automatic Gain Tuning based on Gaussian Process Global Optimization (= Bayesi...
홍배 김
 
Gaussian processing
Gaussian processingGaussian processing
Gaussian processing
홍배 김
 
Lecture Summary : Camera Projection
Lecture Summary : Camera Projection Lecture Summary : Camera Projection
Lecture Summary : Camera Projection
홍배 김
 
Learning agile and dynamic motor skills for legged robots
Learning agile and dynamic motor skills for legged robotsLearning agile and dynamic motor skills for legged robots
Learning agile and dynamic motor skills for legged robots
홍배 김
 
Robotics of Quadruped Robot
Robotics of Quadruped RobotRobotics of Quadruped Robot
Robotics of Quadruped Robot
홍배 김
 
Recurrent Neural Net의 이론과 설명
Recurrent Neural Net의 이론과 설명Recurrent Neural Net의 이론과 설명
Recurrent Neural Net의 이론과 설명
홍배 김
 
Convolutional neural networks 이론과 응용
Convolutional neural networks 이론과 응용Convolutional neural networks 이론과 응용
Convolutional neural networks 이론과 응용
홍배 김
 
Anomaly detection using deep one class classifier
Anomaly detection using deep one class classifierAnomaly detection using deep one class classifier
Anomaly detection using deep one class classifier
홍배 김
 
The world of loss function
The world of loss functionThe world of loss function
The world of loss function
홍배 김
 
Machine learning applications in aerospace domain
Machine learning applications in aerospace domainMachine learning applications in aerospace domain
Machine learning applications in aerospace domain
홍배 김
 
Anomaly Detection and Localization Using GAN and One-Class Classifier
Anomaly Detection and Localization  Using GAN and One-Class ClassifierAnomaly Detection and Localization  Using GAN and One-Class Classifier
Anomaly Detection and Localization Using GAN and One-Class Classifier
홍배 김
 
Anomaly Detection with GANs
Anomaly Detection with GANsAnomaly Detection with GANs
Anomaly Detection with GANs
홍배 김
 
Focal loss의 응용(Detection & Classification)
Focal loss의 응용(Detection & Classification)Focal loss의 응용(Detection & Classification)
Focal loss의 응용(Detection & Classification)
홍배 김
 
Convolution 종류 설명
Convolution 종류 설명Convolution 종류 설명
Convolution 종류 설명
홍배 김
 
Learning by association
Learning by associationLearning by association
Learning by association
홍배 김
 
알기쉬운 Variational autoencoder
알기쉬운 Variational autoencoder알기쉬운 Variational autoencoder
알기쉬운 Variational autoencoder
홍배 김
 
Binarized CNN on FPGA
Binarized CNN on FPGABinarized CNN on FPGA
Binarized CNN on FPGA
홍배 김
 
Visualizing data using t-SNE
Visualizing data using t-SNEVisualizing data using t-SNE
Visualizing data using t-SNE
홍배 김
 
Normalization 방법
Normalization 방법 Normalization 방법
Normalization 방법
홍배 김
 
Learning to remember rare events
Learning to remember rare eventsLearning to remember rare events
Learning to remember rare events
홍배 김
 

More from 홍배 김 (20)

Automatic Gain Tuning based on Gaussian Process Global Optimization (= Bayesi...
Automatic Gain Tuning based on Gaussian Process Global Optimization (= Bayesi...Automatic Gain Tuning based on Gaussian Process Global Optimization (= Bayesi...
Automatic Gain Tuning based on Gaussian Process Global Optimization (= Bayesi...
 
Gaussian processing
Gaussian processingGaussian processing
Gaussian processing
 
Lecture Summary : Camera Projection
Lecture Summary : Camera Projection Lecture Summary : Camera Projection
Lecture Summary : Camera Projection
 
Learning agile and dynamic motor skills for legged robots
Learning agile and dynamic motor skills for legged robotsLearning agile and dynamic motor skills for legged robots
Learning agile and dynamic motor skills for legged robots
 
Robotics of Quadruped Robot
Robotics of Quadruped RobotRobotics of Quadruped Robot
Robotics of Quadruped Robot
 
Recurrent Neural Net의 이론과 설명
Recurrent Neural Net의 이론과 설명Recurrent Neural Net의 이론과 설명
Recurrent Neural Net의 이론과 설명
 
Convolutional neural networks 이론과 응용
Convolutional neural networks 이론과 응용Convolutional neural networks 이론과 응용
Convolutional neural networks 이론과 응용
 
Anomaly detection using deep one class classifier
Anomaly detection using deep one class classifierAnomaly detection using deep one class classifier
Anomaly detection using deep one class classifier
 
The world of loss function
The world of loss functionThe world of loss function
The world of loss function
 
Machine learning applications in aerospace domain
Machine learning applications in aerospace domainMachine learning applications in aerospace domain
Machine learning applications in aerospace domain
 
Anomaly Detection and Localization Using GAN and One-Class Classifier
Anomaly Detection and Localization  Using GAN and One-Class ClassifierAnomaly Detection and Localization  Using GAN and One-Class Classifier
Anomaly Detection and Localization Using GAN and One-Class Classifier
 
Anomaly Detection with GANs
Anomaly Detection with GANsAnomaly Detection with GANs
Anomaly Detection with GANs
 
Focal loss의 응용(Detection & Classification)
Focal loss의 응용(Detection & Classification)Focal loss의 응용(Detection & Classification)
Focal loss의 응용(Detection & Classification)
 
Convolution 종류 설명
Convolution 종류 설명Convolution 종류 설명
Convolution 종류 설명
 
Learning by association
Learning by associationLearning by association
Learning by association
 
알기쉬운 Variational autoencoder
알기쉬운 Variational autoencoder알기쉬운 Variational autoencoder
알기쉬운 Variational autoencoder
 
Binarized CNN on FPGA
Binarized CNN on FPGABinarized CNN on FPGA
Binarized CNN on FPGA
 
Visualizing data using t-SNE
Visualizing data using t-SNEVisualizing data using t-SNE
Visualizing data using t-SNE
 
Normalization 방법
Normalization 방법 Normalization 방법
Normalization 방법
 
Learning to remember rare events
Learning to remember rare eventsLearning to remember rare events
Learning to remember rare events
 

Recently uploaded

State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Jeffrey Haguewood
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Ramesh Iyer
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
Fwdays
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
Product School
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 

Recently uploaded (20)

State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 

Optimal real-time landing using DNN

  • 1. Optimal real-time landing using deep networks Carlos Sánchez and Dario Izzo Advanced Concepts Team (ESA) Daniel Hennes Robotics Innovation Center (DFKI) Learning the optimal state-feedback using deep networks
  • 3. An on-board real-time optimal control system Goal is to make
  • 4. • The mathematical model of a system usually leads to a system of equations describing the nature of the interaction of the system. • These equations are commonly known as governing laws or model equations of the system. • The model equations can be : time independent  steady-state model equations time dependent  dynamic model equations In this course, we are mainly interested in dynamical systems.  Systems that evolve with time are known as dynamic systems. Dynamic Systems
  • 5. Examples of Dynamic models – RLC Circuit • Linear Differential Equations • Example RLC circuit (Ohm‘s and Kirchhoff‘s Laws) 𝑥 = 𝐴𝑥 + 𝐵𝑢 • Nonlinear Differential equations : general cases 𝑥 = 𝑓(𝑥 𝑡 , 𝑢 𝑡 , 𝑡)
  • 6. In mathematical systems theory, simulation is done by solving the governing equations of the system for various input scenarios.  This requires algorithms corresponding to the type of systems model equation.  Numerical methods for the solution of systems of equations and differential equations. Simulation ...
  • 7. • A system with degrees of freedom can be always manipulated to display certain useful behavior. • Manipulation  possibility to control • Control variables are usually systems degrees of freedom. We ask: What is the best control strategy that forces a system to display required characterstics, output, follow a trajectory, etc ?  Optimal Control  Methods of Numerical Optimization Optimization of Dynamic Systems
  • 8. Ex. : Optimal Control of Landing 𝑥1 𝑥1 𝑠 , 𝑥2 𝑠 Model Equations:
  • 9. Objectives of the optimal control : • Minimization of the error : • Minimization of energy : 0 𝑡 𝑢 𝑡 𝑑𝑡 Ex. : Optimal Control of Landing (𝑥1 𝑠 −𝑥1(𝑡)), (𝑥2 𝑠 −𝑥2(𝑡)) Cost @ terminal Integration of cost rate during flight
  • 10. Problem formulation : Cost function : Model (state ) equations : Initial states: Desired final states: How to solve the above optimal control problem in order to achieve the desired goal ? That is, how to determine the optimal trajectories 𝑥1 ∗ (t), 𝑥2 ∗ (𝑡) that provide a minimum energy consumption 0 𝑡 𝑢 𝑡 𝑑𝑡 so that the rocket can be halted at the desired position?
  • 11. Optimization of Dynamic Systems Regard simplified optimal control problem: Cost rate @ t
  • 12. Optimal feedback control for state x at time t is obtained from Hamilton-Jacobi-Bellman equationoptimal control policy a set of extremely challenging partial differential equations  Impossible to implement an Onboard Real Time Controller Continuous Time Optimal Control
  • 13. 1. Pre-compute many optimal trajectories 2. Train an artificial neural network to approximate the optimal behaviour 3. Use the network to drive the spacecraft Proposed Approach Learning the optimal control policy using deep networks
  • 15. Training data generation • The training data is generated using the Hermite-Simpson transcription and a non-linear programming (NLP) solver • The initial state of each trajectory is randomly selected from a training area • 150,000 trajectories are generated for each one of the problems (9,000,000 - 15,000,000 data points)
  • 19. Generalization ? • Successful landings from states outside of the training initial conditions • This suggest that an approximation to the solution of the HJB equations is learned
  • 20. Watch the video clip https://youtu.be/8sDldAK4O0E
  • 21. • Deep networks trained with large datasets successfully approximate the optimal control even for regions out of the training data • DNNs can be used as an on-board reactive control system, while current methods fail to compute optimal trajectories in an efficient way Conclusions