SlideShare a Scribd company logo
1 of 25
Download to read offline
Application of Nonlinear Stochastic
Optimal Control for Genetic Algorithm
計画数理講座
丁 可
2014.02.14
Construction
1. Introduction
2. Overview of Genetic Algorithm
3. Nonlinear Stochastic Optimal Control
4. Application of NSOC for GA
5. Numerical Experiments
6. Conclusion
2
NSOC
GA
Application
3
Introduction
Background
u Nonlinear Stochastic Optimal Control
Foundation problem for control theory
u Genetic Algorithm
Derived from the principles of Darwinian natural
selection and evolution.
Previous research
uMarkov Chain Analysis on Simple Genetic Algorithms
Suzuki Joe (1995)
uLinearly-Solvable Markov Decision Problems
E. Todorov (2006)
4
Overview of Genetic Algorithms
0
1
1
0
	
0
1
0
1
1
0
0
1
1
0
1
1
1
1
1
1
	
1
1
1
1
1
1
1
1
1
1
1
1
Algorithm of GA
5
1. Obtain two parents by the selection function.
2. Produce their child by the crossover function
3. Mutate it by the mutation function.
4. Put the result into the next generation.
5. less than population members, go to Step 1.
l Optimization
l Automatic Programming
l Economic Models
l Social Systems Models
0
1
1
0
	
1
0
0
1
Nonlinear Stochastic Optimal
Control(NSCO)
Continuous Model
𝑥⃗ are given at an initial time t , the SCO problem is to find the
control path 𝑢 ' can minimized. the cost to go function is
continuous-time stochastic dynamics
d) = 𝑓 𝑋, 𝑢 𝑑/ + 𝐹 𝑥, 𝑢 𝑑2
optimal control law is
𝑢∗ = −𝜎6 𝐵 𝑥 8 𝜐)(𝑋)
6
𝐽(𝑥⃗,𝑡) = min
A
→
𝐶(𝑥⃗, 𝑡, 𝑢 ('))
𝑥 ∈ ℝFGstate vector
	𝑢 ∈ ℝFAcontrol	vector
	𝜔 ∈ ℝFPBrown	motion
Discrete Model
7
when we let 𝑝 𝑥T
𝑥, 𝑢 = 𝑢 𝑥T
𝑥
The bellman function of υ 𝑥 can write as
υ 𝑥 = min
V∈𝒰 X
{ℓ 𝑖, 𝑢 + EX]~_ ' 𝑥, 𝑢 𝜐(𝑥T
) }
optimal action will be
𝑢∗ 𝑥T 𝑥 =
𝑝 𝑥T 𝑥 𝓏 𝑥T
𝒢 𝓏 𝑥
𝓏 𝑥 = exp −𝑞 𝑥 𝒢 𝓏 𝑧 	 𝓏 = 𝑄𝑃𝓏
action cost of
current state
the value function
of next state
Application of NSOC for GA
Transition probability of GA
Selection
𝑆j 𝑥T
𝑝 =
𝑓 𝑥T j
∑ 𝑓 𝑥 j
X∈_
Crossover𝐶X] 𝑥l, 𝑥m = ∑
nopnqo
6r 𝛿 𝑥l ⊗ 𝑘 ⊕ 𝑘w ⊗ 𝑥m = 𝑥T
Mutation
8
𝑚j ℎ = 𝜇j
{|}
(1 − 𝜇j)~•{|}
β ∶ Blotzman		invese	T
𝜇j = ϵ𝑒𝑥𝑝 −𝜆𝛽
ℎ mutation	mask
	𝑘 Crossover	mask
𝑝 𝑥T 𝑥 = • 𝑆j 𝑦 𝑝
•
• 𝑚j 𝑥,ℎ{ 𝑚j(𝑦,ℎ6
}•,}‘
)	
			𝐶X] 𝑥l, 𝑥m •
𝜒r + 𝜒̅r
2
r
𝛿 (𝑥 ⊕ ℎ{)⊗ 𝑘 ⊕ 𝑘w ⊗ (𝑦 ⊕ ℎ6 = 𝑥T
Describe GA by NSC
let 𝑝 𝑥T 𝑥, 𝑢 = 𝑢 𝑥T 𝑥 = 𝑝 𝑥T 𝑥 𝑒𝑥𝑝 𝑢X];X
we can get eigenvalue function is
9
λ𝑧— = 𝒫 𝒩𝒩 𝑥 ℛ 𝒩𝒩 𝑥 𝑧 𝒩		(∀𝒙)
𝒫 𝒩𝒩(𝑥) =
𝑝 𝑥({) 𝑥 			𝑝 𝑥(6) 𝑥 		⋯ 		𝑝 𝑥(—) 𝑥
𝑝 𝑥({) 𝑥 				𝑝 𝑥(6) 𝑥 		⋯		𝑝 𝑥(—) 𝑥
				⋮																⋮									⋱									⋮	
𝑝 𝑥({) 𝑥 			𝑝 𝑥(6) 𝑥 			⋯		𝑝 𝑥(—) 𝑥
0
0
0
	
0
0
1
	
0
1
1
	
1
0
0
	
1
0
1
	
1
1
0
	
0
1
0
	
1
1
1
10
ℛ 𝒩𝒩(𝑥)
=
𝑒𝑥𝑝 (−𝑟(𝑥 { ; 𝑥) 															0																											 ⋯																		0				
										0																							𝑒𝑥𝑝 (−𝑟(𝑥 6 ; 𝑥) 										⋯																		0				
																				⋮																		⋮																																								⋱																			⋮				
								0																																			0																						𝑒𝑥𝑝 (−𝑟(𝑥 — ; 𝑥)
𝑧 𝒩 =
𝑧 𝑥{
𝑧 𝑥{
⋮
𝑧 𝑥—
																𝑧 𝑥(l) =
|{λ𝐼 𝒩𝒩 − 𝒫 𝒩𝒩ℛ 𝒩𝒩}l|
|λ𝐼 𝒩𝒩 − 𝒫 𝒩𝒩ℛ 𝒩𝒩|
𝑖 = 1,2,3, ⋯, 𝑁
Numerical Experiments
		
	β: Constant
ϵ =0.3
𝜆 =10
			𝜒 = 0.4
𝜇j = ϵ𝑒𝑥𝑝 −𝜆𝛽
Iteration no. 800
fitness function is to compute the number of 1,
obtained by individuals.
Initial conditions
12
we set 3 patterns to GA, all of which are
Constant function, Linear Function and
Exponential function.
where we list the result of transition probabilities
from state0 into other states. the result shows,
after the iteration of Numerical experiment,
individual is prefer to go to the state 7.
0
2
3
4
5 6
7
1
Introduce theory of GA
Introduce theory of NSOC
Conclusion
5
Overview of previous works
Across the discrete model of NSOC, we apply
it into GA , showed in numerical results.
Future Works
fitness function should
be change by complex
Compute speed should
be improved.
Apply by NSOC
Thank you for your kind attention
15
2
Process of GA
3
Nonlinear stochastic optimal control
the value
function of
current state
the value function
of current state
action cost of
current state
3
Nonlinear stochastic optimal control
3
Nonlinear stochastic optimal control
3
Transition Probabilities for GA
Logical operator Operators
Mutation
Crossover
Eigenvalue function
22
23
• L= 3, β= 𝑒¨/
• 𝜒 = 0.4
• λ =10
• µ = ϵ −λβ
3
Numerical experiment
Initial Condition
25

More Related Content

What's hot

Iterative methods
Iterative methodsIterative methods
Iterative methodsKt Silva
 
Interactives Methods
Interactives MethodsInteractives Methods
Interactives MethodsUIS
 
Wk 6 part 2 non linearites and non linearization april 05
Wk 6 part 2 non linearites and non linearization april 05Wk 6 part 2 non linearites and non linearization april 05
Wk 6 part 2 non linearites and non linearization april 05Charlton Inao
 
ACTIVE CONTROLLER DESIGN FOR THE HYBRID SYNCHRONIZATION OF HYPERCHAOTIC XU AN...
ACTIVE CONTROLLER DESIGN FOR THE HYBRID SYNCHRONIZATION OF HYPERCHAOTIC XU AN...ACTIVE CONTROLLER DESIGN FOR THE HYBRID SYNCHRONIZATION OF HYPERCHAOTIC XU AN...
ACTIVE CONTROLLER DESIGN FOR THE HYBRID SYNCHRONIZATION OF HYPERCHAOTIC XU AN...Zac Darcy
 
A New Approach to Design a Reduced Order Observer
A New Approach to Design a Reduced Order ObserverA New Approach to Design a Reduced Order Observer
A New Approach to Design a Reduced Order ObserverIJERD Editor
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
Jacobi iteration method
Jacobi iteration methodJacobi iteration method
Jacobi iteration methodMONIRUL ISLAM
 
Insertion Sort, Quick Sort And Their complexity
Insertion Sort, Quick Sort And Their complexityInsertion Sort, Quick Sort And Their complexity
Insertion Sort, Quick Sort And Their complexityMotaleb Hossen Manik
 
PC 1 continuity notes
PC 1 continuity notesPC 1 continuity notes
PC 1 continuity notesvhiggins1
 
Selection sort(sorting algorithm in data structure) and its time complexity
Selection sort(sorting algorithm in data structure) and its time complexitySelection sort(sorting algorithm in data structure) and its time complexity
Selection sort(sorting algorithm in data structure) and its time complexityComputer_ at_home
 
Projective and hybrid projective synchronization of 4-D hyperchaotic system v...
Projective and hybrid projective synchronization of 4-D hyperchaotic system v...Projective and hybrid projective synchronization of 4-D hyperchaotic system v...
Projective and hybrid projective synchronization of 4-D hyperchaotic system v...TELKOMNIKA JOURNAL
 
curve fitting or regression analysis-1.pptx
curve fitting or regression analysis-1.pptxcurve fitting or regression analysis-1.pptx
curve fitting or regression analysis-1.pptxabelmeketa
 
Sulpcegu5e ppt 2_3
Sulpcegu5e ppt 2_3Sulpcegu5e ppt 2_3
Sulpcegu5e ppt 2_3silvia
 
New approach for wolfe’s modified simplex method to solve quadratic programmi...
New approach for wolfe’s modified simplex method to solve quadratic programmi...New approach for wolfe’s modified simplex method to solve quadratic programmi...
New approach for wolfe’s modified simplex method to solve quadratic programmi...eSAT Journals
 

What's hot (19)

At35257260
At35257260At35257260
At35257260
 
13 weightedresidual
13 weightedresidual13 weightedresidual
13 weightedresidual
 
Iterative methods
Iterative methodsIterative methods
Iterative methods
 
Interactives Methods
Interactives MethodsInteractives Methods
Interactives Methods
 
Wk 6 part 2 non linearites and non linearization april 05
Wk 6 part 2 non linearites and non linearization april 05Wk 6 part 2 non linearites and non linearization april 05
Wk 6 part 2 non linearites and non linearization april 05
 
ACTIVE CONTROLLER DESIGN FOR THE HYBRID SYNCHRONIZATION OF HYPERCHAOTIC XU AN...
ACTIVE CONTROLLER DESIGN FOR THE HYBRID SYNCHRONIZATION OF HYPERCHAOTIC XU AN...ACTIVE CONTROLLER DESIGN FOR THE HYBRID SYNCHRONIZATION OF HYPERCHAOTIC XU AN...
ACTIVE CONTROLLER DESIGN FOR THE HYBRID SYNCHRONIZATION OF HYPERCHAOTIC XU AN...
 
A New Approach to Design a Reduced Order Observer
A New Approach to Design a Reduced Order ObserverA New Approach to Design a Reduced Order Observer
A New Approach to Design a Reduced Order Observer
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
Jacobi iteration method
Jacobi iteration methodJacobi iteration method
Jacobi iteration method
 
Insertion Sort, Quick Sort And Their complexity
Insertion Sort, Quick Sort And Their complexityInsertion Sort, Quick Sort And Their complexity
Insertion Sort, Quick Sort And Their complexity
 
Integral calculus
Integral calculusIntegral calculus
Integral calculus
 
PC 1 continuity notes
PC 1 continuity notesPC 1 continuity notes
PC 1 continuity notes
 
H023078082
H023078082H023078082
H023078082
 
algorithms
algorithmsalgorithms
algorithms
 
Selection sort(sorting algorithm in data structure) and its time complexity
Selection sort(sorting algorithm in data structure) and its time complexitySelection sort(sorting algorithm in data structure) and its time complexity
Selection sort(sorting algorithm in data structure) and its time complexity
 
Projective and hybrid projective synchronization of 4-D hyperchaotic system v...
Projective and hybrid projective synchronization of 4-D hyperchaotic system v...Projective and hybrid projective synchronization of 4-D hyperchaotic system v...
Projective and hybrid projective synchronization of 4-D hyperchaotic system v...
 
curve fitting or regression analysis-1.pptx
curve fitting or regression analysis-1.pptxcurve fitting or regression analysis-1.pptx
curve fitting or regression analysis-1.pptx
 
Sulpcegu5e ppt 2_3
Sulpcegu5e ppt 2_3Sulpcegu5e ppt 2_3
Sulpcegu5e ppt 2_3
 
New approach for wolfe’s modified simplex method to solve quadratic programmi...
New approach for wolfe’s modified simplex method to solve quadratic programmi...New approach for wolfe’s modified simplex method to solve quadratic programmi...
New approach for wolfe’s modified simplex method to solve quadratic programmi...
 

Viewers also liked

Tugas difusi inovasi
Tugas difusi inovasiTugas difusi inovasi
Tugas difusi inovasiali muntaha
 
Materials design presentation 01 (2)
Materials design presentation 01 (2)Materials design presentation 01 (2)
Materials design presentation 01 (2)Brenda Westtesten
 
Inriktningsmål för hållbarhet
Inriktningsmål för hållbarhetInriktningsmål för hållbarhet
Inriktningsmål för hållbarhetPierre Ringborg
 
Parker White Lily Residency, Sonepat - Aadi Properties..Pvt
Parker White Lily Residency, Sonepat - Aadi Properties..PvtParker White Lily Residency, Sonepat - Aadi Properties..Pvt
Parker White Lily Residency, Sonepat - Aadi Properties..PvtAadi Property
 
Top 5 Reasons European companies keep outsourcing IT services
Top 5 Reasons European companies keep outsourcing IT servicesTop 5 Reasons European companies keep outsourcing IT services
Top 5 Reasons European companies keep outsourcing IT servicesCedric Brusselmans
 
Varbergs miljömål 2015-2025
Varbergs miljömål 2015-2025Varbergs miljömål 2015-2025
Varbergs miljömål 2015-2025Pierre Ringborg
 
Apa itu teknik Cross Process?
Apa itu teknik Cross Process?Apa itu teknik Cross Process?
Apa itu teknik Cross Process?Fotografi Desain
 
Utredning norra stadsområdet VARBERG 151027
Utredning norra stadsområdet VARBERG 151027Utredning norra stadsområdet VARBERG 151027
Utredning norra stadsområdet VARBERG 151027Pierre Ringborg
 
141 0091 varbergstunneln-tidning_spread_v011 VARBERGSTUNNELN
141 0091 varbergstunneln-tidning_spread_v011 VARBERGSTUNNELN141 0091 varbergstunneln-tidning_spread_v011 VARBERGSTUNNELN
141 0091 varbergstunneln-tidning_spread_v011 VARBERGSTUNNELNPierre Ringborg
 
Do or Do Not Plan There Is.
Do or Do Not Plan There Is.Do or Do Not Plan There Is.
Do or Do Not Plan There Is.Jonathan Hartley
 
Waterloo some modern lessons
Waterloo some modern lessonsWaterloo some modern lessons
Waterloo some modern lessonsPeter Eales
 
White lily residency sonepat,sector 27 9350193692
White lily residency sonepat,sector 27 9350193692White lily residency sonepat,sector 27 9350193692
White lily residency sonepat,sector 27 9350193692Aadi Property
 
SD stödpartiet till Alliansen
SD stödpartiet till AlliansenSD stödpartiet till Alliansen
SD stödpartiet till AlliansenPierre Ringborg
 
Rappport, en till en, utvärdering år 2
Rappport, en till en, utvärdering år 2Rappport, en till en, utvärdering år 2
Rappport, en till en, utvärdering år 2Pierre Ringborg
 

Viewers also liked (20)

Tugas difusi inovasi
Tugas difusi inovasiTugas difusi inovasi
Tugas difusi inovasi
 
Materials design presentation 01 (2)
Materials design presentation 01 (2)Materials design presentation 01 (2)
Materials design presentation 01 (2)
 
Inriktningsmål för hållbarhet
Inriktningsmål för hållbarhetInriktningsmål för hållbarhet
Inriktningsmål för hållbarhet
 
Parker White Lily Residency, Sonepat - Aadi Properties..Pvt
Parker White Lily Residency, Sonepat - Aadi Properties..PvtParker White Lily Residency, Sonepat - Aadi Properties..Pvt
Parker White Lily Residency, Sonepat - Aadi Properties..Pvt
 
Guru (2)
Guru (2)Guru (2)
Guru (2)
 
Facebook Advertising in 2015
Facebook Advertising in 2015Facebook Advertising in 2015
Facebook Advertising in 2015
 
Top 5 Reasons European companies keep outsourcing IT services
Top 5 Reasons European companies keep outsourcing IT servicesTop 5 Reasons European companies keep outsourcing IT services
Top 5 Reasons European companies keep outsourcing IT services
 
Varbergs miljömål 2015-2025
Varbergs miljömål 2015-2025Varbergs miljömål 2015-2025
Varbergs miljömål 2015-2025
 
Apa itu teknik Cross Process?
Apa itu teknik Cross Process?Apa itu teknik Cross Process?
Apa itu teknik Cross Process?
 
Utredning norra stadsområdet VARBERG 151027
Utredning norra stadsområdet VARBERG 151027Utredning norra stadsområdet VARBERG 151027
Utredning norra stadsområdet VARBERG 151027
 
141 0091 varbergstunneln-tidning_spread_v011 VARBERGSTUNNELN
141 0091 varbergstunneln-tidning_spread_v011 VARBERGSTUNNELN141 0091 varbergstunneln-tidning_spread_v011 VARBERGSTUNNELN
141 0091 varbergstunneln-tidning_spread_v011 VARBERGSTUNNELN
 
Do or Do Not Plan There Is.
Do or Do Not Plan There Is.Do or Do Not Plan There Is.
Do or Do Not Plan There Is.
 
Waterloo some modern lessons
Waterloo some modern lessonsWaterloo some modern lessons
Waterloo some modern lessons
 
White lily residency sonepat,sector 27 9350193692
White lily residency sonepat,sector 27 9350193692White lily residency sonepat,sector 27 9350193692
White lily residency sonepat,sector 27 9350193692
 
SD stödpartiet till Alliansen
SD stödpartiet till AlliansenSD stödpartiet till Alliansen
SD stödpartiet till Alliansen
 
Rappport, en till en, utvärdering år 2
Rappport, en till en, utvärdering år 2Rappport, en till en, utvärdering år 2
Rappport, en till en, utvärdering år 2
 
Aadi propertiesmy
Aadi propertiesmyAadi propertiesmy
Aadi propertiesmy
 
C#
C#C#
C#
 
Bio project!!!
Bio project!!!Bio project!!!
Bio project!!!
 
Años 60
Años 60Años 60
Años 60
 

Similar to 硕士论文

An Improved Iterative Method for Solving General System of Equations via Gene...
An Improved Iterative Method for Solving General System of Equations via Gene...An Improved Iterative Method for Solving General System of Equations via Gene...
An Improved Iterative Method for Solving General System of Equations via Gene...Zac Darcy
 
An Improved Iterative Method for Solving General System of Equations via Gene...
An Improved Iterative Method for Solving General System of Equations via Gene...An Improved Iterative Method for Solving General System of Equations via Gene...
An Improved Iterative Method for Solving General System of Equations via Gene...Zac Darcy
 
QTML2021 UAP Quantum Feature Map
QTML2021 UAP Quantum Feature MapQTML2021 UAP Quantum Feature Map
QTML2021 UAP Quantum Feature MapHa Phuong
 
Learning a nonlinear embedding by preserving class neibourhood structure 최종
Learning a nonlinear embedding by preserving class neibourhood structure   최종Learning a nonlinear embedding by preserving class neibourhood structure   최종
Learning a nonlinear embedding by preserving class neibourhood structure 최종WooSung Choi
 
AN IMPROVED ITERATIVE METHOD FOR SOLVING GENERAL SYSTEM OF EQUATIONS VIA GENE...
AN IMPROVED ITERATIVE METHOD FOR SOLVING GENERAL SYSTEM OF EQUATIONS VIA GENE...AN IMPROVED ITERATIVE METHOD FOR SOLVING GENERAL SYSTEM OF EQUATIONS VIA GENE...
AN IMPROVED ITERATIVE METHOD FOR SOLVING GENERAL SYSTEM OF EQUATIONS VIA GENE...Zac Darcy
 
Intro to Quant Trading Strategies (Lecture 2 of 10)
Intro to Quant Trading Strategies (Lecture 2 of 10)Intro to Quant Trading Strategies (Lecture 2 of 10)
Intro to Quant Trading Strategies (Lecture 2 of 10)Adrian Aley
 
Mimo system-order-reduction-using-real-coded-genetic-algorithm
Mimo system-order-reduction-using-real-coded-genetic-algorithmMimo system-order-reduction-using-real-coded-genetic-algorithm
Mimo system-order-reduction-using-real-coded-genetic-algorithmCemal Ardil
 
Investigation of auto-oscilational regimes of the system by dynamic nonlinear...
Investigation of auto-oscilational regimes of the system by dynamic nonlinear...Investigation of auto-oscilational regimes of the system by dynamic nonlinear...
Investigation of auto-oscilational regimes of the system by dynamic nonlinear...IJECEIAES
 
Rasool_PhD_Final_Presentation
Rasool_PhD_Final_PresentationRasool_PhD_Final_Presentation
Rasool_PhD_Final_PresentationGhulam Rasool
 
Radial basis function neural network control for parallel spatial robot
Radial basis function neural network control for parallel spatial robotRadial basis function neural network control for parallel spatial robot
Radial basis function neural network control for parallel spatial robotTELKOMNIKA JOURNAL
 
Solving output control problems using Lyapunov gradient-velocity vector function
Solving output control problems using Lyapunov gradient-velocity vector functionSolving output control problems using Lyapunov gradient-velocity vector function
Solving output control problems using Lyapunov gradient-velocity vector functionIJECEIAES
 
Stochastic optimal control & rl
Stochastic optimal control & rlStochastic optimal control & rl
Stochastic optimal control & rlChoiJinwon3
 
Approximate Solution of a Linear Descriptor Dynamic Control System via a non-...
Approximate Solution of a Linear Descriptor Dynamic Control System via a non-...Approximate Solution of a Linear Descriptor Dynamic Control System via a non-...
Approximate Solution of a Linear Descriptor Dynamic Control System via a non-...IOSR Journals
 
RESEARCH AND SSYNTHESIS OF CONTROL SYSTEMS BY OUTPUT OF THE OBJECT BY SPEED- ...
RESEARCH AND SSYNTHESIS OF CONTROL SYSTEMS BY OUTPUT OF THE OBJECT BY SPEED- ...RESEARCH AND SSYNTHESIS OF CONTROL SYSTEMS BY OUTPUT OF THE OBJECT BY SPEED- ...
RESEARCH AND SSYNTHESIS OF CONTROL SYSTEMS BY OUTPUT OF THE OBJECT BY SPEED- ...IAEME Publication
 
Design of predictive controller for smooth set point tracking for fast dynami...
Design of predictive controller for smooth set point tracking for fast dynami...Design of predictive controller for smooth set point tracking for fast dynami...
Design of predictive controller for smooth set point tracking for fast dynami...eSAT Journals
 
Constrained state feedback control
Constrained state feedback controlConstrained state feedback control
Constrained state feedback controlDivya Goel
 

Similar to 硕士论文 (20)

An Improved Iterative Method for Solving General System of Equations via Gene...
An Improved Iterative Method for Solving General System of Equations via Gene...An Improved Iterative Method for Solving General System of Equations via Gene...
An Improved Iterative Method for Solving General System of Equations via Gene...
 
An Improved Iterative Method for Solving General System of Equations via Gene...
An Improved Iterative Method for Solving General System of Equations via Gene...An Improved Iterative Method for Solving General System of Equations via Gene...
An Improved Iterative Method for Solving General System of Equations via Gene...
 
Modifed my_poster
Modifed my_posterModifed my_poster
Modifed my_poster
 
Tuning of PID, SVFB and LQ Controllers Using Genetic Algorithms
Tuning of PID, SVFB and LQ Controllers Using Genetic AlgorithmsTuning of PID, SVFB and LQ Controllers Using Genetic Algorithms
Tuning of PID, SVFB and LQ Controllers Using Genetic Algorithms
 
QTML2021 UAP Quantum Feature Map
QTML2021 UAP Quantum Feature MapQTML2021 UAP Quantum Feature Map
QTML2021 UAP Quantum Feature Map
 
Learning a nonlinear embedding by preserving class neibourhood structure 최종
Learning a nonlinear embedding by preserving class neibourhood structure   최종Learning a nonlinear embedding by preserving class neibourhood structure   최종
Learning a nonlinear embedding by preserving class neibourhood structure 최종
 
AN IMPROVED ITERATIVE METHOD FOR SOLVING GENERAL SYSTEM OF EQUATIONS VIA GENE...
AN IMPROVED ITERATIVE METHOD FOR SOLVING GENERAL SYSTEM OF EQUATIONS VIA GENE...AN IMPROVED ITERATIVE METHOD FOR SOLVING GENERAL SYSTEM OF EQUATIONS VIA GENE...
AN IMPROVED ITERATIVE METHOD FOR SOLVING GENERAL SYSTEM OF EQUATIONS VIA GENE...
 
Intro to Quant Trading Strategies (Lecture 2 of 10)
Intro to Quant Trading Strategies (Lecture 2 of 10)Intro to Quant Trading Strategies (Lecture 2 of 10)
Intro to Quant Trading Strategies (Lecture 2 of 10)
 
Mimo system-order-reduction-using-real-coded-genetic-algorithm
Mimo system-order-reduction-using-real-coded-genetic-algorithmMimo system-order-reduction-using-real-coded-genetic-algorithm
Mimo system-order-reduction-using-real-coded-genetic-algorithm
 
Investigation of auto-oscilational regimes of the system by dynamic nonlinear...
Investigation of auto-oscilational regimes of the system by dynamic nonlinear...Investigation of auto-oscilational regimes of the system by dynamic nonlinear...
Investigation of auto-oscilational regimes of the system by dynamic nonlinear...
 
Rasool_PhD_Final_Presentation
Rasool_PhD_Final_PresentationRasool_PhD_Final_Presentation
Rasool_PhD_Final_Presentation
 
Radial basis function neural network control for parallel spatial robot
Radial basis function neural network control for parallel spatial robotRadial basis function neural network control for parallel spatial robot
Radial basis function neural network control for parallel spatial robot
 
Solving output control problems using Lyapunov gradient-velocity vector function
Solving output control problems using Lyapunov gradient-velocity vector functionSolving output control problems using Lyapunov gradient-velocity vector function
Solving output control problems using Lyapunov gradient-velocity vector function
 
Stochastic optimal control & rl
Stochastic optimal control & rlStochastic optimal control & rl
Stochastic optimal control & rl
 
Stochastic Optimization
Stochastic OptimizationStochastic Optimization
Stochastic Optimization
 
Approximate Solution of a Linear Descriptor Dynamic Control System via a non-...
Approximate Solution of a Linear Descriptor Dynamic Control System via a non-...Approximate Solution of a Linear Descriptor Dynamic Control System via a non-...
Approximate Solution of a Linear Descriptor Dynamic Control System via a non-...
 
Seminar9
Seminar9Seminar9
Seminar9
 
RESEARCH AND SSYNTHESIS OF CONTROL SYSTEMS BY OUTPUT OF THE OBJECT BY SPEED- ...
RESEARCH AND SSYNTHESIS OF CONTROL SYSTEMS BY OUTPUT OF THE OBJECT BY SPEED- ...RESEARCH AND SSYNTHESIS OF CONTROL SYSTEMS BY OUTPUT OF THE OBJECT BY SPEED- ...
RESEARCH AND SSYNTHESIS OF CONTROL SYSTEMS BY OUTPUT OF THE OBJECT BY SPEED- ...
 
Design of predictive controller for smooth set point tracking for fast dynami...
Design of predictive controller for smooth set point tracking for fast dynami...Design of predictive controller for smooth set point tracking for fast dynami...
Design of predictive controller for smooth set point tracking for fast dynami...
 
Constrained state feedback control
Constrained state feedback controlConstrained state feedback control
Constrained state feedback control
 

硕士论文

  • 1. Application of Nonlinear Stochastic Optimal Control for Genetic Algorithm 計画数理講座 丁 可 2014.02.14
  • 2. Construction 1. Introduction 2. Overview of Genetic Algorithm 3. Nonlinear Stochastic Optimal Control 4. Application of NSOC for GA 5. Numerical Experiments 6. Conclusion 2
  • 3. NSOC GA Application 3 Introduction Background u Nonlinear Stochastic Optimal Control Foundation problem for control theory u Genetic Algorithm Derived from the principles of Darwinian natural selection and evolution. Previous research uMarkov Chain Analysis on Simple Genetic Algorithms Suzuki Joe (1995) uLinearly-Solvable Markov Decision Problems E. Todorov (2006)
  • 4. 4 Overview of Genetic Algorithms 0 1 1 0 0 1 0 1 1 0 0 1 1 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
  • 5. Algorithm of GA 5 1. Obtain two parents by the selection function. 2. Produce their child by the crossover function 3. Mutate it by the mutation function. 4. Put the result into the next generation. 5. less than population members, go to Step 1. l Optimization l Automatic Programming l Economic Models l Social Systems Models 0 1 1 0 1 0 0 1
  • 6. Nonlinear Stochastic Optimal Control(NSCO) Continuous Model 𝑥⃗ are given at an initial time t , the SCO problem is to find the control path 𝑢 ' can minimized. the cost to go function is continuous-time stochastic dynamics d) = 𝑓 𝑋, 𝑢 𝑑/ + 𝐹 𝑥, 𝑢 𝑑2 optimal control law is 𝑢∗ = −𝜎6 𝐵 𝑥 8 𝜐)(𝑋) 6 𝐽(𝑥⃗,𝑡) = min A → 𝐶(𝑥⃗, 𝑡, 𝑢 (')) 𝑥 ∈ ℝFGstate vector 𝑢 ∈ ℝFAcontrol vector 𝜔 ∈ ℝFPBrown motion
  • 7. Discrete Model 7 when we let 𝑝 𝑥T 𝑥, 𝑢 = 𝑢 𝑥T 𝑥 The bellman function of υ 𝑥 can write as υ 𝑥 = min V∈𝒰 X {ℓ 𝑖, 𝑢 + EX]~_ ' 𝑥, 𝑢 𝜐(𝑥T ) } optimal action will be 𝑢∗ 𝑥T 𝑥 = 𝑝 𝑥T 𝑥 𝓏 𝑥T 𝒢 𝓏 𝑥 𝓏 𝑥 = exp −𝑞 𝑥 𝒢 𝓏 𝑧 𝓏 = 𝑄𝑃𝓏 action cost of current state the value function of next state
  • 8. Application of NSOC for GA Transition probability of GA Selection 𝑆j 𝑥T 𝑝 = 𝑓 𝑥T j ∑ 𝑓 𝑥 j X∈_ Crossover𝐶X] 𝑥l, 𝑥m = ∑ nopnqo 6r 𝛿 𝑥l ⊗ 𝑘 ⊕ 𝑘w ⊗ 𝑥m = 𝑥T Mutation 8 𝑚j ℎ = 𝜇j {|} (1 − 𝜇j)~•{|} β ∶ Blotzman invese T 𝜇j = ϵ𝑒𝑥𝑝 −𝜆𝛽 ℎ mutation mask 𝑘 Crossover mask 𝑝 𝑥T 𝑥 = • 𝑆j 𝑦 𝑝 • • 𝑚j 𝑥,ℎ{ 𝑚j(𝑦,ℎ6 }•,}‘ ) 𝐶X] 𝑥l, 𝑥m • 𝜒r + 𝜒̅r 2 r 𝛿 (𝑥 ⊕ ℎ{)⊗ 𝑘 ⊕ 𝑘w ⊗ (𝑦 ⊕ ℎ6 = 𝑥T
  • 9. Describe GA by NSC let 𝑝 𝑥T 𝑥, 𝑢 = 𝑢 𝑥T 𝑥 = 𝑝 𝑥T 𝑥 𝑒𝑥𝑝 𝑢X];X we can get eigenvalue function is 9 λ𝑧— = 𝒫 𝒩𝒩 𝑥 ℛ 𝒩𝒩 𝑥 𝑧 𝒩 (∀𝒙) 𝒫 𝒩𝒩(𝑥) = 𝑝 𝑥({) 𝑥 𝑝 𝑥(6) 𝑥 ⋯ 𝑝 𝑥(—) 𝑥 𝑝 𝑥({) 𝑥 𝑝 𝑥(6) 𝑥 ⋯ 𝑝 𝑥(—) 𝑥 ⋮ ⋮ ⋱ ⋮ 𝑝 𝑥({) 𝑥 𝑝 𝑥(6) 𝑥 ⋯ 𝑝 𝑥(—) 𝑥 0 0 0 0 0 1 0 1 1 1 0 0 1 0 1 1 1 0 0 1 0 1 1 1
  • 10. 10 ℛ 𝒩𝒩(𝑥) = 𝑒𝑥𝑝 (−𝑟(𝑥 { ; 𝑥) 0 ⋯ 0 0 𝑒𝑥𝑝 (−𝑟(𝑥 6 ; 𝑥) ⋯ 0 ⋮ ⋮ ⋱ ⋮ 0 0 𝑒𝑥𝑝 (−𝑟(𝑥 — ; 𝑥) 𝑧 𝒩 = 𝑧 𝑥{ 𝑧 𝑥{ ⋮ 𝑧 𝑥— 𝑧 𝑥(l) = |{λ𝐼 𝒩𝒩 − 𝒫 𝒩𝒩ℛ 𝒩𝒩}l| |λ𝐼 𝒩𝒩 − 𝒫 𝒩𝒩ℛ 𝒩𝒩| 𝑖 = 1,2,3, ⋯, 𝑁
  • 11. Numerical Experiments β: Constant ϵ =0.3 𝜆 =10 𝜒 = 0.4 𝜇j = ϵ𝑒𝑥𝑝 −𝜆𝛽 Iteration no. 800 fitness function is to compute the number of 1, obtained by individuals. Initial conditions
  • 12. 12 we set 3 patterns to GA, all of which are Constant function, Linear Function and Exponential function.
  • 13. where we list the result of transition probabilities from state0 into other states. the result shows, after the iteration of Numerical experiment, individual is prefer to go to the state 7. 0 2 3 4 5 6 7 1
  • 14. Introduce theory of GA Introduce theory of NSOC Conclusion 5 Overview of previous works Across the discrete model of NSOC, we apply it into GA , showed in numerical results. Future Works fitness function should be change by complex Compute speed should be improved. Apply by NSOC
  • 15. Thank you for your kind attention 15
  • 17. 3 Nonlinear stochastic optimal control the value function of current state the value function of current state action cost of current state
  • 20. 3 Transition Probabilities for GA Logical operator Operators Mutation Crossover
  • 22. 22
  • 23. 23
  • 24. • L= 3, β= 𝑒¨/ • 𝜒 = 0.4 • λ =10 • µ = ϵ −λβ 3 Numerical experiment Initial Condition
  • 25. 25