Jeroen Soeters
NEUROEVOLUTION
A hitchhiker’s guide to neuroevolution in Erlang.
MEET GARY
2
WHAT IS MACHINE LEARNING?
• Artificial Neural Networks
• Genetic Algorithms
3
ARTIFICIAL NEURAL
NETWORKS
4
BIOLOGICAL NEURAL NETWORKS
5
[Diagram: a biological neuron with dendrites, soma, axon, and synapse]
A MODEL FOR A NEURON
6
[Diagram: input signals x1, x2, …, xn enter the neuron through weights w1, w2, …, wn and produce the output signal Y. In the biological analogy the input signals correspond to dendrites, the weights to synapses, the neuron body to the soma, and the output signal to the axon.]
HOW DOES THE NEURON DETERMINE ITS OUTPUT?
Y = sign( ∑ᵢ₌₁ⁿ xᵢwᵢ − θ )
7
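As a minimal sketch in Erlang (the language this talk builds toward), the output computation could look like the module below. The module and function names are ours, and sign is mapped onto the 0/1 outputs used in the AND example later in the deck.

    -module(neuron_model).
    -export([output/3]).

    %% Weighted sum of the inputs minus the threshold Theta, passed through
    %% the sign (step) activation function. Inputs and Weights are
    %% equal-length lists of numbers.
    output(Inputs, Weights, Theta) ->
        Sum = lists:sum([X * W || {X, W} <- lists:zip(Inputs, Weights)]),
        sign(Sum - Theta).

    sign(S) when S >= 0 -> 1;
    sign(_)             -> 0.

For example, neuron_model:output([1, 1], [0.1, 0.1], 0.2) returns 1, matching the converged AND perceptron shown later.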
ACTIVATION FUNCTION
8
[Plot: the activation function, output Y against input X]
MEET FRANK
9
PERCEPTRON LEARNING RULE
e(p) = Yd(p) − Y(p)
wi(p + 1) = wi(p) + α · xi(p) · e(p)
10
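One application of this rule might look like the following Erlang sketch (names ours); the worked AND example later confirms the xi factor, since a weight with a zero input never changes.

    -module(perceptron_rule).
    -export([update_weights/4]).

    %% wi(p + 1) = wi(p) + Alpha * xi(p) * Error, applied to every weight.
    update_weights(Weights, Inputs, Error, Alpha) ->
        [W + Alpha * X * Error || {W, X} <- lists:zip(Weights, Inputs)].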
PERCEPTRON TRAINING ALGORITHM
11
[Flowchart: start → set the weights and threshold to random values in [-0.5, 0.5] → activate the perceptron → weight training → weights converged? no: activate again; yes: stop]
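A compact Erlang version of this loop might look as follows. Random initialization is left to the caller (the flowchart draws from [-0.5, 0.5]), and all names are ours; running it on the AND table with the starting weights from the next slide reproduces the epochs shown there.

    -module(perceptron).
    -export([train/4]).

    %% Train until an epoch produces no errors (weights converged).
    %% Examples is a list of {Inputs, Desired} pairs.
    train(Examples, Weights, Theta, Alpha) ->
        {NewWeights, Errors} = epoch(Examples, Weights, Theta, Alpha, 0),
        case Errors of
            0 -> NewWeights;                                % converged: stop
            _ -> train(Examples, NewWeights, Theta, Alpha)  % run another epoch
        end.

    epoch([], Weights, _Theta, _Alpha, Errors) ->
        {Weights, Errors};
    epoch([{Inputs, Desired} | Rest], Weights, Theta, Alpha, Errors) ->
        Y = activate(Inputs, Weights, Theta),
        E = Desired - Y,
        NewWeights = [W + Alpha * X * E || {W, X} <- lists:zip(Weights, Inputs)],
        epoch(Rest, NewWeights, Theta, Alpha, Errors + abs(E)).

    activate(Inputs, Weights, Theta) ->
        Sum = lists:sum([X * W || {X, W} <- lists:zip(Inputs, Weights)]),
        if Sum >= Theta -> 1; true -> 0 end.

Calling perceptron:train([{[0,0],0}, {[0,1],0}, {[1,0],0}, {[1,1],1}], [0.3, -0.1], 0.2, 0.1) returns [0.1, 0.1], the converged weights from epoch 5 below.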
LOGIC GATES
12
input values | outputs
x1  x2       | x1 AND x2 | x1 OR x2 | x1 XOR x2
0   0        |     0     |    0     |     0
0   1        |     0     |    1     |     1
1   0        |     0     |    1     |     1
1   1        |     1     |    1     |     0
TRAINING A PERCEPTRON TO PERFORM THE AND OPERATION
13
epoch | inputs (x1 x2) | desired output Yd | initial weights (w1 w2) | actual output Y | error e | final weights (w1 w2)
1     | 0 0            | 0                 | 0.3  -0.1               | 0               |  0      | 0.3  -0.1
1     | 0 1            | 0                 | 0.3  -0.1               | 0               |  0      | 0.3  -0.1
1     | 1 0            | 0                 | 0.3  -0.1               | 1               | -1      | 0.2  -0.1
1     | 1 1            | 1                 | 0.2  -0.1               | 0               |  1      | 0.3   0.0
Threshold: θ = 0.2; learning rate: α = 0.1
TRAINING A PERCEPTRON TO PERFORM THE AND OPERATION
14
epoch | inputs (x1 x2) | desired output Yd | initial weights (w1 w2) | actual output Y | error e | final weights (w1 w2)
2     | 0 0            | 0                 | 0.3   0.0               | 0               |  0      | 0.3   0.0
2     | 0 1            | 0                 | 0.3   0.0               | 0               |  0      | 0.3   0.0
2     | 1 0            | 0                 | 0.3   0.0               | 1               | -1      | 0.2   0.0
2     | 1 1            | 1                 | 0.2   0.0               | 1               |  0      | 0.2   0.0
Threshold: θ = 0.2; learning rate: α = 0.1
TRAINING A PERCEPTRON TO PERFORM THE AND OPERATION
15
epoch | inputs (x1 x2) | desired output Yd | initial weights (w1 w2) | actual output Y | error e | final weights (w1 w2)
3     | 0 0            | 0                 | 0.2   0.0               | 0               |  0      | 0.2   0.0
3     | 0 1            | 0                 | 0.2   0.0               | 0               |  0      | 0.2   0.0
3     | 1 0            | 0                 | 0.2   0.0               | 1               | -1      | 0.1   0.0
3     | 1 1            | 1                 | 0.1   0.0               | 0               |  1      | 0.2   0.1
Threshold: θ = 0.2; learning rate: α = 0.1
TRAINING A PERCEPTRON TO PERFORM THE AND OPERATION
16
epoch | inputs (x1 x2) | desired output Yd | initial weights (w1 w2) | actual output Y | error e | final weights (w1 w2)
4     | 0 0            | 0                 | 0.2   0.1               | 0               |  0      | 0.2   0.1
4     | 0 1            | 0                 | 0.2   0.1               | 0               |  0      | 0.2   0.1
4     | 1 0            | 0                 | 0.2   0.1               | 1               | -1      | 0.1   0.1
4     | 1 1            | 1                 | 0.1   0.1               | 1               |  0      | 0.1   0.1
Threshold: θ = 0.2; learning rate: α = 0.1
TRAINING A PERCEPTRON TO PERFORM THE AND OPERATION
17
epoch | inputs (x1 x2) | desired output Yd | initial weights (w1 w2) | actual output Y | error e | final weights (w1 w2)
5     | 0 0            | 0                 | 0.1   0.1               | 0               |  0      | 0.1   0.1
5     | 0 1            | 0                 | 0.1   0.1               | 0               |  0      | 0.1   0.1
5     | 1 0            | 0                 | 0.1   0.1               | 0               |  0      | 0.1   0.1
5     | 1 1            | 1                 | 0.1   0.1               | 1               |  0      | 0.1   0.1
Threshold: θ = 0.2; learning rate: α = 0.1
Every error in epoch 5 is zero, so the weights have converged: w1 = 0.1, w2 = 0.1 with θ = 0.2.
A LITTLE GEOMETRY…
18
[Plots: the four input points in the (x1, x2) unit square for x1 AND x2, x1 OR x2, and x1 XOR x2. For AND and OR a single straight line separates the 0 outputs from the 1 outputs; for XOR no single line can.]
WE NEED MORE LAYERS
19
[Diagram: a multilayer network. The input layer (neurons 1 and 2) takes x1 and x2, the hidden layer contains neurons 3 and 4, and the output layer (neuron 5) produces Y.]
PROBLEMS WITH BACK PROPAGATION
• a training set of sufficient size is required
• topology of the network needs to be known in advance
• no recurrent connections are allowed
• activation function must be differentiable
Does not emulate the biological world
20
EVOLUTIONARY
COMPUTATION
21
MEET JOHN
22
THE CHROMOSOME
23
[Diagram: a candidate solution encoded as a bit-string chromosome, e.g. 1 0 1 1 1 0 1 0]
CROSSOVER
24
[Diagram: single-point crossover. Both parent bit strings are cut at the same point (✂) and the tails are exchanged, producing two offspring.]
MUTATION
25
[Diagram: mutation. A randomly chosen bit in the chromosome is flipped.]
EVOLUTIONARY ALGORITHM
• represent the candidate solution as a chromosome
• choose the initial population size N, crossover probability (Pc) and mutation probability (Pm)
• define a fitness function to measure the performance of the chromosome
• define the genetic operators for the chromosome
26
EVOLUTIONARY ALGORITHM
27
[Flowchart: start → generate a population → calculate fitness → termination criteria satisfied? yes: stop; no: select a pair for mating → crossover and mutation → add to new population → new population size = N? no: select another pair; yes: replace the population and recalculate fitness]
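As a sketch, the loop above might be written in Erlang like this. Fitness, Breed (crossover plus mutation), and Done (the termination test) are problem-specific funs, and every name here is ours, not from the deck.

    -module(ga_sketch).
    -export([evolve/3]).

    %% Population: a list of chromosomes. Fitness: fun(Chromosome) -> number.
    %% Breed: fun(ParentA, ParentB) -> Child. Done: fun(Scored) -> boolean.
    evolve(Population, {Fitness, Breed, Done} = Ops, N) ->
        Scored = [{Fitness(C), C} || C <- Population],
        case Done(Scored) of
            true ->
                best(Scored);
            false ->
                NewPop = [Breed(select(Scored), select(Scored))
                          || _ <- lists:seq(1, N)],
                evolve(NewPop, Ops, N)
        end.

    best(Scored) -> element(2, lists:max(Scored)).

    %% Deliberately naive selection: a random chromosome from the fitter half.
    select(Scored) ->
        Sorted = lists:reverse(lists:sort(Scored)),
        Half = lists:sublist(Sorted, max(1, length(Sorted) div 2)),
        element(2, lists:nth(rand:uniform(length(Half)), Half)).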
TRAVELING SALESMAN
28
[Diagram: eight cities, A through H, to be visited in the shortest possible round trip]
EVOLUTIONARY ALGORITHM
• represent the candidate solution as a chromosome
• define a fitness function to measure the performance of the chromosome
• define the genetic operators for the chromosome
• choose the initial population size N, crossover probability Pc and mutation probability Pm
29
THE CHROMOSOME
30
[Diagram: a tour encoded as a permutation of the cities: H G F E A B C D]
EVOLUTIONARY ALGORITHM
• represent the candidate solution as a chromosome
• define a fitness function to measure the performance of the chromosome
• define the genetic operators for the chromosome
• choose the initial population size N, crossover probability Pc and mutation probability Pm
31
FITNESS FUNCTION
Fitness = 1 / total distance
32
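In Erlang this fitness function could be sketched as below; the coordinate representation and all names are our own illustration.

    -module(tsp_fitness).
    -export([fitness/1]).

    %% Fitness = 1 / total distance of the round trip. Tour is a list of
    %% {X, Y} city coordinates; the trip is closed back to the first city.
    fitness(Tour = [First | _]) ->
        1.0 / total_distance(Tour ++ [First]).

    total_distance([_]) ->
        0.0;
    total_distance([A, B | Rest]) ->
        dist(A, B) + total_distance([B | Rest]).

    dist({X1, Y1}, {X2, Y2}) ->
        math:sqrt((X1 - X2) * (X1 - X2) + (Y1 - Y2) * (Y1 - Y2)).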
EVOLUTIONARY ALGORITHM
• represent the candidate solution as a chromosome
• define a fitness function to measure the performance of the chromosome
• define the genetic operators for the chromosome
• choose the initial population size N, crossover probability Pc and mutation probability Pm
33
CROSSOVER
34
[Diagram: crossover for permutation chromosomes. The parents G E B C H F A D and A F E B H D C G are cut (✂); the offspring inherits the segment H, A, D from one parent, and the remaining cities B, E, F, C, G are filled in from the other parent so that every city appears exactly once.]
MUTATION
35
[Diagram: mutation for permutation chromosomes. Two randomly chosen cities (here A and D) swap positions in the tour.]
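A sketch of this swap mutation in Erlang (names ours):

    -module(tsp_mutation).
    -export([swap_mutate/1]).

    %% Pick two positions at random and exchange the cities found there.
    %% If the two positions coincide, the tour is returned unchanged.
    swap_mutate(Tour) ->
        N = length(Tour),
        I = rand:uniform(N),
        J = rand:uniform(N),
        Ith = lists:nth(I, Tour),
        Jth = lists:nth(J, Tour),
        [case K of I -> Jth; J -> Ith; _ -> C end
         || {K, C} <- lists:zip(lists:seq(1, N), Tour)].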
EVOLUTIONARY ALGORITHM
• represent the candidate solution as a chromosome
• define a fitness function to measure the performance of the chromosome
• define the genetic operators for the chromosome
• choose the initial population size N, crossover probability Pc and mutation probability Pm
36
DEMO
37
NEUROEVOLUTION
38
MEET GENE
39
SIMULATION
• inputs (sensors)
• outputs (actuators)
• fitness function
40
CLEANING ROBOT
41
FOREX TRADING
42
AND LOTS MORE…
• data compression
• training NPCs in a video game
• cyber warfare
43
EVOLUTIONARY ALGORITHM
• represent the candidate solution as a chromosome
• define a fitness function to measure the performance of the chromosome
• define the genetic operators for the chromosome
• choose the initial population size N, crossover probability Pc and mutation probability Pm
44
THE CHROMOSOME
45
EVOLUTIONARY ALGORITHM
• represent the candidate solution as a chromosome
• define a fitness function to measure the performance of the chromosome
• define the genetic operators for the chromosome
• choose the initial population size N, crossover probability Pc and mutation probability Pm
46
FITNESS FUNCTION
Fitness = performance of network on an actual problem
47
EVOLUTIONARY ALGORITHM
• represent the candidate solution as a chromosome
• define a fitness function to measure the performance of the chromosome
• define the genetic operators for the chromosome
• choose the initial population size N, crossover probability Pc and mutation probability Pm
48
CROSSOVER
Crossover doesn’t work for large neural nets!
49
MUTATE ACTIVATION FUNCTION
50
ADD CONNECTION
51
ADD NEURON
52
OUTSPLICE
53
[Diagrams: the network before and after each of these mutation operators]
MUTATION OPERATORS
and lots more…
54
EVOLUTIONARY ALGORITHM
• represent the candidate solution as a chromosome
• define a fitness function to measure the performance of the chromosome
• define the genetic operators for the chromosome
• choose the initial population size N, crossover probability Pc and mutation probability Pm
55
RANDOM IMPACT MUTATION
Number of mutations = random(1, network size)
56
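A sketch of random impact mutation in Erlang; Operators would be funs implementing operators like those above, and every name is our own.

    -module(random_impact).
    -export([mutate/3]).

    %% Apply a random number of mutations, between 1 and the network size,
    %% each one a randomly chosen operator (a fun from genotype to genotype).
    mutate(Genotype, NetworkSize, Operators) ->
        NumMutations = rand:uniform(NetworkSize),
        lists:foldl(
            fun(_, G) ->
                Op = lists:nth(rand:uniform(length(Operators)), Operators),
                Op(G)
            end,
            Genotype,
            lists:seq(1, NumMutations)).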
MEMETIC ALGORITHM
57
[Flowchart: start → generate a population → apply to a problem → local search (stochastic hill climber) → calculate effective fitness → select fit organisms → create offspring → apply to a problem again]
STOCHASTIC HILL CLIMBER (LOCAL SEARCH)
58
[Flowchart: start → apply the NN to a problem → new fitness > old fitness? yes: backup and perturb the weights; no: restore the backed-up weights → stopping condition reached? no: apply the NN again; yes: stop]
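The same loop as a functional Erlang sketch, with evaluation and perturbation passed in as funs; all names are ours and the stopping condition is simply a number of attempts.

    -module(hill_climber).
    -export([search/4]).

    %% Evaluate: fun(Weights) -> Fitness. Perturb: fun(Weights) -> NewWeights.
    search(Weights, Evaluate, Perturb, Attempts) ->
        loop(Weights, Evaluate(Weights), Evaluate, Perturb, Attempts).

    loop(Best, _BestFitness, _Evaluate, _Perturb, 0) ->
        Best;
    loop(Best, BestFitness, Evaluate, Perturb, Attempts) ->
        Candidate = Perturb(Best),    % backup is implicit: Best is kept around
        case Evaluate(Candidate) of
            F when F > BestFitness -> % improvement: keep the perturbed weights
                loop(Candidate, F, Evaluate, Perturb, Attempts - 1);
            _ ->                      % no improvement: fall back to the backup
                loop(Best, BestFitness, Evaluate, Perturb, Attempts - 1)
        end.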
A LANGUAGE FOR NEUROEVOLUTION
• The system must be able to handle very large numbers of concurrent activities
• Actions must be performed at a certain point in time or within a certain time
• Systems may be distributed over several computers
• The system is used to control hardware
• The software systems are very large
59
A LANGUAGE FOR NEUROEVOLUTION
59
• The system exhibits complex functionality such as feature interaction.
• The systems should be in continuous operation for many years.
• Software maintenance (reconfiguration, etc.) should be performed without stopping the system.
• There are stringent quality and reliability requirements.
• Fault tolerance
Bjarne Dacker. Erlang - A New Programming Language. Ericsson Review, no. 2, 1993.
MEET JOE
60
1:1 MAPPING
61
[Diagram: each neuron in the genotype maps one-to-one onto an Erlang process]
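A minimal sketch of that mapping: a neuron as an Erlang process that collects weighted inputs and forwards its output. The message shapes and names are our own illustration, not the system's actual protocol, and tanh stands in for whatever activation function the neuron carries.

    -module(neuron_proc).
    -export([start/3]).

    %% One process per neuron. It waits until a {forward, From, Signal}
    %% message has arrived from every input process, computes its
    %% activation, and fans the result out to its output processes.
    start(InputPids, Weights, OutputPids) ->
        spawn(fun() -> loop(InputPids, Weights, OutputPids, #{}) end).

    loop(InputPids, Weights, OutputPids, Acc) ->
        receive
            {forward, From, Signal} ->
                Acc1 = Acc#{From => Signal},
                case maps:size(Acc1) =:= length(InputPids) of
                    true ->
                        Inputs = [maps:get(P, Acc1) || P <- InputPids],
                        Output = activate(Inputs, Weights),
                        [Pid ! {forward, self(), Output} || Pid <- OutputPids],
                        loop(InputPids, Weights, OutputPids, #{});
                    false ->
                        loop(InputPids, Weights, OutputPids, Acc1)
                end
        end.

    activate(Inputs, Weights) ->
        math:tanh(lists:sum([X * W || {X, W} <- lists:zip(Inputs, Weights)])).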
THE NEURAL NETWORK
62
[Diagram: a sensor, the neurons, and an actuator connected to a scape, all coordinated by a cortex. One evaluation cycle: the cortex sends sync to the sensor; the sensor sends sense to the scape and gets a percept back; the signal is forwarded through the neuron layers to the actuator; the actuator sends an action to the scape, which answers with {fitness, halt_flag}; the actuator then syncs back to the cortex.]
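From the sensor's side, that cycle could look like the following sketch; again the message shapes are our assumption for illustration.

    -module(sensor_sketch).
    -export([loop/3]).

    %% On each sync from the cortex, ask the scape for a percept and fan
    %% it out to the first layer of neurons.
    loop(Cortex, Scape, NeuronPids) ->
        receive
            {Cortex, sync} ->
                Scape ! {self(), sense},
                receive
                    {Scape, percept, Percept} ->
                        [N ! {forward, self(), Percept} || N <- NeuronPids]
                end,
                loop(Cortex, Scape, NeuronPids)
        end.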
THE EXOSELF
63
[Diagram: an exoself process sits next to the network and drives the hill climber. The cortex reports {evaluation_completed, fitness} to the exoself; if the fitness beats the best fitness seen so far, the exoself tells the neurons to backup_weights; it then sends perturb_weights to a subset of the neurons and reactivates the cortex for the next evaluation.]
THE POPULATION MONITOR
64
[Diagram: the population monitor reads genotypes from the database and starts an agent for each of them, every agent running against its own private scape. As the agents report terminated, the monitor selects the fittest genotypes, removes the weakest, creates offspring to refill the population, and starts the next generation.]
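One generation of that start/terminated round trip might be sketched like this; Agent is a fun that runs a genotype to completion and returns its fitness, and all names are ours.

    -module(monitor_sketch).
    -export([run_generation/2]).

    %% Start one process per genotype, then collect a {Fitness, Genotype}
    %% pair from each agent as it terminates.
    run_generation(Genotypes, Agent) ->
        Parent = self(),
        Pids = [spawn(fun() ->
                          Fitness = Agent(G),
                          Parent ! {terminated, self(), G, Fitness}
                      end)
                || G <- Genotypes],
        [receive {terminated, Pid, G, F} -> {F, G} end || Pid <- Pids].

Selection and offspring creation would then turn the scored genotypes into the next generation, written back to the database.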
THE DEVIL IS IN THE DETAILS
• recurrent connections
• newer generations get a higher chance for mutation
• neural plasticity
• public scapes and steady state evolution
65
POLE BALANCING
66
[Diagram: the agent receives percepts from the cart-and-pole and sends actions back to the cart]
DEMO
67
BENCHMARK RESULTS
68
Method     | Single-Pole / Incomplete State Information | Double-Pole / Partial Information w/o Damping | Double-Pole w/ Damping
RWG        | 8557          | 415209 | 1232296
SANE       | 1212          | 262700 | 451612
CNE*       | 724           | 76906* | 87623*
ESP        | 589           | 7374   | 26342
NEAT       | -             | -      | 6929
CMA-ES*    | -             | 3521*  | 6061*
CoSyNE*    | 127*          | 1249*  | 3416*
DXNN       | not performed | 2359   | 2313
Our System | 647           | 5184   | 4792
THE HANDBOOK
69
Feel free to reach out at:
e. jsoeters@thoughtworks.com
t. @JeroenSoeters
THANK YOU
DATA SCIENCE AND ENGINEERING
71
