SlideShare a Scribd company logo
1 of 27
Download to read offline
A Double Lexicase Selection Operator for
Bloat Control in Evolutionary Feature
Construction for Regression
Hengzhe Zhang
Supervisor: Mengjie Zhang, Bing Xue, Qi Chen, Wolfgang Banzhaf (MSU)
Victoria University of Wellington
17/07/2023
Table of Contents
1 Background
2 Algorithm
3 Experimental Settings
4 Experimental Results
5 Conclusion
1 20
Background
Evolutionary Feature Construction
The general idea of feature construction is to construct a set of new features
{ϕ1, . . . , ϕm} to enhance the learning performance on a given dataset
{{x1, y1}, . . . , {xn, yn}} compared to learning on the original features
{x1, . . . , xp}.
Genetic programming (GP) has been extensively employed to automatically
construct features due to its flexible representation and gradient-free search
mechanism.
(a) Feature Construction on Linear Regression (b) New Feature Space
2 20
Bloat Phenomenon
Bloat refers to the tendency of GP solutions to grow more complex over time
without improving the fitness value.
Growth of program size
3 20
Bloat Phenomenon
The explanations for bloat include:
Hitchhiking
Defense against crossover
Removal bias
The nature of a program search space
Regardless of the reason for bloat, it is widely acknowledged that addressing bloat
can increase search efficiency and enhance the interpretability of the final model.
4 20
Existing Bloat Control Techniques
Depth Limit: Set a strict depth limit for each GP tree.
Variation Operator
▶ Prune-and-Plant (PAP)
▶ Semantic Approximation (SA)
Selection Operator
▶ Double Tournament Selection (DTS)
▶ Semantic Tournament Selection (TS-S)
Fitness Function
▶ Tarpeian
▶ Alpha-Dominance MOGP
5 20
Double Tournament Selection
Double Tournament Selection
Double Stages of DTS:
Stage 1: Tournament selection,
get individuals A, B
Stage 2: Select the smaller one in
A,B with a probability of 0.7
Advantage:
Applicable for various scenarios
(GPSR, GPHH)
Disadvantage:
May lead to reduced diversity,
given that the tournament
selection operator is repeatedly
used. -> Lexicase Selection!
6 20
Lexicase Selection
Tournament vs Lexicase
Tournament Selection produces a
lot of semantically equivalent
individuals
Lexicase Selection preserves a very
good population diversity
Why lexicase selection?
It is not necessary to sum up all
errors as a scalar for EA methods.
MSE = 1
n
Pn
i=1

Yi − Ŷi
2
How to perform lexicase selection?
Step 1: Construct a filter
MAD (et) = λ (et) =
medianj

etj
− mediank etk
 
Step 2: Remove bad individuals
based on the filter
Step 3: Construct more filters until
only one individual remains
7 20
Algorithm
Double Lexicase Selection
Double Lexicase Selection
Two Stages of DLS:
Stage 1: Lexicase selection, get k
individuals A, B, C, D, ...to form a
candidate pool
Stage 2: Roulette Wheel
Selection on k individuals
negatively proportionate to tree
size
Advantage:
Applicable for every scenario
(GPSR, GPHH)
Fully exploit semantics because
of using the lexicase selection
operator
8 20
Experimental Settings
Datasets
98 Regression datasets are used in the experiments, which are all datasets in
PMLB with less than 2000 data items.
The size of datasets range between 47 and 1059, and the dimension of datasets
are between 2 and 124.




1XPEHURI,QVWDQFHV




1XPEHURI)HDWXUHV
Properties of experimental datasets
9 20
Baseline Methods
Seven baseline bloat control methods:
Depth Limit
Double Tournament Selection (DTS) (ECJ 2006)
Tarpeian (EuroGP 2003)
Prune and plant (PAP) (ECJ 2010)
TS-S (Inf. Sci. 2018)
DSA (Swarm Evol. Comput. 2020)
α-MOGP (TEVC) (TEVC 2022)
10 20
Parameter Settings
Parameter settings are common settings in GP.
Parameter settings for all experiments.
11 20
Experimental Results
R2
Score
Only DLS (first row), DSA and Tarpeian methods have similar or better
performance on most datasets compared to the depth limit method.
For the top three algorithms, the DLS method is better than the Tarpeian and
DSA methods.
Statistical comparison of test R2
score for different bloat control methods on 98 datasets.
(“+,“∼, and “- indicate using the method in a row is better than, similar to or worse than
using the method in a column.)
12 20
Model Size
DLS (first row) is a successful bloat control method, as it reduces model sizes
on all datasets.
When comparing the PAP, DSA, DTS and TS-S operators, the DLS operator is
worse at reducing model size.
However, DLS is better than the PAP, DSA, DTS and TS-S operators in terms of
test R2 scores.
Statistical comparison of model sizes for different bloat control methods on 98 datasets.
(+,∼, and - indicate using the method in a row is better than, similar to or worse than
using the method in a column.)
13 20
Evolutionary Plots
DLS operator has good effectiveness in terms of R2 scores over the whole
evolution process and thus achieves good final accuracy.
Depth limit cannot effectively control tree sizes. In contrast, the DLS operator
can effectively control tree size to a relatively low level.
0 25 50 75 100
Generation
0.50
0.75
1.00
R
2
Score
OpenML 582
0 25 50 75 100
Generation
0.8
0.9
R
2
Score
OpenML 599
0 25 50 75 100
Generation
0.50
0.75
R
2
Score
OpenML 618
0 25 50 75 100
Generation
0.50
0.75
R
2
Score
OpenML 645
DLS
αMOGP
Tarpeian
DTS
PAP
TS-S
DSA
DepthLimiting
(a) Evolutionary plots of test R2
score for
different bloat control methods.
0 25 50 75 100
Generation
5
10
Tree
Size
OpenML 582
0 25 50 75 100
Generation
5
10
15
Tree
Size
OpenML 599
0 25 50 75 100
Generation
5
10
Tree
Size
OpenML 618
0 25 50 75 100
Generation
5
10
Tree
Size
OpenML 645
DLS
αMOGP
Tarpeian
DTS
PAP
TS-S
DSA
DepthLimiting
(b) Evolutionary plots of average tree sizes
for different bloat control methods.
14 20
Overall Analysis
Friedman’s rank of test R2
scores and
tree sizes on 98 datasets for different
bloat control methods.
Only four methods, DLS,
Tarpeian, DSA and αMOGP do not
get worse predictive
performance on R2 scores than
using depth-limited methods
alone.
DLS operator achieves a good
trade-off between test R2 scores
and model size.
15 20
Capacity of Candidate Pool
Model sizes decrease with an increase in capacity.
Training time will increase significantly when increasing pool capacity from 10
to 20.
 a 





(a) Statistical comparison on
tree size between using a
capacity of 10 and 2.
 a 





(b) Statistical comparison on
tree size between using a
capacity of 10 and 5.
   
3RRODSDFLW






7UDLQLQJ7LPH
(c) Distribution of training
time versus candidate pool
capacity.
16 20
Roulette wheel selection
Roulette wheel selection operator is significantly better than minimum
selection operator in terms of test R2 scores.
The minimum selection operator favors very small individuals and thus leads
to very poor predictive performance.
 a 







a

(a) Statistical comparison of R2
scores using
roulette instead of minimum as the selection
strategy on 98 datasets.
  
7UHH6L]H





RXQW
$OJRULWKP
0LQ
5RXOHWWH
(b) Distribution of final tree sizes when using
roulette or minimum as the selection strategy
on 98 datasets.
17 20
Training Time
'
/
6
α
0
2
*
3
7
D
U
S
H
L
D
Q
'
7
6
3
$
3
7
6

6
'
6
$
'
H
S
W
K
/
L
P
L
W



7UDLQLQJ7LPH
2SHQ0/B
'
/
6
α
0
2
*
3
7
D
U
S
H
L
D
Q
'
7
6
3
$
3
7
6

6
'
6
$
'
H
S
W
K
/
L
P
L
W



7UDLQLQJ7LPH
2SHQ0/B
'
/
6
α
0
2
*
3
7
D
U
S
H
L
D
Q
'
7
6
3
$
3
7
6

6
'
6
$
'
H
S
W
K
/
L
P
L
W



7UDLQLQJ7LPH
2SHQ0/B
'
/
6
α
0
2
*
3
7
D
U
S
H
L
D
Q
'
7
6
3
$
3
7
6

6
'
6
$
'
H
S
W
K
/
L
P
L
W



7UDLQLQJ7LPH
2SHQ0/B
'
/
6
α
0
2
*
3
7
D
U
S
H
L
D
Q
'
7
6
3
$
3
7
6

6
'
6
$
'
H
S
W
K
/
L
P
L
W



7UDLQLQJ7LPH
2SHQ0/B
'
/
6
α
0
2
*
3
7
D
U
S
H
L
D
Q
'
7
6
3
$
3
7
6

6
'
6
$
'
H
S
W
K
/
L
P
L
W


7UDLQLQJ7LPH
2SHQ0/B
Box plots on the training time for
different bloat control methods.
The time complexity of DLS is
O(Cap|P|2n), where |P| is the
population size, n is the number
of data items and Cap is the
capacity of the candidate pool.
DLS is slower than Tarpeian, DTS,
PAP, DSA and depth limit
methods, but faster than αMOGP.
18 20
Compatibility with Multi-objective Methods
There is no significant difference between integration DLS or not in terms of R2
scores.
Incorporating DLS with MO methods can significantly reduce model sizes across
nearly all datasets, as compared to using MO methods alone.
(a) Statistical comparison of test R2
scores for
integration with DLS and MO methods.
(b) Statistical comparison of model sizes for
integration with DLS and MO methods.
19 20
Conclusion
Conclusions
By employing a double-stage selection mechanism that takes into account
both the model performance and size, the lexicase selection operator can
effectively control bloat.
Utilizing a large candidate pool and employing roulette wheel selection are
crucial for double lexicase selection.
Open Source Project: Evolutionary Forest (90 GitHub Stars)
20 / 20
Thanks for listening!
Email: Hengzhe.zhang@ecs.vuw.ac.nz
GitHub Project: https://github.com/hengzhe-zhang/DoubleLexicaseSelection

More Related Content

Similar to A Double Lexicase Selection Operator for Bloat Control in Evolutionary Feature Construction for Regression

Two methods for optimising cognitive model parameters
Two methods for optimising cognitive model parametersTwo methods for optimising cognitive model parameters
Two methods for optimising cognitive model parametersUniversity of Huddersfield
 
Performance Evaluation: A Comparative Study of Various Classifiers
Performance Evaluation: A Comparative Study of Various ClassifiersPerformance Evaluation: A Comparative Study of Various Classifiers
Performance Evaluation: A Comparative Study of Various Classifiersamreshkr19
 
Wasserstein 1031 thesis [Chung il kim]
Wasserstein 1031 thesis [Chung il kim]Wasserstein 1031 thesis [Chung il kim]
Wasserstein 1031 thesis [Chung il kim]Chung-Il Kim
 
230208 MLOps Getting from Good to Great.pptx
230208 MLOps Getting from Good to Great.pptx230208 MLOps Getting from Good to Great.pptx
230208 MLOps Getting from Good to Great.pptxArthur240715
 
Study of relevancy, diversity, and novelty in recommender systems
Study of relevancy, diversity, and novelty in recommender systemsStudy of relevancy, diversity, and novelty in recommender systems
Study of relevancy, diversity, and novelty in recommender systemsChemseddine Berbague
 
Study on Application of Ensemble learning on Credit Scoring
Study on Application of Ensemble learning on Credit ScoringStudy on Application of Ensemble learning on Credit Scoring
Study on Application of Ensemble learning on Credit Scoringharmonylab
 
Updated paper in latex(1)
Updated paper in latex(1)Updated paper in latex(1)
Updated paper in latex(1)Richa Shukla
 
A simple framework for contrastive learning of visual representations
A simple framework for contrastive learning of visual representationsA simple framework for contrastive learning of visual representations
A simple framework for contrastive learning of visual representationsDevansh16
 
deep_Visualization in Data mining.ppt
deep_Visualization in Data mining.pptdeep_Visualization in Data mining.ppt
deep_Visualization in Data mining.pptPerumalPitchandi
 
ENHANCED BREAST CANCER RECOGNITION BASED ON ROTATION FOREST FEATURE SELECTIO...
 ENHANCED BREAST CANCER RECOGNITION BASED ON ROTATION FOREST FEATURE SELECTIO... ENHANCED BREAST CANCER RECOGNITION BASED ON ROTATION FOREST FEATURE SELECTIO...
ENHANCED BREAST CANCER RECOGNITION BASED ON ROTATION FOREST FEATURE SELECTIO...cscpconf
 
Regression vs Deep Neural net vs SVM
Regression vs Deep Neural net vs SVMRegression vs Deep Neural net vs SVM
Regression vs Deep Neural net vs SVMRatul Alahy
 
An Adaptive Masker for the Differential Evolution Algorithm
An Adaptive Masker for the Differential Evolution AlgorithmAn Adaptive Masker for the Differential Evolution Algorithm
An Adaptive Masker for the Differential Evolution AlgorithmIOSR Journals
 
Fault-Tolerance Aware Multi Objective Scheduling Algorithm for Task Schedulin...
Fault-Tolerance Aware Multi Objective Scheduling Algorithm for Task Schedulin...Fault-Tolerance Aware Multi Objective Scheduling Algorithm for Task Schedulin...
Fault-Tolerance Aware Multi Objective Scheduling Algorithm for Task Schedulin...csandit
 
Deep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpointsDeep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpointsValery Tkachenko
 
IRJET- Expert Independent Bayesian Data Fusion and Decision Making Model for ...
IRJET- Expert Independent Bayesian Data Fusion and Decision Making Model for ...IRJET- Expert Independent Bayesian Data Fusion and Decision Making Model for ...
IRJET- Expert Independent Bayesian Data Fusion and Decision Making Model for ...IRJET Journal
 
ON THE PREDICTION ACCURACIES OF THREE MOST KNOWN REGULARIZERS : RIDGE REGRESS...
ON THE PREDICTION ACCURACIES OF THREE MOST KNOWN REGULARIZERS : RIDGE REGRESS...ON THE PREDICTION ACCURACIES OF THREE MOST KNOWN REGULARIZERS : RIDGE REGRESS...
ON THE PREDICTION ACCURACIES OF THREE MOST KNOWN REGULARIZERS : RIDGE REGRESS...ijaia
 
Gradient Boosted Regression Trees in Scikit Learn by Gilles Louppe & Peter Pr...
Gradient Boosted Regression Trees in Scikit Learn by Gilles Louppe & Peter Pr...Gradient Boosted Regression Trees in Scikit Learn by Gilles Louppe & Peter Pr...
Gradient Boosted Regression Trees in Scikit Learn by Gilles Louppe & Peter Pr...PyData
 
An Improved Adaptive Multi-Objective Particle Swarm Optimization for Disassem...
An Improved Adaptive Multi-Objective Particle Swarm Optimization for Disassem...An Improved Adaptive Multi-Objective Particle Swarm Optimization for Disassem...
An Improved Adaptive Multi-Objective Particle Swarm Optimization for Disassem...IJRESJOURNAL
 

Similar to A Double Lexicase Selection Operator for Bloat Control in Evolutionary Feature Construction for Regression (20)

Two methods for optimising cognitive model parameters
Two methods for optimising cognitive model parametersTwo methods for optimising cognitive model parameters
Two methods for optimising cognitive model parameters
 
Rn d presentation_gurulingannk
Rn d presentation_gurulingannkRn d presentation_gurulingannk
Rn d presentation_gurulingannk
 
Performance Evaluation: A Comparative Study of Various Classifiers
Performance Evaluation: A Comparative Study of Various ClassifiersPerformance Evaluation: A Comparative Study of Various Classifiers
Performance Evaluation: A Comparative Study of Various Classifiers
 
Wasserstein 1031 thesis [Chung il kim]
Wasserstein 1031 thesis [Chung il kim]Wasserstein 1031 thesis [Chung il kim]
Wasserstein 1031 thesis [Chung il kim]
 
230208 MLOps Getting from Good to Great.pptx
230208 MLOps Getting from Good to Great.pptx230208 MLOps Getting from Good to Great.pptx
230208 MLOps Getting from Good to Great.pptx
 
Study of relevancy, diversity, and novelty in recommender systems
Study of relevancy, diversity, and novelty in recommender systemsStudy of relevancy, diversity, and novelty in recommender systems
Study of relevancy, diversity, and novelty in recommender systems
 
Study on Application of Ensemble learning on Credit Scoring
Study on Application of Ensemble learning on Credit ScoringStudy on Application of Ensemble learning on Credit Scoring
Study on Application of Ensemble learning on Credit Scoring
 
Updated paper in latex(1)
Updated paper in latex(1)Updated paper in latex(1)
Updated paper in latex(1)
 
A simple framework for contrastive learning of visual representations
A simple framework for contrastive learning of visual representationsA simple framework for contrastive learning of visual representations
A simple framework for contrastive learning of visual representations
 
Srikanta Mishra
Srikanta MishraSrikanta Mishra
Srikanta Mishra
 
deep_Visualization in Data mining.ppt
deep_Visualization in Data mining.pptdeep_Visualization in Data mining.ppt
deep_Visualization in Data mining.ppt
 
ENHANCED BREAST CANCER RECOGNITION BASED ON ROTATION FOREST FEATURE SELECTIO...
 ENHANCED BREAST CANCER RECOGNITION BASED ON ROTATION FOREST FEATURE SELECTIO... ENHANCED BREAST CANCER RECOGNITION BASED ON ROTATION FOREST FEATURE SELECTIO...
ENHANCED BREAST CANCER RECOGNITION BASED ON ROTATION FOREST FEATURE SELECTIO...
 
Regression vs Deep Neural net vs SVM
Regression vs Deep Neural net vs SVMRegression vs Deep Neural net vs SVM
Regression vs Deep Neural net vs SVM
 
An Adaptive Masker for the Differential Evolution Algorithm
An Adaptive Masker for the Differential Evolution AlgorithmAn Adaptive Masker for the Differential Evolution Algorithm
An Adaptive Masker for the Differential Evolution Algorithm
 
Fault-Tolerance Aware Multi Objective Scheduling Algorithm for Task Schedulin...
Fault-Tolerance Aware Multi Objective Scheduling Algorithm for Task Schedulin...Fault-Tolerance Aware Multi Objective Scheduling Algorithm for Task Schedulin...
Fault-Tolerance Aware Multi Objective Scheduling Algorithm for Task Schedulin...
 
Deep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpointsDeep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpoints
 
IRJET- Expert Independent Bayesian Data Fusion and Decision Making Model for ...
IRJET- Expert Independent Bayesian Data Fusion and Decision Making Model for ...IRJET- Expert Independent Bayesian Data Fusion and Decision Making Model for ...
IRJET- Expert Independent Bayesian Data Fusion and Decision Making Model for ...
 
ON THE PREDICTION ACCURACIES OF THREE MOST KNOWN REGULARIZERS : RIDGE REGRESS...
ON THE PREDICTION ACCURACIES OF THREE MOST KNOWN REGULARIZERS : RIDGE REGRESS...ON THE PREDICTION ACCURACIES OF THREE MOST KNOWN REGULARIZERS : RIDGE REGRESS...
ON THE PREDICTION ACCURACIES OF THREE MOST KNOWN REGULARIZERS : RIDGE REGRESS...
 
Gradient Boosted Regression Trees in Scikit Learn by Gilles Louppe & Peter Pr...
Gradient Boosted Regression Trees in Scikit Learn by Gilles Louppe & Peter Pr...Gradient Boosted Regression Trees in Scikit Learn by Gilles Louppe & Peter Pr...
Gradient Boosted Regression Trees in Scikit Learn by Gilles Louppe & Peter Pr...
 
An Improved Adaptive Multi-Objective Particle Swarm Optimization for Disassem...
An Improved Adaptive Multi-Objective Particle Swarm Optimization for Disassem...An Improved Adaptive Multi-Objective Particle Swarm Optimization for Disassem...
An Improved Adaptive Multi-Objective Particle Swarm Optimization for Disassem...
 

Recently uploaded

Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfnehabiju2046
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Patrick Diehl
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 

Recently uploaded (20)

Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdf
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 

A Double Lexicase Selection Operator for Bloat Control in Evolutionary Feature Construction for Regression

  • 1. A Double Lexicase Selection Operator for Bloat Control in Evolutionary Feature Construction for Regression Hengzhe Zhang Supervisor: Mengjie Zhang, Bing Xue, Qi Chen, Wolfgang Banzhaf (MSU) Victoria University of Wellington 17/07/2023
  • 2. Table of Contents 1 Background 2 Algorithm 3 Experimental Settings 4 Experimental Results 5 Conclusion 1 20
  • 4. Evolutionary Feature Construction The general idea of feature construction is to construct a set of new features {ϕ1, . . . , ϕm} to enhance the learning performance on a given dataset {{x1, y1}, . . . , {xn, yn}} compared to learning on the original features {x1, . . . , xp}. Genetic programming (GP) has been extensively employed to automatically construct features due to its flexible representation and gradient-free search mechanism. (a) Feature Construction on Linear Regression (b) New Feature Space 2 20
  • 5. Bloat Phenomenon Bloat refers to the tendency of GP solutions to grow more complex over time without improving the fitness value. Growth of program size 3 20
  • 6. Bloat Phenomenon The explanations for bloat include: Hitchhiking Defense against crossover Removal bias The nature of a program search space Regardless of the reason for bloat, it is widely acknowledged that addressing bloat can increase search efficiency and enhance the interpretability of the final model. 4 20
  • 7. Existing Bloat Control Techniques Depth Limit: Set a strict depth limit for each GP tree. Variation Operator ▶ Prune-and-Plant (PAP) ▶ Semantic Approximation (SA) Selection Operator ▶ Double Tournament Selection (DTS) ▶ Semantic Tournament Selection (TS-S) Fitness Function ▶ Tarpeian ▶ Alpha-Dominance MOGP 5 20
  • 8. Double Tournament Selection Double Tournament Selection Double Stages of DTS: Stage 1: Tournament selection, get individuals A, B Stage 2: Select the smaller one in A,B with a probability of 0.7 Advantage: Applicable for various scenarios (GPSR, GPHH) Disadvantage: May lead to reduced diversity, given that the tournament selection operator is repeatedly used. -> Lexicase Selection! 6 20
  • 9. Lexicase Selection Tournament vs Lexicase Tournament Selection produces a lot of semantically equivalent individuals Lexicase Selection preserves a very good population diversity Why lexicase selection? It is not necessary to sum up all errors as a scalar for EA methods. MSE = 1 n Pn i=1 Yi − Ŷi 2 How to perform lexicase selection? Step 1: Construct a filter MAD (et) = λ (et) = medianj etj − mediank etk Step 2: Remove bad individuals based on the filter Step 3: Construct more filters until only one individual remains 7 20
  • 11. Double Lexicase Selection Double Lexicase Selection Two Stages of DLS: Stage 1: Lexicase selection, get k individuals A, B, C, D, ...to form a candidate pool Stage 2: Roulette Wheel Selection on k individuals negatively proportionate to tree size Advantage: Applicable for every scenario (GPSR, GPHH) Fully exploit semantics because of using the lexicase selection operator 8 20
  • 13. Datasets 98 Regression datasets are used in the experiments, which are all datasets in PMLB with less than 2000 data items. The size of datasets range between 47 and 1059, and the dimension of datasets are between 2 and 124. 1XPEHURI,QVWDQFHV 1XPEHURI)HDWXUHV Properties of experimental datasets 9 20
  • 14. Baseline Methods Seven baseline bloat control methods: Depth Limit Double Tournament Selection (DTS) (ECJ 2006) Tarpeian (EuroGP 2003) Prune and plant (PAP) (ECJ 2010) TS-S (Inf. Sci. 2018) DSA (Swarm Evol. Comput. 2020) α-MOGP (TEVC) (TEVC 2022) 10 20
  • 15. Parameter Settings Parameter settings are common settings in GP. Parameter settings for all experiments. 11 20
  • 17. R2 Score Only DLS (first row), DSA and Tarpeian methods have similar or better performance on most datasets compared to the depth limit method. For the top three algorithms, the DLS method is better than the Tarpeian and DSA methods. Statistical comparison of test R2 score for different bloat control methods on 98 datasets. (“+,“∼, and “- indicate using the method in a row is better than, similar to or worse than using the method in a column.) 12 20
  • 18. Model Size DLS (first row) is a successful bloat control method, as it reduces model sizes on all datasets. When comparing the PAP, DSA, DTS and TS-S operators, the DLS operator is worse at reducing model size. However, DLS is better than the PAP, DSA, DTS and TS-S operators in terms of test R2 scores. Statistical comparison of model sizes for different bloat control methods on 98 datasets. (+,∼, and - indicate using the method in a row is better than, similar to or worse than using the method in a column.) 13 20
  • 19. Evolutionary Plots DLS operator has good effectiveness in terms of R2 scores over the whole evolution process and thus achieves good final accuracy. Depth limit cannot effectively control tree sizes. In contrast, the DLS operator can effectively control tree size to a relatively low level. 0 25 50 75 100 Generation 0.50 0.75 1.00 R 2 Score OpenML 582 0 25 50 75 100 Generation 0.8 0.9 R 2 Score OpenML 599 0 25 50 75 100 Generation 0.50 0.75 R 2 Score OpenML 618 0 25 50 75 100 Generation 0.50 0.75 R 2 Score OpenML 645 DLS αMOGP Tarpeian DTS PAP TS-S DSA DepthLimiting (a) Evolutionary plots of test R2 score for different bloat control methods. 0 25 50 75 100 Generation 5 10 Tree Size OpenML 582 0 25 50 75 100 Generation 5 10 15 Tree Size OpenML 599 0 25 50 75 100 Generation 5 10 Tree Size OpenML 618 0 25 50 75 100 Generation 5 10 Tree Size OpenML 645 DLS αMOGP Tarpeian DTS PAP TS-S DSA DepthLimiting (b) Evolutionary plots of average tree sizes for different bloat control methods. 14 20
  • 20. Overall Analysis Friedman’s rank of test R2 scores and tree sizes on 98 datasets for different bloat control methods. Only four methods, DLS, Tarpeian, DSA and αMOGP do not get worse predictive performance on R2 scores than using depth-limited methods alone. DLS operator achieves a good trade-off between test R2 scores and model size. 15 20
  • 21. Capacity of Candidate Pool Model sizes decrease with an increase in capacity. Training time will increase significantly when increasing pool capacity from 10 to 20. a (a) Statistical comparison on tree size between using a capacity of 10 and 2. a (b) Statistical comparison on tree size between using a capacity of 10 and 5. 3RRODSDFLW 7UDLQLQJ7LPH (c) Distribution of training time versus candidate pool capacity. 16 20
  • 22. Roulette wheel selection Roulette wheel selection operator is significantly better than minimum selection operator in terms of test R2 scores. The minimum selection operator favors very small individuals and thus leads to very poor predictive performance. a a (a) Statistical comparison of R2 scores using roulette instead of minimum as the selection strategy on 98 datasets. 7UHH6L]H RXQW $OJRULWKP 0LQ 5RXOHWWH (b) Distribution of final tree sizes when using roulette or minimum as the selection strategy on 98 datasets. 17 20
  • 23. Training Time ' / 6 α 0 2 * 3 7 D U S H L D Q ' 7 6 3 $ 3 7 6 6 ' 6 $ ' H S W K / L P L W 7UDLQLQJ7LPH 2SHQ0/B ' / 6 α 0 2 * 3 7 D U S H L D Q ' 7 6 3 $ 3 7 6 6 ' 6 $ ' H S W K / L P L W 7UDLQLQJ7LPH 2SHQ0/B ' / 6 α 0 2 * 3 7 D U S H L D Q ' 7 6 3 $ 3 7 6 6 ' 6 $ ' H S W K / L P L W 7UDLQLQJ7LPH 2SHQ0/B ' / 6 α 0 2 * 3 7 D U S H L D Q ' 7 6 3 $ 3 7 6 6 ' 6 $ ' H S W K / L P L W 7UDLQLQJ7LPH 2SHQ0/B ' / 6 α 0 2 * 3 7 D U S H L D Q ' 7 6 3 $ 3 7 6 6 ' 6 $ ' H S W K / L P L W 7UDLQLQJ7LPH 2SHQ0/B ' / 6 α 0 2 * 3 7 D U S H L D Q ' 7 6 3 $ 3 7 6 6 ' 6 $ ' H S W K / L P L W 7UDLQLQJ7LPH 2SHQ0/B Box plots on the training time for different bloat control methods. The time complexity of DLS is O(Cap|P|2n), where |P| is the population size, n is the number of data items and Cap is the capacity of the candidate pool. DLS is slower than Tarpeian, DTS, PAP, DSA and depth limit methods, but faster than αMOGP. 18 20
  • 24. Compatibility with Multi-objective Methods There is no significant difference between integration DLS or not in terms of R2 scores. Incorporating DLS with MO methods can significantly reduce model sizes across nearly all datasets, as compared to using MO methods alone. (a) Statistical comparison of test R2 scores for integration with DLS and MO methods. (b) Statistical comparison of model sizes for integration with DLS and MO methods. 19 20
  • 26. Conclusions By employing a double-stage selection mechanism that takes into account both the model performance and size, the lexicase selection operator can effectively control bloat. Utilizing a large candidate pool and employing roulette wheel selection are crucial for double lexicase selection. Open Source Project: Evolutionary Forest (90 GitHub Stars) 20 / 20
  • 27. Thanks for listening! Email: Hengzhe.zhang@ecs.vuw.ac.nz GitHub Project: https://github.com/hengzhe-zhang/DoubleLexicaseSelection