Genetic Algorithm
for Variable Selection
Jennifer Pittman
ISDS
Duke University
Genetic Algorithms
Step by Step
Example: Protein Signature Selection in Mass Spectrometry
[Figure: mass spectrum, relative intensity vs. molecular weight. Source: http://www.uni-mainz.de/~frosc000/fbg_po3.html]
Genetic Algorithm (Holland)
• heuristic method based on ‘survival of the fittest’
• in each iteration (generation), possible solutions (individuals) are represented as strings of numbers
• useful when the search space is very large or too complex for analytic treatment
Example individuals (24-bit strings):
00010101 00111010 11110000
00010001 00111011 10100101
00100100 10111001 01111000
11000101 01011000 01101010
The first string decodes to 3021 3058 3240 (each byte read as an integer, plus 3000).
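The mapping between the bit strings and the numbers on this slide can be sketched in a few lines of Python. This is only an illustration: the offset of 3000 and the 8-bit width are read off the slide's own example (00010101 is 21, and 3000 + 21 = 3021), and the function names `decode` and `encode` are hypothetical.

```python
# Sketch only: genotype (bit string) <-> phenotype (three numeric values).
# OFFSET and BITS_PER_VALUE are assumptions inferred from the slide's example.
OFFSET = 3000
BITS_PER_VALUE = 8

def decode(genotype: str) -> list[int]:
    """Turn a string such as '00010101 00111010 11110000' into [3021, 3058, 3240]."""
    bits = genotype.replace(" ", "")
    chunks = [bits[i:i + BITS_PER_VALUE] for i in range(0, len(bits), BITS_PER_VALUE)]
    return [OFFSET + int(chunk, 2) for chunk in chunks]

def encode(values: list[int]) -> str:
    """Inverse of decode: each value becomes one zero-padded 8-bit group."""
    return " ".join(format(v - OFFSET, "08b") for v in values)

print(decode("00010101 00111010 11110000"))  # [3021, 3058, 3240]
print(encode([3021, 3058, 3240]))            # 00010101 00111010 11110000
```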
Flowchart of GA
• individuals are allowed to reproduce (selection), cross over, and mutate
• all individuals in the population are evaluated by a fitness function
[Figure: GA flowchart. Sources: © http://www.spectroscopynow.com; http://ib-poland.virtualave.net/ee/genetic1/3geneticalgorithms.htm]
Initialization (a simplified example)
• candidate proteins correspond to 256 mass spectrometry values, 3000-3255 m/z
• assume the optimal signature contains 3 peptides, represented by their m/z values in binary encoding (8 bits per value, since 2^8 = 256, so L = 3 × 8 = 24 bits)
• population size M ≈ L/2, where L is the signature length
Initial Population (M = 12, L = 24); strings shown on the slide:
00010101 00111010 11110000
00010001 00111011 10100101
00100100 10111001 01111000
11000101 01011000 01101010
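A minimal sketch of this initialization step, assuming individuals are drawn uniformly at random (the slides do not say how the initial strings were generated). The helper names are hypothetical; the sizes M = L/2 and L = 24 come from the slide.

```python
import random

def random_individual(length: int, rng: random.Random) -> str:
    """One chromosome: a string of `length` random bits."""
    return "".join(rng.choice("01") for _ in range(length))

def init_population(length: int = 24, rng=None) -> list[str]:
    """Population of M = L/2 random individuals, following the slide's sizing rule."""
    rng = rng or random.Random()
    size = length // 2
    return [random_individual(length, rng) for _ in range(size)]

population = init_population(24, random.Random(0))  # seed fixed only for repeatability
print(len(population), len(population[0]))          # 12 24
```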
Searching
• the search space is defined by all possible encodings of solutions
• selection, crossover, and mutation perform a ‘pseudo-random’ walk through the search space
• operations are non-deterministic yet directed
[Figure: Phenotype Distribution. Source: http://www.ifs.tuwien.ac.at/~aschatt/info/ga/genetic.html]
Evaluation and Selection
• evaluate the fitness of each solution in the current population (e.g., ability to classify/discriminate) [involves genotype-phenotype decoding]
• individuals are selected for survival based on a probabilistic function of fitness
• may include an elitist step to ensure survival of the fittest individual
• on average, the mean fitness of the individuals increases from one generation to the next
[Figure: Roulette Wheel Selection. ©http://www.softchitech.com/ec_intro_html]
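Roulette wheel selection can be sketched as follows: each individual owns a slice of the wheel proportional to its fitness, and each spin picks the slice it lands in. The function name and signature are hypothetical, and the fitness values in the usage lines are the four from the worked example later in the deck. The standard library call `random.choices(population, weights=fitness, k=n)` has the same effect; the explicit loop is shown only to make the wheel visible.

```python
import random

def roulette_select(population, fitness, n, rng=None):
    """Draw n individuals with probability proportional to fitness (with replacement)."""
    rng = rng or random.Random()
    total = sum(fitness)
    picks = []
    for _ in range(n):
        spin = rng.uniform(0, total)           # where the wheel stops
        running = 0.0
        for individual, f in zip(population, fitness):
            running += f                       # right edge of this individual's slice
            if spin < running:
                picks.append(individual)
                break
        else:
            picks.append(population[-1])       # guard against floating-point round-off
    return picks

population = ["ind1", "ind2", "ind3", "ind4"]
fitness = [0.67, 0.23, 0.45, 0.94]
print(roulette_select(population, fitness, 4, random.Random(0)))
```

An elitist step, when used, would simply copy the highest-fitness individual into the next generation regardless of the spins.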
Crossover
• combine two individuals to create new individuals for possible inclusion in the next generation
• main operator for local search (looking close to existing solutions)
• perform each crossover with probability pc, typically between 0.5 and 0.8
• crossover points are selected at random
• individuals not crossed are carried over into the population
Type          Initial Strings               Offspring
Single-Point  11000101 01011000 01101010    11000101 01011001 01111000
              00100100 10111001 01111000    00100100 10111000 01101010
Two-Point     11000101 01011000 01101010    11000101 01111001 01101010
              00100100 10111001 01111000    00100100 10011000 01111000
Uniform       11000101 01011000 01101010    01000101 01111000 01111010
              00100100 10111001 01111000    10100100 10011001 01101000
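The three crossover types can be sketched as plain string operations. The cut positions used in the usage lines (13 for single-point, 10 and 16 for two-point) are not stated on the slide; they are the positions that reproduce its offspring strings. Function names are hypothetical, and `maybe_cross` assumes single-point crossover for the pc gate.

```python
import random

def single_point(a, b, point):
    """Swap everything after one cut position."""
    return a[:point] + b[point:], b[:point] + a[point:]

def two_point(a, b, lo, hi):
    """Swap only the segment between two cut positions."""
    return a[:lo] + b[lo:hi] + a[hi:], b[:lo] + a[lo:hi] + b[hi:]

def uniform(a, b, rng):
    """Decide independently for every bit which child receives which parent's bit."""
    pairs = [(x, y) if rng.random() < 0.5 else (y, x) for x, y in zip(a, b)]
    return "".join(p[0] for p in pairs), "".join(p[1] for p in pairs)

def maybe_cross(a, b, pc, rng):
    """Cross with probability pc at a random point; otherwise carry parents over."""
    if rng.random() >= pc:
        return a, b
    return single_point(a, b, rng.randrange(1, len(a)))

A = "11000101 01011000 01101010".replace(" ", "")
B = "00100100 10111001 01111000".replace(" ", "")
print(single_point(A, B, 13))   # the slide's single-point offspring
print(two_point(A, B, 10, 16))  # the slide's two-point offspring
print(uniform(A, B, random.Random(0)))
```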
Mutation
• each component (bit) of every individual is modified with probability pm
• main operator for global search (looking at new areas of the search space)
• individuals not mutated are carried over into the population
• pm is usually small, between 0.001 and 0.01; rule of thumb: pm = 1/(number of bits in chromosome)
[Figure. ©http://www.softchitech.com/ec_intro_html]
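For binary strings, "modified" is usually taken to mean flipping the bit; that reading is an assumption here, as is the function name. A minimal sketch:

```python
import random

def mutate(individual: str, pm: float, rng=None) -> str:
    """Flip each bit independently with probability pm."""
    rng = rng or random.Random()
    out = []
    for bit in individual:
        if rng.random() < pm:
            out.append("1" if bit == "0" else "0")   # flip
        else:
            out.append(bit)                          # keep
    return "".join(out)

chromosome = "000101010011101011110000"
pm = 1 / len(chromosome)        # rule of thumb from the slide: 1 / number of bits
print(mutate(chromosome, pm, random.Random(0)))
```

With pm = 1/L, one bit per chromosome is flipped on average, so many individuals pass through unchanged.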
     genotype                      phenotype         fitness
1    00010101 00111010 11110000    3021 3058 3240    0.67
2    00010001 00111011 10100101    3017 3059 3165    0.23
3    00100100 10111001 01111000    3036 3185 3120    0.45
4    11000101 01011000 01101010    3197 3088 3106    0.94
[Figure: roulette wheel with sectors 1-4]
selection (individuals 1, 3, 4, 4 are drawn):
00010101 00111010 11110000
00100100 10111001 01111000
11000101 01011000 01101010
11000101 01011000 01101010
after selection:
00010101 00111010 11110000
00100100 10111001 01111000
11000101 01011000 01101010
11000101 01011000 01101010
one-point crossover (p=0.6); random draws 0.3 (first pair, crossed) and 0.8 (second pair, not crossed):
00010101 00111001 01111000
00100100 10111010 11110000
11000101 01011000 01101010
11000101 01011000 01101010
mutation (p=0.05):
00010101 00110001 01111010
10100110 10111000 11110000
11000101 01111000 01101010
11010101 01011000 00101010
starting generation
genotype                      phenotype         fitness
00010101 00111010 11110000    3021 3058 3240    0.67
00010001 00111011 10100101    3017 3059 3165    0.23
00100100 10111001 01111000    3036 3185 3120    0.45
11000101 01011000 01101010    3197 3088 3106    0.94
next generation
genotype                      phenotype         fitness
00010101 00110001 01111010    3021 3049 3122    0.81
10100110 10111000 11110000    3166 3184 3240    0.77
11000101 01111000 01101010    3197 3120 3106    0.42
11010101 01011000 00101010    3213 3088 3042    0.98
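The generation step walked through above (evaluate, select, cross over, mutate) can be put together as one loop. This is a sketch under loud assumptions: the real fitness in the slides is a classification score that cannot be reproduced here, so `toy_fitness` and its `TARGET` signature are invented stand-ins, and the elitist step is the optional one mentioned on the selection slide.

```python
import random

OFFSET, BITS, N_VALUES = 3000, 8, 3
LENGTH = BITS * N_VALUES
TARGET = [3042, 3120, 3213]   # hypothetical signature, used only by the toy fitness

def decode(ind):
    return [OFFSET + int(ind[i:i + BITS], 2) for i in range(0, LENGTH, BITS)]

def toy_fitness(ind):
    # Stand-in fitness: 1.0 at TARGET, smaller as the decoded values drift away.
    error = sum(abs(v - t) for v, t in zip(decode(ind), TARGET))
    return 1.0 / (1.0 + error)

def next_generation(pop, pc, pm, rng):
    """One generation; assumes an even population size so parents pair up."""
    scores = [toy_fitness(ind) for ind in pop]
    elite = pop[scores.index(max(scores))]
    parents = rng.choices(pop, weights=scores, k=len(pop))   # roulette wheel selection
    children = []
    for a, b in zip(parents[0::2], parents[1::2]):
        if rng.random() < pc:                                 # one-point crossover
            point = rng.randrange(1, LENGTH)
            a, b = a[:point] + b[point:], b[:point] + a[point:]
        children += [a, b]
    # Bit-flip mutation: "10"[int(bit)] maps '0' -> '1' and '1' -> '0'.
    children = ["".join(bit if rng.random() >= pm else "10"[int(bit)] for bit in c)
                for c in children]
    children[0] = elite                                       # elitist step
    return children

rng = random.Random(1)
pop = ["".join(rng.choice("01") for _ in range(LENGTH)) for _ in range(12)]
best_history = []
for _ in range(50):
    best_history.append(max(toy_fitness(ind) for ind in pop))
    pop = next_generation(pop, pc=0.6, pm=1 / LENGTH, rng=rng)
print(best_history[0], best_history[-1])
```

Because the best individual is always copied forward, the best fitness per generation cannot decrease; without the elitist step it may fluctuate, as the 0.45 to 0.42 drop in the table above suggests.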
[Figure: GA Evolution. Accuracy in percent vs. generations. Source: http://www.sdsc.edu/skidl/projects/bio-SKIDL/]
[Figure: genetic algorithm learning. Fitness criteria vs. generations. Source: http://www.demon.co.uk/apl385/apl96/skom.htm]
[Figure: Fitness value (scaled) vs. iteration]
References
• Holland, J. (1992), Adaptation in natural and artificial systems, 2nd Ed. Cambridge: MIT Press.
• Davis, L. (Ed.) (1991), Handbook of genetic algorithms. New York: Van Nostrand Reinhold.
• Goldberg, D. (1989), Genetic algorithms in search, optimization and machine learning. Addison-Wesley.
• Fogel, D. (1995), Evolutionary computation: Towards a new philosophy of machine intelligence. Piscataway: IEEE Press.
• Bäck, T., Hammel, U., and Schwefel, H. (1997), ‘Evolutionary computation: Comments on the history and the current state’, IEEE Trans. on Evol. Comp. 1(1).
Online Resources
• http://www.spectroscopynow.com
• http://www.cs.bris.ac.uk/~colin/evollect1/evollect0/index.htm
• IlliGAL (http://www-illigal.ge.uiuc.edu/index.php3)
• GAlib (http://lancet.mit.edu/ga/)
[Figure: Percent improvement over hillclimber vs. iteration]
Schema and GAs
• a schema is a template representing a set of bit strings, with * matching either bit:
1**100*1 { 10010011, 11010001, 10110001, 11110011, … }
• every schema s has an estimated average fitness f(s):
$E_{t+1} \approx k \cdot [f(s)/f(pop)] \cdot E_t$
(E_t: expected number of individuals matching s in generation t)
• schema s receives exponentially increasing or decreasing numbers of individuals depending upon the ratio f(s)/f(pop)
• above-average schemas tend to spread through the population while below-average schemas disappear (simultaneously for all schemas: ‘implicit parallelism’)
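To make the schema idea concrete, the sketch below checks which strings match a schema and estimates f(s) as the mean fitness of the matching individuals in a population. The helper names are hypothetical; the population and fitness values are the four individuals from the worked example, and the 24-character schema starting with 0001 is chosen purely for illustration.

```python
from itertools import product

def matches(schema: str, bits: str) -> bool:
    """True if bits agrees with schema at every non-* position."""
    return len(schema) == len(bits) and all(s in ("*", b) for s, b in zip(schema, bits))

def expand(schema: str) -> list[str]:
    """All bit strings represented by the schema (2 ** number of * symbols)."""
    out = []
    for fill in product("01", repeat=schema.count("*")):
        it = iter(fill)
        out.append("".join(next(it) if c == "*" else c for c in schema))
    return out

def schema_fitness(schema, population, fitness):
    """Estimated f(s): mean fitness of individuals matching the schema, or None."""
    hits = [f for ind, f in zip(population, fitness) if matches(schema, ind)]
    return sum(hits) / len(hits) if hits else None

population = ["000101010011101011110000", "000100010011101110100101",
              "001001001011100101111000", "110001010101100001101010"]
fitness = [0.67, 0.23, 0.45, 0.94]

print(len(expand("1**100*1")))                  # 8
schema = "0001" + "*" * 20                      # illustrative schema, not from the slides
f_s = schema_fitness(schema, population, fitness)
f_pop = sum(fitness) / len(fitness)
print(f_s / f_pop)                              # below 1: schema expected to shrink
```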
[Figure: MALDI-TOF. ©www.protagen.de/pics/main/maldi2.html]
