SlideShare a Scribd company logo
1 of 19
Download to read offline
Introduction to Machine
       Learning
                  Lecture 23
     Learning Classifier Systems

                 Albert Orriols i Puig
             http://www.albertorriols.net
             htt //       lb t i l      t
                aorriols@salle.url.edu

      Artificial Intelligence – Machine Learning
                        g                      g
          Enginyeria i Arquitectura La Salle
                 Universitat Ramon Llull
Recap of Lectures 21-22
        Value functions
                Vπ(s): Long-term reward estimation
                from s a e s following po cy π
                  o state o o        g policy
                Qπ(s,a): Long-term reward estimation
                from s a e s e ecu g ac o a
                  o state executing action
                and then following policy π
        The long term reward is a recency weighted average of
                                  recency-weighted
        the received rewards

      …r                                                                                        …
                               at rt+1          at+1 rt+2             at+2 rt+3          at+3
                    t
                          st             st+1                  st+2               st+3




                                                                                                Slide 2
Artificial Intelligence                         Machine Learning
Recap of Lectures 21-22
        Q
        Q-learning
                 g




                                             Slide 3
Artificial Intelligence   Machine Learning
Today’s Agenda

        The Origins of LCSs
                Michigan-style LCSs
                Pittsburg-style LCS
                Pitt b     t l LCSs


        Michigan-style LCSs




                                                   Slide 4
Artificial Intelligence         Machine Learning
Original Idea of LCS
        Holland’s envision: Cognitive Systems
                              g        y
                Create true artificial intelligence itself
                True intelligence requires adaptive behavior in the face of changing
                circumstances (Holland & Reitman, 1978)
                Holland s
                Holland’s vision going back to late 50s and early 60s of roving bands
                of computer programs.

             Holland’s notion of genetic search as program searching (1962)
             The free generation procedure. . . Requires the generators (and
             combinations of generators) to “shift” and “connect” at random in the
                                                shift    connect
             computer…two or more generators occupying adjacent modules (“in
             contact”) may become connected. Such connected sets of
             generators are to shift as a unit.
                    t       t hift          it


                From stimulus-response t internal states and modifiable d t t
                F      ti l            to i t   l tt       d   difi bl detectors
                and effectors
                                                                                  Slide 5
Artificial Intelligence                      Machine Learning
First LCS Implementation

   CS-1 (Holland & Reitman, 1978)


   Post-production system
   General memory containing classifiers
   Process:
           Code the situation and find in memory
           the actions that are appropriate to
           both CS-1 goal and situation
           Store in memory the consequences of
           these actions (learning)
           Generate new good productions
           (classifiers) t endure.
           (l    ifi ) to d




                          Population of classifiers    Current system knowledge

                          Performance component        Short term behavior of the system

                          Rule discovery component        Get new promising rules

                                                                                           Slide 6
Artificial Intelligence                          Machine Learning
Meanwhile, in Pitts University
        Smith’s interpretation of Holland’s GA envision
                     p

            Smith’s notion of learning as adaptive search (1980, 1983)
            LS-1: “Learns a set of heuristics, represented as production
            LS 1 “L            t fh    i ti            td        d ti
            system programs, to govern the application of a set of
            operators in performing a particular task”




                                                         Great success! LS-1 took
                                                         Waterman’s poker player to the
                                                         cleaners (not bluffing)




                                                                                Slide 7
Artificial Intelligence               Machine Learning
Two Models
        And here, two ways started: Michigan vs Pitts LCSs
                ,       y                g

                                                          Pittsburgh-style LCSs
        Michigan-style LCSs
                                                             Straight GA
                 Cognitive system
                                                             Individual = set of rules
                 Individual = rule
                                                             Solution: best individual
                 Solution: all the
                 population
                                                             Usually offline systems
                                                             U   ll ffli
                 Apportionment of credit
                 Reinforcement learning




        We focus on Michigan-style LCS
                                                                             Slide 8
Artificial Intelligence                Machine Learning
Michigan-style LCSs
             General schema


                                          Environment
                      Sensorial
                                         Action       Reward
                      state

                                                                         Online rule evaluator:
                                                                         •   XCS: Q-Learning (Sutton & Barto, 1998)
                                   Classifier 1
                                                  Learning
Any Representation:
  y   p                                                                           Uses Widrow-Hoff delta rule
                                   Classifier 2
                                                  Classifier
   production rules,
  genetic programs,                                System
                                   Classifier n
     perceptrons,
        SVMs



                                                  Rule evolution:
                                  Genetic         Typically, a GA (Holland, 75;
                                  Algorithm       Goldberg, 89) applied on the
                                                  population.



                                                                                                          Slide 9
     Artificial Intelligence                          Machine Learning
Knowledge Representation
        The knowledge representation consists of
                   g    p
                Population of classifiers
                          Usually independent of each other
                Each classifier has
                          Condition
                          C diti part C
                                    t
                          Action part A
                          Prediction
                          P di ti part P
                                     t
                          Interpreted as:
                              If condition C is satisfied and action A is executed, then P is
                                                                          executed
                              expected to be true

                Solution for a new problem
                          Get the classifiers that match the sensorial state
                          Decide which action should be used among the actions of
                          the selected classifiers
                                                                                        Slide 10
Artificial Intelligence                       Machine Learning
Condition Structures
        Condition structure depends on the types of attributes
                              p             yp
                Binary
                          Ternary encoding {0, 1 #}
                                           {0 1,

                                 If v1 is ‘0’ and v2 is ‘1’ and v3 is ‘#’ … and vn in ‘0’ then actioni



                Continuous
                          Interval-based encoding

                                 If v1 in [l1,u1] and v2 in [l2,u2] … and vn in [ln,un] then actioni
                                              u                 u                   u


                          Hyperellipsoids




                                                                                                         Slide 11
Artificial Intelligence                           Machine Learning
Condition Structures
        Condition structure depends on the types of attributes
                              p             yp
                Many other representations
                          Partial matching (Booker 1985)
                                           (Booker,

                          Default hierarchies (Holland et al., 1986)
                          Fuzzy conditions (Bonarini 2000; Valenzuela Rendón 1991; Casillas et
                                           (Bonarini,      Valenzuela-Rendón,
                          al., 2008, Orriols et al., 2009)

                          Neural-network-based encodings (Bull & O’Hara, 2002)
                          GP tree encodings with S-expressions (Lanzi, 1999)




                                                                                           Slide 12
Artificial Intelligence                               Machine Learning
Prediction
        Prediction can be:
                Scalar number
                Line
                Polynomial
                Neural network
                …


        We ill
        W will consider the initial idea: prediction is a scalar number
                   id th i iti l id          di ti i         l      b




                                                                   Slide 13
Artificial Intelligence             Machine Learning
Learning Interaction in XCS

                                                               ENVIRONMENT

                                                            Match Set [M]
         Problem
         instance
                                                          1C    A   PεF   num as ts exp
                                                                                                                Selected
                                                          3C    A   PεF   num as ts exp
                                                                                                                 action
                                                          5C    A   PεF   num as ts exp
  Population [P]                                          6C    A   PεF   num as ts exp
                                     Match set
                                                                                                                                           REWARD
                                                                      …
                                     generation
1C   A   PεF   num as ts exp
2C   A   PεF   num as ts exp
                                                                                                                   Prediction
3C   A   PεF   num as ts exp
                                                                                                                     Array
4C   A   PεF   num as ts exp
5C   A   PεF   num as ts exp
6C   A   PεF   num as ts exp                                                                               Selected action
             …
                                                                                    Action Set [A]
                                                                                               []                                  Classifier
                                                                                 1C   A   PεF   num as ts exp                     Parameters
                          Deletion                Selection, reproduction,
                                                                                 3C   A   PεF   num as ts exp                       Update
                                                        and mutation
                                                                                 5C   A   PεF   num as ts exp
                                                                                                                                (Widrow-Hoff rule)
                                                                                 6C   A   PεF   num as ts exp
                                                                                            …                                Delayed reward [A-1]
                                Genetic Algorithm
                                                               Competition                                                       Fitness Sharing
                                                                                            Action Set [A]-1
                                                               in the niche
                                                                                           1C    A   PεF   num as ts e p
                                                                                                            u        exp
                                                                                           3C    A   PεF   num as ts exp
                                                                                           5C    A   PεF   num as ts exp
                                                                                           6C    A   PεF   num as ts exp
                                                                                                        …                             Slide 14
Artificial Intelligence                                             Machine Learning
Estimate Classifier Prediction
        Three key p
                y parameters
                Prediction: What I will get if I select the action




                Error: Error on that prediction                  Does it sound familiar?
                                                                        Q-learning!



                Fitness: How good is my classifier
                             g        y




        These parameters are estimated on-line
                                                                                 Slide 15
Artificial Intelligence                  Machine Learning
Evolutionary Search
        GA applied time to time to [A]
            pp                     []
                Select two parents
                Cross th
                C     them
                Mutate them
                Introduce the two new offspring into the population
                If the population is full
                   t e popu at o s u          remove poo classifiers
                                               e o e poor c ass e s




                                                                       Slide 16
Artificial Intelligence                     Machine Learning
LCS Learning Pressures
        Parameter updates identifies most accurate classifiers

        Different pressures caused by the GA:
                S t pressure t
                             toward generality
                                  d       lit
                Set

                Fitness pressure toward highly fit classifiers

                Mutation pressure pressuring toward diversification

                Subsumption pressure toward the deletion
                of accurate, over-specialized
                classifiers




                                                                      Slide 17
Artificial Intelligence                       Machine Learning
Next Class

        Applications of LCS
        A li ti       f




                                                 Slide 18
Artificial Intelligence       Machine Learning
Introduction to Machine
       Learning
                  Lecture 23
     Learning Classifier Systems

                 Albert Orriols i Puig
             http://www.albertorriols.net
             htt //       lb t i l      t
                aorriols@salle.url.edu

      Artificial Intelligence – Machine Learning
                        g                      g
          Enginyeria i Arquitectura La Salle
                 Universitat Ramon Llull

More Related Content

What's hot

Lecture16 - Advances topics on association rules PART III
Lecture16 - Advances topics on association rules PART IIILecture16 - Advances topics on association rules PART III
Lecture16 - Advances topics on association rules PART IIIAlbert Orriols-Puig
 
New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...
New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...
New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...Albert Orriols-Puig
 
CCIA'2008: Can Evolution Strategies Improve Learning Guidance in XCS? Design ...
CCIA'2008: Can Evolution Strategies Improve Learning Guidance in XCS? Design ...CCIA'2008: Can Evolution Strategies Improve Learning Guidance in XCS? Design ...
CCIA'2008: Can Evolution Strategies Improve Learning Guidance in XCS? Design ...Albert Orriols-Puig
 
Lecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-TheoryLecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-TheoryAlbert Orriols-Puig
 
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...Albert Orriols-Puig
 
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCSHIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCSAlbert Orriols-Puig
 
GECCO'2007: Modeling XCS in Class Imbalances: Population Size and Parameter S...
GECCO'2007: Modeling XCS in Class Imbalances: Population Size and Parameter S...GECCO'2007: Modeling XCS in Class Imbalances: Population Size and Parameter S...
GECCO'2007: Modeling XCS in Class Imbalances: Population Size and Parameter S...Albert Orriols-Puig
 
HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...
HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...
HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...Albert Orriols-Puig
 
CCIA'2008: On the dimensions of data complexity through synthetic data sets
CCIA'2008: On the dimensions of data complexity through synthetic data setsCCIA'2008: On the dimensions of data complexity through synthetic data sets
CCIA'2008: On the dimensions of data complexity through synthetic data setsAlbert Orriols-Puig
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learningbutest
 
Multivariate analyses & decoding
Multivariate analyses & decodingMultivariate analyses & decoding
Multivariate analyses & decodingkhbrodersen
 
NNFL - Guru Nanak Dev Engineering College
NNFL - Guru Nanak Dev Engineering CollegeNNFL - Guru Nanak Dev Engineering College
NNFL - Guru Nanak Dev Engineering CollegeMR. VIKRAM SNEHI
 

What's hot (20)

Lecture22
Lecture22Lecture22
Lecture22
 
Lecture6 - C4.5
Lecture6 - C4.5Lecture6 - C4.5
Lecture6 - C4.5
 
Lecture16 - Advances topics on association rules PART III
Lecture16 - Advances topics on association rules PART IIILecture16 - Advances topics on association rules PART III
Lecture16 - Advances topics on association rules PART III
 
Lecture24
Lecture24Lecture24
Lecture24
 
Lecture2 - Machine Learning
Lecture2 - Machine LearningLecture2 - Machine Learning
Lecture2 - Machine Learning
 
Lecture1 - Machine Learning
Lecture1 - Machine LearningLecture1 - Machine Learning
Lecture1 - Machine Learning
 
New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...
New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...
New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...
 
CCIA'2008: Can Evolution Strategies Improve Learning Guidance in XCS? Design ...
CCIA'2008: Can Evolution Strategies Improve Learning Guidance in XCS? Design ...CCIA'2008: Can Evolution Strategies Improve Learning Guidance in XCS? Design ...
CCIA'2008: Can Evolution Strategies Improve Learning Guidance in XCS? Design ...
 
Lecture4 - Machine Learning
Lecture4 - Machine LearningLecture4 - Machine Learning
Lecture4 - Machine Learning
 
Lecture8 - From CBR to IBk
Lecture8 - From CBR to IBkLecture8 - From CBR to IBk
Lecture8 - From CBR to IBk
 
Lecture11 - neural networks
Lecture11 - neural networksLecture11 - neural networks
Lecture11 - neural networks
 
Lecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-TheoryLecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-Theory
 
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
 
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCSHIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
 
GECCO'2007: Modeling XCS in Class Imbalances: Population Size and Parameter S...
GECCO'2007: Modeling XCS in Class Imbalances: Population Size and Parameter S...GECCO'2007: Modeling XCS in Class Imbalances: Population Size and Parameter S...
GECCO'2007: Modeling XCS in Class Imbalances: Population Size and Parameter S...
 
HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...
HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...
HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...
 
CCIA'2008: On the dimensions of data complexity through synthetic data sets
CCIA'2008: On the dimensions of data complexity through synthetic data setsCCIA'2008: On the dimensions of data complexity through synthetic data sets
CCIA'2008: On the dimensions of data complexity through synthetic data sets
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Multivariate analyses & decoding
Multivariate analyses & decodingMultivariate analyses & decoding
Multivariate analyses & decoding
 
NNFL - Guru Nanak Dev Engineering College
NNFL - Guru Nanak Dev Engineering CollegeNNFL - Guru Nanak Dev Engineering College
NNFL - Guru Nanak Dev Engineering College
 

Viewers also liked

Lecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligenceLecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligenceAlbert Orriols-Puig
 
HAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasetsHAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasetsAlbert Orriols-Puig
 
Lecture14 - Advanced topics in association rules
Lecture14 - Advanced topics in association rulesLecture14 - Advanced topics in association rules
Lecture14 - Advanced topics in association rulesAlbert Orriols-Puig
 
Basics of Clustering
Basics of ClusteringBasics of Clustering
Basics of ClusteringB. Nichols
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligenceu053675
 
Data Mining: clustering and analysis
Data Mining: clustering and analysisData Mining: clustering and analysis
Data Mining: clustering and analysisDataminingTools Inc
 

Viewers also liked (11)

Lecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligenceLecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligence
 
HAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasetsHAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasets
 
Lecture14 - Advanced topics in association rules
Lecture14 - Advanced topics in association rulesLecture14 - Advanced topics in association rules
Lecture14 - Advanced topics in association rules
 
Lecture12 - SVM
Lecture12 - SVMLecture12 - SVM
Lecture12 - SVM
 
Lecture10 - Naïve Bayes
Lecture10 - Naïve BayesLecture10 - Naïve Bayes
Lecture10 - Naïve Bayes
 
Lecture13 - Association Rules
Lecture13 - Association RulesLecture13 - Association Rules
Lecture13 - Association Rules
 
Lecture5 - C4.5
Lecture5 - C4.5Lecture5 - C4.5
Lecture5 - C4.5
 
Basics of Clustering
Basics of ClusteringBasics of Clustering
Basics of Clustering
 
Decision tree Using c4.5 Algorithm
Decision tree Using c4.5 AlgorithmDecision tree Using c4.5 Algorithm
Decision tree Using c4.5 Algorithm
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
Data Mining: clustering and analysis
Data Mining: clustering and analysisData Mining: clustering and analysis
Data Mining: clustering and analysis
 

Similar to Lecture23

Evolving Rules to Solve Problems: The Learning Classifier Systems Way
Evolving Rules to Solve Problems: The Learning Classifier Systems WayEvolving Rules to Solve Problems: The Learning Classifier Systems Way
Evolving Rules to Solve Problems: The Learning Classifier Systems WayPier Luca Lanzi
 
Yann le cun
Yann le cunYann le cun
Yann le cunYandex
 
Certified Deep Learning Specialist (CDLS)
Certified Deep Learning Specialist (CDLS)Certified Deep Learning Specialist (CDLS)
Certified Deep Learning Specialist (CDLS)GICTTraining
 
CFAR-m Presentation English
CFAR-m Presentation EnglishCFAR-m Presentation English
CFAR-m Presentation Englishbusinessangeleu
 
Forever Young: A Tribute to the Grandmaster through a recount of Personal Jou...
Forever Young: A Tribute to the Grandmaster through a recount of Personal Jou...Forever Young: A Tribute to the Grandmaster through a recount of Personal Jou...
Forever Young: A Tribute to the Grandmaster through a recount of Personal Jou...Goergen Institute for Data Science
 
Introduction to soft computing
Introduction to soft computingIntroduction to soft computing
Introduction to soft computingAnkush Kumar
 
Intro to AI STRIPS Planning & Applications in Video-games Lecture1-Part2
Intro to AI STRIPS Planning & Applications in Video-games Lecture1-Part2Intro to AI STRIPS Planning & Applications in Video-games Lecture1-Part2
Intro to AI STRIPS Planning & Applications in Video-games Lecture1-Part2Stavros Vassos
 
Computation of Things - Justyna Zander, Pieter Mosterman - H+ Summit @ Harvard
Computation of Things - Justyna Zander, Pieter Mosterman - H+ Summit @ HarvardComputation of Things - Justyna Zander, Pieter Mosterman - H+ Summit @ Harvard
Computation of Things - Justyna Zander, Pieter Mosterman - H+ Summit @ HarvardHumanity Plus
 
Cognitive ability of human brain and soft computing techniques
Cognitive ability of human brain and soft computing techniquesCognitive ability of human brain and soft computing techniques
Cognitive ability of human brain and soft computing techniquesDr G R Sinha
 
Introduction to reinforcement learning
Introduction to reinforcement learningIntroduction to reinforcement learning
Introduction to reinforcement learningXander Steenbrugge
 
Human Level Artificial Intelligence
Human Level Artificial IntelligenceHuman Level Artificial Intelligence
Human Level Artificial IntelligenceRahul Chaurasia
 
Ai complete note
Ai complete noteAi complete note
Ai complete noteNajar Aryal
 
How machines can take decisions
How machines can take decisionsHow machines can take decisions
How machines can take decisionsDeepu S Nath
 
How machines can take decisions
How machines can take decisionsHow machines can take decisions
How machines can take decisionsDeepu S Nath
 
Baisc Deep Learning HandsOn
Baisc Deep Learning HandsOnBaisc Deep Learning HandsOn
Baisc Deep Learning HandsOnSean Yu
 
Raise of deep learning
Raise of deep learningRaise of deep learning
Raise of deep learningBill GU
 
Intelligent system by SHAHIN ELAHI BOX
Intelligent system by SHAHIN ELAHI BOXIntelligent system by SHAHIN ELAHI BOX
Intelligent system by SHAHIN ELAHI BOXShahin Alam
 

Similar to Lecture23 (20)

Evolving Rules to Solve Problems: The Learning Classifier Systems Way
Evolving Rules to Solve Problems: The Learning Classifier Systems WayEvolving Rules to Solve Problems: The Learning Classifier Systems Way
Evolving Rules to Solve Problems: The Learning Classifier Systems Way
 
Yann le cun
Yann le cunYann le cun
Yann le cun
 
Certified Deep Learning Specialist (CDLS)
Certified Deep Learning Specialist (CDLS)Certified Deep Learning Specialist (CDLS)
Certified Deep Learning Specialist (CDLS)
 
20181212 ibm aot
20181212 ibm aot20181212 ibm aot
20181212 ibm aot
 
CFAR-m Presentation English
CFAR-m Presentation EnglishCFAR-m Presentation English
CFAR-m Presentation English
 
Forever Young: A Tribute to the Grandmaster through a recount of Personal Jou...
Forever Young: A Tribute to the Grandmaster through a recount of Personal Jou...Forever Young: A Tribute to the Grandmaster through a recount of Personal Jou...
Forever Young: A Tribute to the Grandmaster through a recount of Personal Jou...
 
Introduction to soft computing
Introduction to soft computingIntroduction to soft computing
Introduction to soft computing
 
Intro to AI STRIPS Planning & Applications in Video-games Lecture1-Part2
Intro to AI STRIPS Planning & Applications in Video-games Lecture1-Part2Intro to AI STRIPS Planning & Applications in Video-games Lecture1-Part2
Intro to AI STRIPS Planning & Applications in Video-games Lecture1-Part2
 
Computation of Things
Computation of ThingsComputation of Things
Computation of Things
 
Computation of Things - Justyna Zander, Pieter Mosterman - H+ Summit @ Harvard
Computation of Things - Justyna Zander, Pieter Mosterman - H+ Summit @ HarvardComputation of Things - Justyna Zander, Pieter Mosterman - H+ Summit @ Harvard
Computation of Things - Justyna Zander, Pieter Mosterman - H+ Summit @ Harvard
 
Cognitive ability of human brain and soft computing techniques
Cognitive ability of human brain and soft computing techniquesCognitive ability of human brain and soft computing techniques
Cognitive ability of human brain and soft computing techniques
 
Introduction to reinforcement learning
Introduction to reinforcement learningIntroduction to reinforcement learning
Introduction to reinforcement learning
 
Human Level Artificial Intelligence
Human Level Artificial IntelligenceHuman Level Artificial Intelligence
Human Level Artificial Intelligence
 
Ai complete note
Ai complete noteAi complete note
Ai complete note
 
How machines can take decisions
How machines can take decisionsHow machines can take decisions
How machines can take decisions
 
How machines can take decisions
How machines can take decisionsHow machines can take decisions
How machines can take decisions
 
Chapter 0 syllabus 2019 20
Chapter 0  syllabus 2019 20Chapter 0  syllabus 2019 20
Chapter 0 syllabus 2019 20
 
Baisc Deep Learning HandsOn
Baisc Deep Learning HandsOnBaisc Deep Learning HandsOn
Baisc Deep Learning HandsOn
 
Raise of deep learning
Raise of deep learningRaise of deep learning
Raise of deep learning
 
Intelligent system by SHAHIN ELAHI BOX
Intelligent system by SHAHIN ELAHI BOXIntelligent system by SHAHIN ELAHI BOX
Intelligent system by SHAHIN ELAHI BOX
 

Recently uploaded

Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfUjwalaBharambe
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxUnboundStockton
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitolTechU
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementmkooblal
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentInMediaRes1
 
Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...jaredbarbolino94
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
MICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptxMICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptxabhijeetpadhi001
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 

Recently uploaded (20)

Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docx
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptx
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of management
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media Component
 
Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
MICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptxMICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptx
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 

Lecture23

  • 1. Introduction to Machine Learning Lecture 23 Learning Classifier Systems Albert Orriols i Puig http://www.albertorriols.net htt // lb t i l t aorriols@salle.url.edu Artificial Intelligence – Machine Learning g g Enginyeria i Arquitectura La Salle Universitat Ramon Llull
  • 2. Recap of Lectures 21-22 Value functions Vπ(s): Long-term reward estimation from s a e s following po cy π o state o o g policy Qπ(s,a): Long-term reward estimation from s a e s e ecu g ac o a o state executing action and then following policy π The long term reward is a recency weighted average of recency-weighted the received rewards …r … at rt+1 at+1 rt+2 at+2 rt+3 at+3 t st st+1 st+2 st+3 Slide 2 Artificial Intelligence Machine Learning
  • 3. Recap of Lectures 21-22 Q Q-learning g Slide 3 Artificial Intelligence Machine Learning
  • 4. Today’s Agenda The Origins of LCSs Michigan-style LCSs Pittsburg-style LCS Pitt b t l LCSs Michigan-style LCSs Slide 4 Artificial Intelligence Machine Learning
  • 5. Original Idea of LCS Holland’s envision: Cognitive Systems g y Create true artificial intelligence itself True intelligence requires adaptive behavior in the face of changing circumstances (Holland & Reitman, 1978) Holland s Holland’s vision going back to late 50s and early 60s of roving bands of computer programs. Holland’s notion of genetic search as program searching (1962) The free generation procedure. . . Requires the generators (and combinations of generators) to “shift” and “connect” at random in the shift connect computer…two or more generators occupying adjacent modules (“in contact”) may become connected. Such connected sets of generators are to shift as a unit. t t hift it From stimulus-response t internal states and modifiable d t t F ti l to i t l tt d difi bl detectors and effectors Slide 5 Artificial Intelligence Machine Learning
  • 6. First LCS Implementation CS-1 (Holland & Reitman, 1978) Post-production system General memory containing classifiers Process: Code the situation and find in memory the actions that are appropriate to both CS-1 goal and situation Store in memory the consequences of these actions (learning) Generate new good productions (classifiers) t endure. (l ifi ) to d Population of classifiers Current system knowledge Performance component Short term behavior of the system Rule discovery component Get new promising rules Slide 6 Artificial Intelligence Machine Learning
  • 7. Meanwhile, in Pitts University Smith’s interpretation of Holland’s GA envision p Smith’s notion of learning as adaptive search (1980, 1983) LS-1: “Learns a set of heuristics, represented as production LS 1 “L t fh i ti td d ti system programs, to govern the application of a set of operators in performing a particular task” Great success! LS-1 took Waterman’s poker player to the cleaners (not bluffing) Slide 7 Artificial Intelligence Machine Learning
  • 8. Two Models And here, two ways started: Michigan vs Pitts LCSs , y g Pittsburgh-style LCSs Michigan-style LCSs Straight GA Cognitive system Individual = set of rules Individual = rule Solution: best individual Solution: all the population Usually offline systems U ll ffli Apportionment of credit Reinforcement learning We focus on Michigan-style LCS Slide 8 Artificial Intelligence Machine Learning
  • 9. Michigan-style LCSs General schema Environment Sensorial Action Reward state Online rule evaluator: • XCS: Q-Learning (Sutton & Barto, 1998) Classifier 1 Learning Any Representation: y p Uses Widrow-Hoff delta rule Classifier 2 Classifier production rules, genetic programs, System Classifier n perceptrons, SVMs Rule evolution: Genetic Typically, a GA (Holland, 75; Algorithm Goldberg, 89) applied on the population. Slide 9 Artificial Intelligence Machine Learning
  • 10. Knowledge Representation The knowledge representation consists of g p Population of classifiers Usually independent of each other Each classifier has Condition C diti part C t Action part A Prediction P di ti part P t Interpreted as: If condition C is satisfied and action A is executed, then P is executed expected to be true Solution for a new problem Get the classifiers that match the sensorial state Decide which action should be used among the actions of the selected classifiers Slide 10 Artificial Intelligence Machine Learning
  • 11. Condition Structures Condition structure depends on the types of attributes p yp Binary Ternary encoding {0, 1 #} {0 1, If v1 is ‘0’ and v2 is ‘1’ and v3 is ‘#’ … and vn in ‘0’ then actioni Continuous Interval-based encoding If v1 in [l1,u1] and v2 in [l2,u2] … and vn in [ln,un] then actioni u u u Hyperellipsoids Slide 11 Artificial Intelligence Machine Learning
  • 12. Condition Structures Condition structure depends on the types of attributes p yp Many other representations Partial matching (Booker 1985) (Booker, Default hierarchies (Holland et al., 1986) Fuzzy conditions (Bonarini 2000; Valenzuela Rendón 1991; Casillas et (Bonarini, Valenzuela-Rendón, al., 2008, Orriols et al., 2009) Neural-network-based encodings (Bull & O’Hara, 2002) GP tree encodings with S-expressions (Lanzi, 1999) Slide 12 Artificial Intelligence Machine Learning
  • 13. Prediction Prediction can be: Scalar number Line Polynomial Neural network … We ill W will consider the initial idea: prediction is a scalar number id th i iti l id di ti i l b Slide 13 Artificial Intelligence Machine Learning
  • 14. Learning Interaction in XCS ENVIRONMENT Match Set [M] Problem instance 1C A PεF num as ts exp Selected 3C A PεF num as ts exp action 5C A PεF num as ts exp Population [P] 6C A PεF num as ts exp Match set REWARD … generation 1C A PεF num as ts exp 2C A PεF num as ts exp Prediction 3C A PεF num as ts exp Array 4C A PεF num as ts exp 5C A PεF num as ts exp 6C A PεF num as ts exp Selected action … Action Set [A] [] Classifier 1C A PεF num as ts exp Parameters Deletion Selection, reproduction, 3C A PεF num as ts exp Update and mutation 5C A PεF num as ts exp (Widrow-Hoff rule) 6C A PεF num as ts exp … Delayed reward [A-1] Genetic Algorithm Competition Fitness Sharing Action Set [A]-1 in the niche 1C A PεF num as ts e p u exp 3C A PεF num as ts exp 5C A PεF num as ts exp 6C A PεF num as ts exp … Slide 14 Artificial Intelligence Machine Learning
  • 15. Estimate Classifier Prediction Three key p y parameters Prediction: What I will get if I select the action Error: Error on that prediction Does it sound familiar? Q-learning! Fitness: How good is my classifier g y These parameters are estimated on-line Slide 15 Artificial Intelligence Machine Learning
  • 16. Evolutionary Search GA applied time to time to [A] pp [] Select two parents Cross th C them Mutate them Introduce the two new offspring into the population If the population is full t e popu at o s u remove poo classifiers e o e poor c ass e s Slide 16 Artificial Intelligence Machine Learning
  • 17. LCS Learning Pressures Parameter updates identifies most accurate classifiers Different pressures caused by the GA: S t pressure t toward generality d lit Set Fitness pressure toward highly fit classifiers Mutation pressure pressuring toward diversification Subsumption pressure toward the deletion of accurate, over-specialized classifiers Slide 17 Artificial Intelligence Machine Learning
  • 18. Next Class Applications of LCS A li ti f Slide 18 Artificial Intelligence Machine Learning
  • 19. Introduction to Machine Learning Lecture 23 Learning Classifier Systems Albert Orriols i Puig http://www.albertorriols.net htt // lb t i l t aorriols@salle.url.edu Artificial Intelligence – Machine Learning g g Enginyeria i Arquitectura La Salle Universitat Ramon Llull