SlideShare a Scribd company logo
Artificial Data Sets based on
Knowledge Generators: Analysis of
         g                   y
  Learning Algorithms Efficiency

                       Joaquin Rios-Boutin
                               Rios Boutin
                        Albert Orriols-Puig
                    Josep Maria Garrell Guiu
                    Josep-Maria Garrell-Guiu

             Grup de Recerca en Sistemes Intel·ligents
      Enginyeria i Arquitectura La Salle Universitat Ramon Llull
                                     Salle,
               {jrios, aorriols, josepmg}@salle.url.edu
Motivation

       What is the Holy Grail of Machine Learning?
       – Find the right Learning Algorithm to every Problem
       – Real Problems are black boxes
          • We don’t know which knowledge is contained

                                                                            DI
          • We can’t answer:
              – When to stop training?
              – How much efficient is the learning process?



       – Artificial Problems:
                                                                            DI
                                                                     K
          • Knowledge-driven
          • Property-driven
                                                                         Complex.Met.

                                Enginyeria i Arquitectura la Salle                 Slide 2
GRSI
Framework

       Machine Learning as a Communication System



                       Communication Chanel
                                                                 Learning
       Environment.
                                                                Algorithm.
                                                                Al   ith
                               Data Set
        Knowledge
                                                                 Learned
       to be learned
                                                                Knowledge




                           Enginyeria i Arquitectura la Salle                Slide 3
GRSI
Outline



        1. Algorithm Evaluation Methodology Definition
        1 Al    ih E l      i   M hdl       D fi i i

        2.
        2 Methodology Implementation

        3. Experiment Description

        4. Results and Analysis

        5. Conclusions and Further Work




                            Enginyeria i Arquitectura la Salle   Slide 4
GRSI
1 Algorithm Evaluation Process
           g

                 Process Execution and Control DB

                                            Problem
             Sampling   Sampling                                                                                       Algorithm
                                            Data Set
              Method      Size                                                                                        Parameters




                                                                                                                                              Accuracy
                  DI                                                                                                    Learning
               Generation                                                                                               Algorithm   1.2
                                                                                                                                                            DS1kMulplx6m1


                                                                                                                                     1


                                                                                                                                    0.8


                                                                                                                                    0.6


                                                                                                                                    0.4


                                                                                                                                    0.2


                                                                                                                                     0
                                                                                                                                          0   2000   4000      6000    8000   10000




                  Knowledge
                  Comparison                                    1.2
                                                                                        DS1kMulplx6m1


                                                                 1




                                                                                                                   Optimal
                                                                                                                    p
                                                                0.8


                                                                0.6




                                                                                                                  Population
                                                                0.4


                                                                0.2


                                                                 0
                                                                      0   2000   4000      6000    8000   10000




                                   Enginyeria i Arquitectura la Salle                                                                                        Slide 5
GRSI
1 Algorithm Evaluation Process Dimensions




              100000
               10000
     mpling
          g




                1000
     Size




                 100
   Sam
     S




                                                                            Apn
                                                                            A
                  10                                                          Alg.P
                   1                                                  AP1     aram
                       SRS    SIS           RRS                 RIS             .
                        Sampling Methods

                             To each Problem
                                Enginyeria i Arquitectura la Salle                    Slide 6
GRSI
Outline



        1. Algorithm Evaluation Methodology Definition
        1 Al    ih E l      i   M hdl       D fi i i

        2.
        2 Methodology Implementation

        3. Experiment Description

        4. Results and Analysis

        5. Conclusions and Further Work




                            Enginyeria i Arquitectura la Salle   Slide 7
GRSI
2 Knowledge Representation
                g    p

                              Condition                                 Class/Action


                  a11   a12              a1j                        a1m    C1     Rule1



                  ai1   ai2              aij                        aim    Ci
        Rule
        Set


                  an1   an2              anj                        anm    Cn



               aij={0,1, #} CiєN
                   {0,1,

                                   Enginyeria i Arquitectura la Salle                     Slide 8
GRSI
2 Sampling Methods
            pg

       SRS Sequential Rule Selection                                   SIS Sequential Instance Selection
                                                                                                     Sequential #
                                                                                                     substitution
                                                                                               1st
                                                          2nd
                      Random # substitution
                2nd


         1st




       RRS Random Rule Selection                                        RIS Random Instance Selection
                      Random # substitution                                                            Sequential #
                2nd
                                                                                                 1st   substitution
                                                               2nd
         1st




                                              Enginyeria i Arquitectura la Salle                               Slide 9
GRSI
2 Problems to learn and Learning Algorithm

               Mux6 Mux11                                                  Parity5
               0   0   #   #   #       0   0                               0    0   0   0   0   0
               0   0   #   #   #       1   1                               0    0   0   0   1   1
               0   1   #   #   0       #   0                               0    0   0   1   0   1
               0   1   #   #   1       #   1                               0    0   0   1   1   0


               1   1   0   #   #       #   0                               1    1   1   1   0   0


                                               XCS
               1   1   1   #   #       #   1                               1    1   1   1   1   1




                                                                           Parity5-3
           Position5 Position11
           0       0   0   0       0       0                               0    0   0   0   0   #   #   #   0
           0       0   0   0       1       1                               0    0   0   0   1   #   #   #   1
           0       0   0   1       #       2                               0    0   0   1   0   #   #   #   1
           0       0   1   #       #       3                               0    0   0   1   1   #   #   #   0


           1       #   #   #       #       5                               1    1   1   1   0   #   #   #   0
                                                                           1    1   1   1   1   #   #   #   1


                                               Enginyeria i Arquitectura la Salle                               Slide 10
GRSI
2 Problem Properties
                    p


        Optimal Rule Sets
         – Complete
         – Non overlapped
         – Irreducible
        Why?
         – Simple structure of knowledge complexity
         –V
          Very k
               known artificial problems
                       tifi i l    bl




                                Enginyeria i Arquitectura la Salle   Slide 11
GRSI
Outline



        1. Algorithm Evaluation Methodology Definition
        1 Al    ih E l      i   M hdl       D fi i i

        2.
        2 Methodology Implementation

        3. Experiment Description

        4. Results and Analysis

        5. Conclusions and Further Work




                            Enginyeria i Arquitectura la Salle   Slide 12
GRSI
3 Sampling and Learning Iteration
            pg               g


         {
         {Sampling Iteration} Problem {Training Iteration}
              pg               }          {      g       }
             Sampling Sampling               Algorithm
                                 Data Set
              Method    Size                Parameters




                                                                                                                               Accuracy
                 DI                                                                                      Learning
              Genaration                                                                                 Algorithm   1.2
                                                                                                                                             DS1kMulplx6m1


                                                                                                                      1


                                                                                                                     0.8


                                                                                                                     0.6


                                                                                                                     0.4


                                                                                                                     0.2


                                                                                                                      0
                                                                                                                           0   2000   4000      6000    8000   10000




               Knowledge
               Comparison                 1.2
                                                                  DS1kMulplx6m1


                                           1




                                                                                              Optimal
                                          0.8




                                                                                            Population
                                                                                            P    l ti
                                          0.6


                                          0.4


                                          0.2


                                           0
                                                0   2000   4000      6000    8000   10000




                           Enginyeria i Arquitectura la Salle                                                                           Slide 13
GRSI
3 Output Results and Iteration Reduction
            p


        Output Results
         – 2 Plots to every Problem Sampling Method Sampling Size and
                            Problem,         Method,
           Algorithm Parameters.                 1.2
                                                                        DS1kMulplx6m1

             • Optimal Population                1


             • Accuracy                         0.8

        Iteration R d ti
        It ti Reduction                         0.6

         – SIS Pure sequential
                                                0.4

             • No Sampling Iteration Needed
                                                0.2
         – Problems without “don’t care”
                                                 0
             • SRS=SIS and RRS=RIS                    0   2000   4000      6000    8000   10000




                                                                                                  Slide 14
GRSI
3 Experimental Parameters
           p


        Number of Problems = 6
        Number f Sampling M th d = 4
        N b of S     li Methods
        Number of different Sampling Sizes = 4
        Number of different Algorithms Parameters Sets = 2
        Number f Sampling It ti
        N b of S     li Iterations = 10
        Number of Training Iterations = 10
        Number of Data Sets Generated = 744
        Number of Training Process = 14880



                                                             Slide 15
GRSI
Outline



        1. Algorithm Evaluation Methodology Definition
        1 Al    ih E l      i   M hdl       D fi i i

        2.
        2 Methodology Implementation

        3. Experiment Description

        4. Results and Analysis

        5. Conclusions and Further Work




                            Enginyeria i Arquitectura la Salle   Slide 16
GRSI
Problem Dimension

         Sampling M. = RIS Sampling Size = 1000 Learning Alg. Param. = pDNC 0.2
                  M                                      Alg Param          02
                    1.2
       Mux6                                                            1.1
                                            DS1kMulplx6m4
                                                                                               DS1kParity5m4
                                                                                                                           Parity5
                                                                        1
                     1

                                                                       0.9
                    0.8
                                                                       0.8
                    0.6
                                                                       0.7

                                                                       0.6
                    0.4

                                                                       0.5
                    0.2
                                                                       0.4
                     0
                                                                       0.3

                   -0.2                                                0.2
                          0   2000   4000      6000    8000   10000          0   2000   4000      6000    8000    10000
                   1.05
                                        DS1kMulplx6m4                 1.05
                                                                                               DS1kParity5m4
                                                                        1
                     1
                                                                      0.95

                                                                       0.9
                   0.95

                                                                      0.85
                    0.9
                                                                       0.8

                                                                      0.75
                   0.85
                                                                       0.7

                                                                      0.65
                    0.8
                                                                       0.6

                   0.75                                               0.55
                          0   2000   4000      6000    8000   10000          0   2000   4000      6000     8000    10000




                                                                                                                                     Slide 17
GRSI
Sampling Method Dimension
           pg

          Problem = Position5 Sampling Size = 1000 Learning Alg. Param. = pDNC 0.2
                                                            Alg Param          02
SRS Sequential Rule Selection                                                      RIS Random Instance Selection
             1.2                                              0.9
                                DS1kPosition5m1                                   DS1kPosition5m4
                                                              0.8
              1
                                                              0.7

                                                              0.6
             0.8

                                                              0.5
             0.6
                                                              0.4

                                                              0.3
             0.4
                                                              0.2

                                                              0.1
             0.2

                                                               0
              0
                                                             -0.1
                   0   2000   4000   6000     8000   10000
                                                                    0   2000   4000    6000    8000    10000

             1.1                                             1.1
                                 DS1kPosition5m1                                 DS1kPosition5m4

              1                                               1

             0.9                                             0.9

             0.8                                             0.8

             0.7                                             0.7

             0.6
                                                             0.6

             0.5
                                                             0.5

             0.4
                                                             0.4

             0.3
                                                             0.3
                   0   2000   4000    6000    8000   10000
                                                                   0    2000   4000    6000     8000   10000
                                                                                                               Slide 18
 GRSI
Sampling Size Dimension
          pg

        Problem = Parity5 Sampling M.= RIS Learning Alg. Param. = pDNC 0.2
                                   M=               Alg Param          02
               1.1
                                                                  1.1
                                       DS100Parity5m4
        100                                                                               DS10kParity5m4
                                                                                                                     10000
                1
                                                                   1
               0.9
                                                                  0.9
               0.8
                                                                  0.8
               0.7
                                                                  0.7
               0.6
                                                                  0.6
               0.5
                                                                  0.5
               0.4

                                                                  0.4
               0.3

                                                                  0.3
               0.2

               0.1                                                0.2
                     0       2000   4000   6000    8000   10000         0       2000   4000   6000    8000   10000

                                                                  1.05
               1.05
                                                                                          DS10kParity5m4
                                       DS100Parity5m4
                                                                        1
                     1
                                                                  0.95
               0.95
                                                                   0.9
                0.9
                                                                  0.85
               0.85
                                                                   0.8

                0.8                                               0.75

                                                                   0.7
               0.75
                                                                  0.65
                0.7
                                                                   0.6
               0.65
                                                                  0.55
                                                                            0   2000   4000   6000    8000   10000
                0.6
                         0   2000   4000   6000    8000   10000

                                                                                                                             Slide 19
GRSI
Parameter Algorithm Dimension
                   g

        Problem = Mux6 Sampling M. = RIS Sampling Size = 1000
                                M
                1                                                 1.2
                                       DS1kMulplx6m4                                            DS1kMulplx6m4

  pDNC 0.8    0.9
                                                                                                                             pDNC 0.2
                                                                    1
              0.8

              0.7                                                 0.8
              0.6
                                                                  0.6
              0.5

              0.4
                                                                  0.4
              0.3

                                                                  0.2
              0.2

              0.1
                                                                    0
                0

              -0.1                                                -0.2
                     0   2000   4000      6000    8000    10000          0       2000    4000      6000     8000    10000


             1.05
                                   DS1kMulplx6m4                   1.05
                                                                                                DS1kMulplx6m4
                1

             0.95                                                        1
              0.9

             0.85                                                  0.95

              0.8
                                                                    0.9
             0.75

              0.7
                                                                   0.85
             0.65

              0.6
                                                                    0.8
             0.55

              0.5
                                                                   0.75
                     0   2000   4000     6000    8000    10000
                                                                             0    2000    4000       6000    8000    10000




                                                                                                                                        Slide 20
GRSI
Outline



        1. Algorithm Evaluation Methodology Definition
        1 Al    ih E l      i   M hdl       D fi i i

        2.
        2 Methodology Implementation

        3. Experiment Description

        4. Results and Analysis

        5. Conclusions and Further Work




                            Enginyeria i Arquitectura la Salle   Slide 21
GRSI
Conclusions and Further Work

        Conclusions
         – Automatic Learning Algorithm Analyzer based on Artificial Data Sets
         – Four dimensions comparisons
         – Methodology Implementation, Experiment and Results Analysis

        Further Work
         – Non ORS Problems
         – R l Att ib t
           Real Attributes
         – Sampling Methods based on distance or transition matrix
         – Multi Step Problems
                    p
         – Different Learning Algorithms
         – Different Knowledge representations
         – Knowledge Covering Metrics
         – Applying Data Set Complexity Metrics Suite



                                                                                 Slide 22
GRSI
GRSI


       Artificial Data Sets based on Knowledge Generators:
       Analysis of Learning Algorithms Efficiency
            y              gg                   y
         Joaquin Rios Boutin, Albert Orriols-Puig, Josep-Maria Garrell-Guiu
                         {j
                         {jrios, aorriols, josepmg}@salle.url.edu
                                           j   p g}@

       GRSI (Grup de Recerca en Sistemes Intel·ligents)
             http://www.salle.url.edu/GRSI
           • http://www salle url edu/GRSI


       – Oriented to:
           • CBR (Computer Based Reasoning) Algorithms
           • Evolutive Computation Algorithms
           • Data Mining Technology Transfer




                                 Enginyeria i Arquitectura la Salle           Slide 23
GRSI

More Related Content

Viewers also liked

ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...
ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...
ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...Albert Orriols-Puig
 
CCIA'2008: On the dimensions of data complexity through synthetic data sets
CCIA'2008: On the dimensions of data complexity through synthetic data setsCCIA'2008: On the dimensions of data complexity through synthetic data sets
CCIA'2008: On the dimensions of data complexity through synthetic data setsAlbert Orriols-Puig
 
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCSHIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCSAlbert Orriols-Puig
 
GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...
GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...
GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...Albert Orriols-Puig
 
HAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasetsHAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasetsAlbert Orriols-Puig
 
Lecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligenceLecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligenceAlbert Orriols-Puig
 
Artificial Intelligence Presentation
Artificial Intelligence PresentationArtificial Intelligence Presentation
Artificial Intelligence Presentationlpaviglianiti
 
Artificial inteligence
Artificial inteligenceArtificial inteligence
Artificial inteligence
Intekhab Alam Khan
 

Viewers also liked (11)

ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...
ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...
ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...
 
CCIA'2008: On the dimensions of data complexity through synthetic data sets
CCIA'2008: On the dimensions of data complexity through synthetic data setsCCIA'2008: On the dimensions of data complexity through synthetic data sets
CCIA'2008: On the dimensions of data complexity through synthetic data sets
 
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCSHIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
 
GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...
GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...
GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...
 
Lecture7 - IBk
Lecture7 - IBkLecture7 - IBk
Lecture7 - IBk
 
Lecture24
Lecture24Lecture24
Lecture24
 
Lecture23
Lecture23Lecture23
Lecture23
 
HAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasetsHAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasets
 
Lecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligenceLecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligence
 
Artificial Intelligence Presentation
Artificial Intelligence PresentationArtificial Intelligence Presentation
Artificial Intelligence Presentation
 
Artificial inteligence
Artificial inteligenceArtificial inteligence
Artificial inteligence
 

Similar to HIS'2008: Artificial Data Sets based on Knowledge Generators: Analysis of Learning Algorithms Efficiency

The Influence of Extensible Algorithms on Operating Systems
The Influence of Extensible Algorithms on Operating SystemsThe Influence of Extensible Algorithms on Operating Systems
The Influence of Extensible Algorithms on Operating Systemsricky_pi_tercios
 
Meetup Python Madrid 2018: ¿Segmentación semántica? ¿Pero de qué me estás hab...
Meetup Python Madrid 2018: ¿Segmentación semántica? ¿Pero de qué me estás hab...Meetup Python Madrid 2018: ¿Segmentación semántica? ¿Pero de qué me estás hab...
Meetup Python Madrid 2018: ¿Segmentación semántica? ¿Pero de qué me estás hab...
Ricardo Guerrero Gómez-Olmedo
 
Artificial Intelligence IA at the service of Laboratories
Artificial Intelligence IA at the service of LaboratoriesArtificial Intelligence IA at the service of Laboratories
Artificial Intelligence IA at the service of Laboratories
Yvon Gervaise
 
Conférence Y. GervaiseEN1st Green Analytical Y. Gervaise.pdf
Conférence Y. GervaiseEN1st Green Analytical Y. Gervaise.pdfConférence Y. GervaiseEN1st Green Analytical Y. Gervaise.pdf
Conférence Y. GervaiseEN1st Green Analytical Y. Gervaise.pdf
YvonGervaise
 
Data Mining
Data MiningData Mining
Data Miningswami920
 
Smart Data Webinar: Machine Learning Update
Smart Data Webinar: Machine Learning UpdateSmart Data Webinar: Machine Learning Update
Smart Data Webinar: Machine Learning Update
DATAVERSITY
 
IBM Meetup on November 1, 2018: Machine Learning made easy with Watson Studio
IBM Meetup on November 1, 2018: Machine Learning made easy with Watson StudioIBM Meetup on November 1, 2018: Machine Learning made easy with Watson Studio
IBM Meetup on November 1, 2018: Machine Learning made easy with Watson Studio
Svetlana Levitan, PhD
 
Four Problems You Run into When DIY-ing a “Big Data” Analytics System
Four Problems You Run into When DIY-ing a “Big Data” Analytics SystemFour Problems You Run into When DIY-ing a “Big Data” Analytics System
Four Problems You Run into When DIY-ing a “Big Data” Analytics System
Treasure Data, Inc.
 
Lecture-1-2-+(1).pdf
Lecture-1-2-+(1).pdfLecture-1-2-+(1).pdf
Lecture-1-2-+(1).pdf
samaghorab
 
Lecture-1-2-+(1).pdf
Lecture-1-2-+(1).pdfLecture-1-2-+(1).pdf
Lecture-1-2-+(1).pdf
samaghorab
 
IRJET- Object Detection in an Image using Convolutional Neural Network
IRJET- Object Detection in an Image using Convolutional Neural NetworkIRJET- Object Detection in an Image using Convolutional Neural Network
IRJET- Object Detection in an Image using Convolutional Neural Network
IRJET Journal
 
MachinaFiesta: A Vision into Machine Learning 🚀
MachinaFiesta: A Vision into Machine Learning 🚀MachinaFiesta: A Vision into Machine Learning 🚀
MachinaFiesta: A Vision into Machine Learning 🚀
GDSCNiT
 
An introduction to Machine Learning with scikit-learn (October 2018)
An introduction to Machine Learning with scikit-learn (October 2018)An introduction to Machine Learning with scikit-learn (October 2018)
An introduction to Machine Learning with scikit-learn (October 2018)
Julien SIMON
 
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
vinayiqbusiness
 
Machine Learning for Speech
Machine Learning for Speech Machine Learning for Speech
Machine Learning for Speech butest
 
How to test an AI application
How to test an AI applicationHow to test an AI application
How to test an AI application
Kari Kakkonen
 
Integrating R with the CDK: Enhanced Chemical Data Mining
Integrating R with the CDK: Enhanced Chemical Data MiningIntegrating R with the CDK: Enhanced Chemical Data Mining
Integrating R with the CDK: Enhanced Chemical Data MiningRajarshi Guha
 
How to implement artificial intelligence solutions
How to implement artificial intelligence solutionsHow to implement artificial intelligence solutions
How to implement artificial intelligence solutions
Carlos Toxtli
 
Apple Machine Learning
Apple Machine LearningApple Machine Learning
Apple Machine Learning
Denise Nepraunig
 
Final Year Project Guidance
Final Year Project GuidanceFinal Year Project Guidance
Final Year Project Guidance
Varad Meru
 

Similar to HIS'2008: Artificial Data Sets based on Knowledge Generators: Analysis of Learning Algorithms Efficiency (20)

The Influence of Extensible Algorithms on Operating Systems
The Influence of Extensible Algorithms on Operating SystemsThe Influence of Extensible Algorithms on Operating Systems
The Influence of Extensible Algorithms on Operating Systems
 
Meetup Python Madrid 2018: ¿Segmentación semántica? ¿Pero de qué me estás hab...
Meetup Python Madrid 2018: ¿Segmentación semántica? ¿Pero de qué me estás hab...Meetup Python Madrid 2018: ¿Segmentación semántica? ¿Pero de qué me estás hab...
Meetup Python Madrid 2018: ¿Segmentación semántica? ¿Pero de qué me estás hab...
 
Artificial Intelligence IA at the service of Laboratories
Artificial Intelligence IA at the service of LaboratoriesArtificial Intelligence IA at the service of Laboratories
Artificial Intelligence IA at the service of Laboratories
 
Conférence Y. GervaiseEN1st Green Analytical Y. Gervaise.pdf
Conférence Y. GervaiseEN1st Green Analytical Y. Gervaise.pdfConférence Y. GervaiseEN1st Green Analytical Y. Gervaise.pdf
Conférence Y. GervaiseEN1st Green Analytical Y. Gervaise.pdf
 
Data Mining
Data MiningData Mining
Data Mining
 
Smart Data Webinar: Machine Learning Update
Smart Data Webinar: Machine Learning UpdateSmart Data Webinar: Machine Learning Update
Smart Data Webinar: Machine Learning Update
 
IBM Meetup on November 1, 2018: Machine Learning made easy with Watson Studio
IBM Meetup on November 1, 2018: Machine Learning made easy with Watson StudioIBM Meetup on November 1, 2018: Machine Learning made easy with Watson Studio
IBM Meetup on November 1, 2018: Machine Learning made easy with Watson Studio
 
Four Problems You Run into When DIY-ing a “Big Data” Analytics System
Four Problems You Run into When DIY-ing a “Big Data” Analytics SystemFour Problems You Run into When DIY-ing a “Big Data” Analytics System
Four Problems You Run into When DIY-ing a “Big Data” Analytics System
 
Lecture-1-2-+(1).pdf
Lecture-1-2-+(1).pdfLecture-1-2-+(1).pdf
Lecture-1-2-+(1).pdf
 
Lecture-1-2-+(1).pdf
Lecture-1-2-+(1).pdfLecture-1-2-+(1).pdf
Lecture-1-2-+(1).pdf
 
IRJET- Object Detection in an Image using Convolutional Neural Network
IRJET- Object Detection in an Image using Convolutional Neural NetworkIRJET- Object Detection in an Image using Convolutional Neural Network
IRJET- Object Detection in an Image using Convolutional Neural Network
 
MachinaFiesta: A Vision into Machine Learning 🚀
MachinaFiesta: A Vision into Machine Learning 🚀MachinaFiesta: A Vision into Machine Learning 🚀
MachinaFiesta: A Vision into Machine Learning 🚀
 
An introduction to Machine Learning with scikit-learn (October 2018)
An introduction to Machine Learning with scikit-learn (October 2018)An introduction to Machine Learning with scikit-learn (October 2018)
An introduction to Machine Learning with scikit-learn (October 2018)
 
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
 
Machine Learning for Speech
Machine Learning for Speech Machine Learning for Speech
Machine Learning for Speech
 
How to test an AI application
How to test an AI applicationHow to test an AI application
How to test an AI application
 
Integrating R with the CDK: Enhanced Chemical Data Mining
Integrating R with the CDK: Enhanced Chemical Data MiningIntegrating R with the CDK: Enhanced Chemical Data Mining
Integrating R with the CDK: Enhanced Chemical Data Mining
 
How to implement artificial intelligence solutions
How to implement artificial intelligence solutionsHow to implement artificial intelligence solutions
How to implement artificial intelligence solutions
 
Apple Machine Learning
Apple Machine LearningApple Machine Learning
Apple Machine Learning
 
Final Year Project Guidance
Final Year Project GuidanceFinal Year Project Guidance
Final Year Project Guidance
 

More from Albert Orriols-Puig (20)

Lecture22
Lecture22Lecture22
Lecture22
 
Lecture21
Lecture21Lecture21
Lecture21
 
Lecture20
Lecture20Lecture20
Lecture20
 
Lecture19
Lecture19Lecture19
Lecture19
 
Lecture18
Lecture18Lecture18
Lecture18
 
Lecture17
Lecture17Lecture17
Lecture17
 
Lecture16 - Advances topics on association rules PART III
Lecture16 - Advances topics on association rules PART IIILecture16 - Advances topics on association rules PART III
Lecture16 - Advances topics on association rules PART III
 
Lecture15 - Advances topics on association rules PART II
Lecture15 - Advances topics on association rules PART IILecture15 - Advances topics on association rules PART II
Lecture15 - Advances topics on association rules PART II
 
Lecture14 - Advanced topics in association rules
Lecture14 - Advanced topics in association rulesLecture14 - Advanced topics in association rules
Lecture14 - Advanced topics in association rules
 
Lecture13 - Association Rules
Lecture13 - Association RulesLecture13 - Association Rules
Lecture13 - Association Rules
 
Lecture12 - SVM
Lecture12 - SVMLecture12 - SVM
Lecture12 - SVM
 
Lecture11 - neural networks
Lecture11 - neural networksLecture11 - neural networks
Lecture11 - neural networks
 
Lecture10 - Naïve Bayes
Lecture10 - Naïve BayesLecture10 - Naïve Bayes
Lecture10 - Naïve Bayes
 
Lecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-TheoryLecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-Theory
 
Lecture8 - From CBR to IBk
Lecture8 - From CBR to IBkLecture8 - From CBR to IBk
Lecture8 - From CBR to IBk
 
Lecture6 - C4.5
Lecture6 - C4.5Lecture6 - C4.5
Lecture6 - C4.5
 
Lecture5 - C4.5
Lecture5 - C4.5Lecture5 - C4.5
Lecture5 - C4.5
 
Lecture4 - Machine Learning
Lecture4 - Machine LearningLecture4 - Machine Learning
Lecture4 - Machine Learning
 
Lecture3 - Machine Learning
Lecture3 - Machine LearningLecture3 - Machine Learning
Lecture3 - Machine Learning
 
Lecture2 - Machine Learning
Lecture2 - Machine LearningLecture2 - Machine Learning
Lecture2 - Machine Learning
 

Recently uploaded

Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
Thiyagu K
 
Fish and Chips - have they had their chips
Fish and Chips - have they had their chipsFish and Chips - have they had their chips
Fish and Chips - have they had their chips
GeoBlogs
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Thiyagu K
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
joachimlavalley1
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
Anna Sz.
 
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxMARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
bennyroshan06
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
Celine George
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdf
Vivekanand Anglo Vedic Academy
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
Vikramjit Singh
 
Introduction to Quality Improvement Essentials
Introduction to Quality Improvement EssentialsIntroduction to Quality Improvement Essentials
Introduction to Quality Improvement Essentials
Excellence Foundation for South Sudan
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Celine George
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
 

Recently uploaded (20)

Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
 
Fish and Chips - have they had their chips
Fish and Chips - have they had their chipsFish and Chips - have they had their chips
Fish and Chips - have they had their chips
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
 
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxMARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdf
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
 
Introduction to Quality Improvement Essentials
Introduction to Quality Improvement EssentialsIntroduction to Quality Improvement Essentials
Introduction to Quality Improvement Essentials
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 

HIS'2008: Artificial Data Sets based on Knowledge Generators: Analysis of Learning Algorithms Efficiency

  • 1. Artificial Data Sets based on Knowledge Generators: Analysis of g y Learning Algorithms Efficiency Joaquin Rios-Boutin Rios Boutin Albert Orriols-Puig Josep Maria Garrell Guiu Josep-Maria Garrell-Guiu Grup de Recerca en Sistemes Intel·ligents Enginyeria i Arquitectura La Salle Universitat Ramon Llull Salle, {jrios, aorriols, josepmg}@salle.url.edu
  • 2. Motivation What is the Holy Grail of Machine Learning? – Find the right Learning Algorithm to every Problem – Real Problems are black boxes • We don’t know which knowledge is contained DI • We can’t answer: – When to stop training? – How much efficient is the learning process? – Artificial Problems: DI K • Knowledge-driven • Property-driven Complex.Met. Enginyeria i Arquitectura la Salle Slide 2 GRSI
  • 3. Framework Machine Learning as a Communication System Communication Chanel Learning Environment. Algorithm. Al ith Data Set Knowledge Learned to be learned Knowledge Enginyeria i Arquitectura la Salle Slide 3 GRSI
  • 4. Outline 1. Algorithm Evaluation Methodology Definition 1 Al ih E l i M hdl D fi i i 2. 2 Methodology Implementation 3. Experiment Description 4. Results and Analysis 5. Conclusions and Further Work Enginyeria i Arquitectura la Salle Slide 4 GRSI
  • 5. 1 Algorithm Evaluation Process g Process Execution and Control DB Problem Sampling Sampling Algorithm Data Set Method Size Parameters Accuracy DI Learning Generation Algorithm 1.2 DS1kMulplx6m1 1 0.8 0.6 0.4 0.2 0 0 2000 4000 6000 8000 10000 Knowledge Comparison 1.2 DS1kMulplx6m1 1 Optimal p 0.8 0.6 Population 0.4 0.2 0 0 2000 4000 6000 8000 10000 Enginyeria i Arquitectura la Salle Slide 5 GRSI
  • 6. 1 Algorithm Evaluation Process Dimensions 100000 10000 mpling g 1000 Size 100 Sam S Apn A 10 Alg.P 1 AP1 aram SRS SIS RRS RIS . Sampling Methods To each Problem Enginyeria i Arquitectura la Salle Slide 6 GRSI
  • 7. Outline 1. Algorithm Evaluation Methodology Definition 1 Al ih E l i M hdl D fi i i 2. 2 Methodology Implementation 3. Experiment Description 4. Results and Analysis 5. Conclusions and Further Work Enginyeria i Arquitectura la Salle Slide 7 GRSI
  • 8. 2 Knowledge Representation g p Condition Class/Action a11 a12 a1j a1m C1 Rule1 ai1 ai2 aij aim Ci Rule Set an1 an2 anj anm Cn aij={0,1, #} CiєN {0,1, Enginyeria i Arquitectura la Salle Slide 8 GRSI
  • 9. 2 Sampling Methods pg SRS Sequential Rule Selection SIS Sequential Instance Selection Sequential # substitution 1st 2nd Random # substitution 2nd 1st RRS Random Rule Selection RIS Random Instance Selection Random # substitution Sequential # 2nd 1st substitution 2nd 1st Enginyeria i Arquitectura la Salle Slide 9 GRSI
  • 10. 2 Problems to learn and Learning Algorithm Mux6 Mux11 Parity5 0 0 # # # 0 0 0 0 0 0 0 0 0 0 # # # 1 1 0 0 0 0 1 1 0 1 # # 0 # 0 0 0 0 1 0 1 0 1 # # 1 # 1 0 0 0 1 1 0 1 1 0 # # # 0 1 1 1 1 0 0 XCS 1 1 1 # # # 1 1 1 1 1 1 1 Parity5-3 Position5 Position11 0 0 0 0 0 0 0 0 0 0 0 # # # 0 0 0 0 0 1 1 0 0 0 0 1 # # # 1 0 0 0 1 # 2 0 0 0 1 0 # # # 1 0 0 1 # # 3 0 0 0 1 1 # # # 0 1 # # # # 5 1 1 1 1 0 # # # 0 1 1 1 1 1 # # # 1 Enginyeria i Arquitectura la Salle Slide 10 GRSI
  • 11. 2 Problem Properties p Optimal Rule Sets – Complete – Non overlapped – Irreducible Why? – Simple structure of knowledge complexity –V Very k known artificial problems tifi i l bl Enginyeria i Arquitectura la Salle Slide 11 GRSI
  • 12. Outline 1. Algorithm Evaluation Methodology Definition 1 Al ih E l i M hdl D fi i i 2. 2 Methodology Implementation 3. Experiment Description 4. Results and Analysis 5. Conclusions and Further Work Enginyeria i Arquitectura la Salle Slide 12 GRSI
  • 13. 3 Sampling and Learning Iteration pg g { {Sampling Iteration} Problem {Training Iteration} pg } { g } Sampling Sampling Algorithm Data Set Method Size Parameters Accuracy DI Learning Genaration Algorithm 1.2 DS1kMulplx6m1 1 0.8 0.6 0.4 0.2 0 0 2000 4000 6000 8000 10000 Knowledge Comparison 1.2 DS1kMulplx6m1 1 Optimal 0.8 Population P l ti 0.6 0.4 0.2 0 0 2000 4000 6000 8000 10000 Enginyeria i Arquitectura la Salle Slide 13 GRSI
  • 14. 3 Output Results and Iteration Reduction p Output Results – 2 Plots to every Problem Sampling Method Sampling Size and Problem, Method, Algorithm Parameters. 1.2 DS1kMulplx6m1 • Optimal Population 1 • Accuracy 0.8 Iteration R d ti It ti Reduction 0.6 – SIS Pure sequential 0.4 • No Sampling Iteration Needed 0.2 – Problems without “don’t care” 0 • SRS=SIS and RRS=RIS 0 2000 4000 6000 8000 10000 Slide 14 GRSI
  • 15. 3 Experimental Parameters p Number of Problems = 6 Number f Sampling M th d = 4 N b of S li Methods Number of different Sampling Sizes = 4 Number of different Algorithms Parameters Sets = 2 Number f Sampling It ti N b of S li Iterations = 10 Number of Training Iterations = 10 Number of Data Sets Generated = 744 Number of Training Process = 14880 Slide 15 GRSI
  • 16. Outline 1. Algorithm Evaluation Methodology Definition 1 Al ih E l i M hdl D fi i i 2. 2 Methodology Implementation 3. Experiment Description 4. Results and Analysis 5. Conclusions and Further Work Enginyeria i Arquitectura la Salle Slide 16 GRSI
  • 17. Problem Dimension Sampling M. = RIS Sampling Size = 1000 Learning Alg. Param. = pDNC 0.2 M Alg Param 02 1.2 Mux6 1.1 DS1kMulplx6m4 DS1kParity5m4 Parity5 1 1 0.9 0.8 0.8 0.6 0.7 0.6 0.4 0.5 0.2 0.4 0 0.3 -0.2 0.2 0 2000 4000 6000 8000 10000 0 2000 4000 6000 8000 10000 1.05 DS1kMulplx6m4 1.05 DS1kParity5m4 1 1 0.95 0.9 0.95 0.85 0.9 0.8 0.75 0.85 0.7 0.65 0.8 0.6 0.75 0.55 0 2000 4000 6000 8000 10000 0 2000 4000 6000 8000 10000 Slide 17 GRSI
  • 18. Sampling Method Dimension pg Problem = Position5 Sampling Size = 1000 Learning Alg. Param. = pDNC 0.2 Alg Param 02 SRS Sequential Rule Selection RIS Random Instance Selection 1.2 0.9 DS1kPosition5m1 DS1kPosition5m4 0.8 1 0.7 0.6 0.8 0.5 0.6 0.4 0.3 0.4 0.2 0.1 0.2 0 0 -0.1 0 2000 4000 6000 8000 10000 0 2000 4000 6000 8000 10000 1.1 1.1 DS1kPosition5m1 DS1kPosition5m4 1 1 0.9 0.9 0.8 0.8 0.7 0.7 0.6 0.6 0.5 0.5 0.4 0.4 0.3 0.3 0 2000 4000 6000 8000 10000 0 2000 4000 6000 8000 10000 Slide 18 GRSI
  • 19. Sampling Size Dimension pg Problem = Parity5 Sampling M.= RIS Learning Alg. Param. = pDNC 0.2 M= Alg Param 02 1.1 1.1 DS100Parity5m4 100 DS10kParity5m4 10000 1 1 0.9 0.9 0.8 0.8 0.7 0.7 0.6 0.6 0.5 0.5 0.4 0.4 0.3 0.3 0.2 0.1 0.2 0 2000 4000 6000 8000 10000 0 2000 4000 6000 8000 10000 1.05 1.05 DS10kParity5m4 DS100Parity5m4 1 1 0.95 0.95 0.9 0.9 0.85 0.85 0.8 0.8 0.75 0.7 0.75 0.65 0.7 0.6 0.65 0.55 0 2000 4000 6000 8000 10000 0.6 0 2000 4000 6000 8000 10000 Slide 19 GRSI
  • 20. Parameter Algorithm Dimension g Problem = Mux6 Sampling M. = RIS Sampling Size = 1000 M 1 1.2 DS1kMulplx6m4 DS1kMulplx6m4 pDNC 0.8 0.9 pDNC 0.2 1 0.8 0.7 0.8 0.6 0.6 0.5 0.4 0.4 0.3 0.2 0.2 0.1 0 0 -0.1 -0.2 0 2000 4000 6000 8000 10000 0 2000 4000 6000 8000 10000 1.05 DS1kMulplx6m4 1.05 DS1kMulplx6m4 1 0.95 1 0.9 0.85 0.95 0.8 0.9 0.75 0.7 0.85 0.65 0.6 0.8 0.55 0.5 0.75 0 2000 4000 6000 8000 10000 0 2000 4000 6000 8000 10000 Slide 20 GRSI
  • 21. Outline 1. Algorithm Evaluation Methodology Definition 1 Al ih E l i M hdl D fi i i 2. 2 Methodology Implementation 3. Experiment Description 4. Results and Analysis 5. Conclusions and Further Work Enginyeria i Arquitectura la Salle Slide 21 GRSI
  • 22. Conclusions and Further Work Conclusions – Automatic Learning Algorithm Analyzer based on Artificial Data Sets – Four dimensions comparisons – Methodology Implementation, Experiment and Results Analysis Further Work – Non ORS Problems – R l Att ib t Real Attributes – Sampling Methods based on distance or transition matrix – Multi Step Problems p – Different Learning Algorithms – Different Knowledge representations – Knowledge Covering Metrics – Applying Data Set Complexity Metrics Suite Slide 22 GRSI
  • 23. GRSI Artificial Data Sets based on Knowledge Generators: Analysis of Learning Algorithms Efficiency y gg y Joaquin Rios Boutin, Albert Orriols-Puig, Josep-Maria Garrell-Guiu {j {jrios, aorriols, josepmg}@salle.url.edu j p g}@ GRSI (Grup de Recerca en Sistemes Intel·ligents) http://www.salle.url.edu/GRSI • http://www salle url edu/GRSI – Oriented to: • CBR (Computer Based Reasoning) Algorithms • Evolutive Computation Algorithms • Data Mining Technology Transfer Enginyeria i Arquitectura la Salle Slide 23 GRSI