SlideShare a Scribd company logo
1 of 23
Download to read offline
Artificial Data Sets based on
Knowledge Generators: Analysis of
         g                   y
  Learning Algorithms Efficiency

                       Joaquin Rios-Boutin
                               Rios Boutin
                        Albert Orriols-Puig
                    Josep Maria Garrell Guiu
                    Josep-Maria Garrell-Guiu

             Grup de Recerca en Sistemes Intel·ligents
      Enginyeria i Arquitectura La Salle Universitat Ramon Llull
                                     Salle,
               {jrios, aorriols, josepmg}@salle.url.edu
Motivation

       What is the Holy Grail of Machine Learning?
       – Find the right Learning Algorithm to every Problem
       – Real Problems are black boxes
          • We don’t know which knowledge is contained

                                                                            DI
          • We can’t answer:
              – When to stop training?
              – How much efficient is the learning process?



       – Artificial Problems:
                                                                            DI
                                                                     K
          • Knowledge-driven
          • Property-driven
                                                                         Complex.Met.

                                Enginyeria i Arquitectura la Salle                 Slide 2
GRSI
Framework

       Machine Learning as a Communication System



                       Communication Chanel
                                                                 Learning
       Environment.
                                                                Algorithm.
                                                                Al   ith
                               Data Set
        Knowledge
                                                                 Learned
       to be learned
                                                                Knowledge




                           Enginyeria i Arquitectura la Salle                Slide 3
GRSI
Outline



        1. Algorithm Evaluation Methodology Definition
        1 Al    ih E l      i   M hdl       D fi i i

        2.
        2 Methodology Implementation

        3. Experiment Description

        4. Results and Analysis

        5. Conclusions and Further Work




                            Enginyeria i Arquitectura la Salle   Slide 4
GRSI
1 Algorithm Evaluation Process
           g

                 Process Execution and Control DB

                                            Problem
             Sampling   Sampling                                                                                       Algorithm
                                            Data Set
              Method      Size                                                                                        Parameters




                                                                                                                                              Accuracy
                  DI                                                                                                    Learning
               Generation                                                                                               Algorithm   1.2
                                                                                                                                                            DS1kMulplx6m1


                                                                                                                                     1


                                                                                                                                    0.8


                                                                                                                                    0.6


                                                                                                                                    0.4


                                                                                                                                    0.2


                                                                                                                                     0
                                                                                                                                          0   2000   4000      6000    8000   10000




                  Knowledge
                  Comparison                                    1.2
                                                                                        DS1kMulplx6m1


                                                                 1




                                                                                                                   Optimal
                                                                                                                    p
                                                                0.8


                                                                0.6




                                                                                                                  Population
                                                                0.4


                                                                0.2


                                                                 0
                                                                      0   2000   4000      6000    8000   10000




                                   Enginyeria i Arquitectura la Salle                                                                                        Slide 5
GRSI
1 Algorithm Evaluation Process Dimensions




              100000
               10000
     mpling
          g




                1000
     Size




                 100
   Sam
     S




                                                                            Apn
                                                                            A
                  10                                                          Alg.P
                   1                                                  AP1     aram
                       SRS    SIS           RRS                 RIS             .
                        Sampling Methods

                             To each Problem
                                Enginyeria i Arquitectura la Salle                    Slide 6
GRSI
Outline



        1. Algorithm Evaluation Methodology Definition
        1 Al    ih E l      i   M hdl       D fi i i

        2.
        2 Methodology Implementation

        3. Experiment Description

        4. Results and Analysis

        5. Conclusions and Further Work




                            Enginyeria i Arquitectura la Salle   Slide 7
GRSI
2 Knowledge Representation
                g    p

                              Condition                                 Class/Action


                  a11   a12              a1j                        a1m    C1     Rule1



                  ai1   ai2              aij                        aim    Ci
        Rule
        Set


                  an1   an2              anj                        anm    Cn



               aij={0,1, #} CiєN
                   {0,1,

                                   Enginyeria i Arquitectura la Salle                     Slide 8
GRSI
2 Sampling Methods
            pg

       SRS Sequential Rule Selection                                   SIS Sequential Instance Selection
                                                                                                     Sequential #
                                                                                                     substitution
                                                                                               1st
                                                          2nd
                      Random # substitution
                2nd


         1st




       RRS Random Rule Selection                                        RIS Random Instance Selection
                      Random # substitution                                                            Sequential #
                2nd
                                                                                                 1st   substitution
                                                               2nd
         1st




                                              Enginyeria i Arquitectura la Salle                               Slide 9
GRSI
2 Problems to learn and Learning Algorithm

               Mux6 Mux11                                                  Parity5
               0   0   #   #   #       0   0                               0    0   0   0   0   0
               0   0   #   #   #       1   1                               0    0   0   0   1   1
               0   1   #   #   0       #   0                               0    0   0   1   0   1
               0   1   #   #   1       #   1                               0    0   0   1   1   0


               1   1   0   #   #       #   0                               1    1   1   1   0   0


                                               XCS
               1   1   1   #   #       #   1                               1    1   1   1   1   1




                                                                           Parity5-3
           Position5 Position11
           0       0   0   0       0       0                               0    0   0   0   0   #   #   #   0
           0       0   0   0       1       1                               0    0   0   0   1   #   #   #   1
           0       0   0   1       #       2                               0    0   0   1   0   #   #   #   1
           0       0   1   #       #       3                               0    0   0   1   1   #   #   #   0


           1       #   #   #       #       5                               1    1   1   1   0   #   #   #   0
                                                                           1    1   1   1   1   #   #   #   1


                                               Enginyeria i Arquitectura la Salle                               Slide 10
GRSI
2 Problem Properties
                    p


        Optimal Rule Sets
         – Complete
         – Non overlapped
         – Irreducible
        Why?
         – Simple structure of knowledge complexity
         –V
          Very k
               known artificial problems
                       tifi i l    bl




                                Enginyeria i Arquitectura la Salle   Slide 11
GRSI
Outline



        1. Algorithm Evaluation Methodology Definition
        1 Al    ih E l      i   M hdl       D fi i i

        2.
        2 Methodology Implementation

        3. Experiment Description

        4. Results and Analysis

        5. Conclusions and Further Work




                            Enginyeria i Arquitectura la Salle   Slide 12
GRSI
3 Sampling and Learning Iteration
            pg               g


         {
         {Sampling Iteration} Problem {Training Iteration}
              pg               }          {      g       }
             Sampling Sampling               Algorithm
                                 Data Set
              Method    Size                Parameters




                                                                                                                               Accuracy
                 DI                                                                                      Learning
              Genaration                                                                                 Algorithm   1.2
                                                                                                                                             DS1kMulplx6m1


                                                                                                                      1


                                                                                                                     0.8


                                                                                                                     0.6


                                                                                                                     0.4


                                                                                                                     0.2


                                                                                                                      0
                                                                                                                           0   2000   4000      6000    8000   10000




               Knowledge
               Comparison                 1.2
                                                                  DS1kMulplx6m1


                                           1




                                                                                              Optimal
                                          0.8




                                                                                            Population
                                                                                            P    l ti
                                          0.6


                                          0.4


                                          0.2


                                           0
                                                0   2000   4000      6000    8000   10000




                           Enginyeria i Arquitectura la Salle                                                                           Slide 13
GRSI
3 Output Results and Iteration Reduction
            p


        Output Results
         – 2 Plots to every Problem Sampling Method Sampling Size and
                            Problem,         Method,
           Algorithm Parameters.                 1.2
                                                                        DS1kMulplx6m1

             • Optimal Population                1


             • Accuracy                         0.8

        Iteration R d ti
        It ti Reduction                         0.6

         – SIS Pure sequential
                                                0.4

             • No Sampling Iteration Needed
                                                0.2
         – Problems without “don’t care”
                                                 0
             • SRS=SIS and RRS=RIS                    0   2000   4000      6000    8000   10000




                                                                                                  Slide 14
GRSI
3 Experimental Parameters
           p


        Number of Problems = 6
        Number f Sampling M th d = 4
        N b of S     li Methods
        Number of different Sampling Sizes = 4
        Number of different Algorithms Parameters Sets = 2
        Number f Sampling It ti
        N b of S     li Iterations = 10
        Number of Training Iterations = 10
        Number of Data Sets Generated = 744
        Number of Training Process = 14880



                                                             Slide 15
GRSI
Outline



        1. Algorithm Evaluation Methodology Definition
        1 Al    ih E l      i   M hdl       D fi i i

        2.
        2 Methodology Implementation

        3. Experiment Description

        4. Results and Analysis

        5. Conclusions and Further Work




                            Enginyeria i Arquitectura la Salle   Slide 16
GRSI
Problem Dimension

         Sampling M. = RIS Sampling Size = 1000 Learning Alg. Param. = pDNC 0.2
                  M                                      Alg Param          02
                    1.2
       Mux6                                                            1.1
                                            DS1kMulplx6m4
                                                                                               DS1kParity5m4
                                                                                                                           Parity5
                                                                        1
                     1

                                                                       0.9
                    0.8
                                                                       0.8
                    0.6
                                                                       0.7

                                                                       0.6
                    0.4

                                                                       0.5
                    0.2
                                                                       0.4
                     0
                                                                       0.3

                   -0.2                                                0.2
                          0   2000   4000      6000    8000   10000          0   2000   4000      6000    8000    10000
                   1.05
                                        DS1kMulplx6m4                 1.05
                                                                                               DS1kParity5m4
                                                                        1
                     1
                                                                      0.95

                                                                       0.9
                   0.95

                                                                      0.85
                    0.9
                                                                       0.8

                                                                      0.75
                   0.85
                                                                       0.7

                                                                      0.65
                    0.8
                                                                       0.6

                   0.75                                               0.55
                          0   2000   4000      6000    8000   10000          0   2000   4000      6000     8000    10000




                                                                                                                                     Slide 17
GRSI
Sampling Method Dimension
           pg

          Problem = Position5 Sampling Size = 1000 Learning Alg. Param. = pDNC 0.2
                                                            Alg Param          02
SRS Sequential Rule Selection                                                      RIS Random Instance Selection
             1.2                                              0.9
                                DS1kPosition5m1                                   DS1kPosition5m4
                                                              0.8
              1
                                                              0.7

                                                              0.6
             0.8

                                                              0.5
             0.6
                                                              0.4

                                                              0.3
             0.4
                                                              0.2

                                                              0.1
             0.2

                                                               0
              0
                                                             -0.1
                   0   2000   4000   6000     8000   10000
                                                                    0   2000   4000    6000    8000    10000

             1.1                                             1.1
                                 DS1kPosition5m1                                 DS1kPosition5m4

              1                                               1

             0.9                                             0.9

             0.8                                             0.8

             0.7                                             0.7

             0.6
                                                             0.6

             0.5
                                                             0.5

             0.4
                                                             0.4

             0.3
                                                             0.3
                   0   2000   4000    6000    8000   10000
                                                                   0    2000   4000    6000     8000   10000
                                                                                                               Slide 18
 GRSI
Sampling Size Dimension
          pg

        Problem = Parity5 Sampling M.= RIS Learning Alg. Param. = pDNC 0.2
                                   M=               Alg Param          02
               1.1
                                                                  1.1
                                       DS100Parity5m4
        100                                                                               DS10kParity5m4
                                                                                                                     10000
                1
                                                                   1
               0.9
                                                                  0.9
               0.8
                                                                  0.8
               0.7
                                                                  0.7
               0.6
                                                                  0.6
               0.5
                                                                  0.5
               0.4

                                                                  0.4
               0.3

                                                                  0.3
               0.2

               0.1                                                0.2
                     0       2000   4000   6000    8000   10000         0       2000   4000   6000    8000   10000

                                                                  1.05
               1.05
                                                                                          DS10kParity5m4
                                       DS100Parity5m4
                                                                        1
                     1
                                                                  0.95
               0.95
                                                                   0.9
                0.9
                                                                  0.85
               0.85
                                                                   0.8

                0.8                                               0.75

                                                                   0.7
               0.75
                                                                  0.65
                0.7
                                                                   0.6
               0.65
                                                                  0.55
                                                                            0   2000   4000   6000    8000   10000
                0.6
                         0   2000   4000   6000    8000   10000

                                                                                                                             Slide 19
GRSI
Parameter Algorithm Dimension
                   g

        Problem = Mux6 Sampling M. = RIS Sampling Size = 1000
                                M
                1                                                 1.2
                                       DS1kMulplx6m4                                            DS1kMulplx6m4

  pDNC 0.8    0.9
                                                                                                                             pDNC 0.2
                                                                    1
              0.8

              0.7                                                 0.8
              0.6
                                                                  0.6
              0.5

              0.4
                                                                  0.4
              0.3

                                                                  0.2
              0.2

              0.1
                                                                    0
                0

              -0.1                                                -0.2
                     0   2000   4000      6000    8000    10000          0       2000    4000      6000     8000    10000


             1.05
                                   DS1kMulplx6m4                   1.05
                                                                                                DS1kMulplx6m4
                1

             0.95                                                        1
              0.9

             0.85                                                  0.95

              0.8
                                                                    0.9
             0.75

              0.7
                                                                   0.85
             0.65

              0.6
                                                                    0.8
             0.55

              0.5
                                                                   0.75
                     0   2000   4000     6000    8000    10000
                                                                             0    2000    4000       6000    8000    10000




                                                                                                                                        Slide 20
GRSI
Outline



        1. Algorithm Evaluation Methodology Definition
        1 Al    ih E l      i   M hdl       D fi i i

        2.
        2 Methodology Implementation

        3. Experiment Description

        4. Results and Analysis

        5. Conclusions and Further Work




                            Enginyeria i Arquitectura la Salle   Slide 21
GRSI
Conclusions and Further Work

        Conclusions
         – Automatic Learning Algorithm Analyzer based on Artificial Data Sets
         – Four dimensions comparisons
         – Methodology Implementation, Experiment and Results Analysis

        Further Work
         – Non ORS Problems
         – R l Att ib t
           Real Attributes
         – Sampling Methods based on distance or transition matrix
         – Multi Step Problems
                    p
         – Different Learning Algorithms
         – Different Knowledge representations
         – Knowledge Covering Metrics
         – Applying Data Set Complexity Metrics Suite



                                                                                 Slide 22
GRSI
GRSI


       Artificial Data Sets based on Knowledge Generators:
       Analysis of Learning Algorithms Efficiency
            y              gg                   y
         Joaquin Rios Boutin, Albert Orriols-Puig, Josep-Maria Garrell-Guiu
                         {j
                         {jrios, aorriols, josepmg}@salle.url.edu
                                           j   p g}@

       GRSI (Grup de Recerca en Sistemes Intel·ligents)
             http://www.salle.url.edu/GRSI
           • http://www salle url edu/GRSI


       – Oriented to:
           • CBR (Computer Based Reasoning) Algorithms
           • Evolutive Computation Algorithms
           • Data Mining Technology Transfer




                                 Enginyeria i Arquitectura la Salle           Slide 23
GRSI

More Related Content

Viewers also liked

ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...
ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...
ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...Albert Orriols-Puig
 
CCIA'2008: On the dimensions of data complexity through synthetic data sets
CCIA'2008: On the dimensions of data complexity through synthetic data setsCCIA'2008: On the dimensions of data complexity through synthetic data sets
CCIA'2008: On the dimensions of data complexity through synthetic data setsAlbert Orriols-Puig
 
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCSHIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCSAlbert Orriols-Puig
 
GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...
GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...
GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...Albert Orriols-Puig
 
HAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasetsHAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasetsAlbert Orriols-Puig
 
Lecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligenceLecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligenceAlbert Orriols-Puig
 
Artificial Intelligence Presentation
Artificial Intelligence PresentationArtificial Intelligence Presentation
Artificial Intelligence Presentationlpaviglianiti
 

Viewers also liked (11)

ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...
ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...
ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...
 
CCIA'2008: On the dimensions of data complexity through synthetic data sets
CCIA'2008: On the dimensions of data complexity through synthetic data setsCCIA'2008: On the dimensions of data complexity through synthetic data sets
CCIA'2008: On the dimensions of data complexity through synthetic data sets
 
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCSHIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
 
GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...
GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...
GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...
 
Lecture7 - IBk
Lecture7 - IBkLecture7 - IBk
Lecture7 - IBk
 
Lecture24
Lecture24Lecture24
Lecture24
 
Lecture23
Lecture23Lecture23
Lecture23
 
HAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasetsHAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasets
 
Lecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligenceLecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligence
 
Artificial Intelligence Presentation
Artificial Intelligence PresentationArtificial Intelligence Presentation
Artificial Intelligence Presentation
 
Artificial inteligence
Artificial inteligenceArtificial inteligence
Artificial inteligence
 

Similar to Artificial Data Sets based on Knowledge Generators

The Influence of Extensible Algorithms on Operating Systems
The Influence of Extensible Algorithms on Operating SystemsThe Influence of Extensible Algorithms on Operating Systems
The Influence of Extensible Algorithms on Operating Systemsricky_pi_tercios
 
Meetup Python Madrid 2018: ¿Segmentación semántica? ¿Pero de qué me estás hab...
Meetup Python Madrid 2018: ¿Segmentación semántica? ¿Pero de qué me estás hab...Meetup Python Madrid 2018: ¿Segmentación semántica? ¿Pero de qué me estás hab...
Meetup Python Madrid 2018: ¿Segmentación semántica? ¿Pero de qué me estás hab...Ricardo Guerrero Gómez-Olmedo
 
Artificial Intelligence IA at the service of Laboratories
Artificial Intelligence IA at the service of LaboratoriesArtificial Intelligence IA at the service of Laboratories
Artificial Intelligence IA at the service of LaboratoriesYvon Gervaise
 
Conférence Y. GervaiseEN1st Green Analytical Y. Gervaise.pdf
Conférence Y. GervaiseEN1st Green Analytical Y. Gervaise.pdfConférence Y. GervaiseEN1st Green Analytical Y. Gervaise.pdf
Conférence Y. GervaiseEN1st Green Analytical Y. Gervaise.pdfYvonGervaise
 
Data Mining
Data MiningData Mining
Data Miningswami920
 
Smart Data Webinar: Machine Learning Update
Smart Data Webinar: Machine Learning UpdateSmart Data Webinar: Machine Learning Update
Smart Data Webinar: Machine Learning UpdateDATAVERSITY
 
IBM Meetup on November 1, 2018: Machine Learning made easy with Watson Studio
IBM Meetup on November 1, 2018: Machine Learning made easy with Watson StudioIBM Meetup on November 1, 2018: Machine Learning made easy with Watson Studio
IBM Meetup on November 1, 2018: Machine Learning made easy with Watson StudioSvetlana Levitan, PhD
 
Four Problems You Run into When DIY-ing a “Big Data” Analytics System
Four Problems You Run into When DIY-ing a “Big Data” Analytics SystemFour Problems You Run into When DIY-ing a “Big Data” Analytics System
Four Problems You Run into When DIY-ing a “Big Data” Analytics SystemTreasure Data, Inc.
 
Lecture-1-2-+(1).pdf
Lecture-1-2-+(1).pdfLecture-1-2-+(1).pdf
Lecture-1-2-+(1).pdfsamaghorab
 
Lecture-1-2-+(1).pdf
Lecture-1-2-+(1).pdfLecture-1-2-+(1).pdf
Lecture-1-2-+(1).pdfsamaghorab
 
IRJET- Object Detection in an Image using Convolutional Neural Network
IRJET- Object Detection in an Image using Convolutional Neural NetworkIRJET- Object Detection in an Image using Convolutional Neural Network
IRJET- Object Detection in an Image using Convolutional Neural NetworkIRJET Journal
 
MachinaFiesta: A Vision into Machine Learning 🚀
MachinaFiesta: A Vision into Machine Learning 🚀MachinaFiesta: A Vision into Machine Learning 🚀
MachinaFiesta: A Vision into Machine Learning 🚀GDSCNiT
 
An introduction to Machine Learning with scikit-learn (October 2018)
An introduction to Machine Learning with scikit-learn (October 2018)An introduction to Machine Learning with scikit-learn (October 2018)
An introduction to Machine Learning with scikit-learn (October 2018)Julien SIMON
 
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...vinayiqbusiness
 
Machine Learning for Speech
Machine Learning for Speech Machine Learning for Speech
Machine Learning for Speech butest
 
How to test an AI application
How to test an AI applicationHow to test an AI application
How to test an AI applicationKari Kakkonen
 
Integrating R with the CDK: Enhanced Chemical Data Mining
Integrating R with the CDK: Enhanced Chemical Data MiningIntegrating R with the CDK: Enhanced Chemical Data Mining
Integrating R with the CDK: Enhanced Chemical Data MiningRajarshi Guha
 
How to implement artificial intelligence solutions
How to implement artificial intelligence solutionsHow to implement artificial intelligence solutions
How to implement artificial intelligence solutionsCarlos Toxtli
 
Final Year Project Guidance
Final Year Project GuidanceFinal Year Project Guidance
Final Year Project GuidanceVarad Meru
 

Similar to Artificial Data Sets based on Knowledge Generators (20)

The Influence of Extensible Algorithms on Operating Systems
The Influence of Extensible Algorithms on Operating SystemsThe Influence of Extensible Algorithms on Operating Systems
The Influence of Extensible Algorithms on Operating Systems
 
Meetup Python Madrid 2018: ¿Segmentación semántica? ¿Pero de qué me estás hab...
Meetup Python Madrid 2018: ¿Segmentación semántica? ¿Pero de qué me estás hab...Meetup Python Madrid 2018: ¿Segmentación semántica? ¿Pero de qué me estás hab...
Meetup Python Madrid 2018: ¿Segmentación semántica? ¿Pero de qué me estás hab...
 
Artificial Intelligence IA at the service of Laboratories
Artificial Intelligence IA at the service of LaboratoriesArtificial Intelligence IA at the service of Laboratories
Artificial Intelligence IA at the service of Laboratories
 
Conférence Y. GervaiseEN1st Green Analytical Y. Gervaise.pdf
Conférence Y. GervaiseEN1st Green Analytical Y. Gervaise.pdfConférence Y. GervaiseEN1st Green Analytical Y. Gervaise.pdf
Conférence Y. GervaiseEN1st Green Analytical Y. Gervaise.pdf
 
Data Mining
Data MiningData Mining
Data Mining
 
Smart Data Webinar: Machine Learning Update
Smart Data Webinar: Machine Learning UpdateSmart Data Webinar: Machine Learning Update
Smart Data Webinar: Machine Learning Update
 
IBM Meetup on November 1, 2018: Machine Learning made easy with Watson Studio
IBM Meetup on November 1, 2018: Machine Learning made easy with Watson StudioIBM Meetup on November 1, 2018: Machine Learning made easy with Watson Studio
IBM Meetup on November 1, 2018: Machine Learning made easy with Watson Studio
 
Four Problems You Run into When DIY-ing a “Big Data” Analytics System
Four Problems You Run into When DIY-ing a “Big Data” Analytics SystemFour Problems You Run into When DIY-ing a “Big Data” Analytics System
Four Problems You Run into When DIY-ing a “Big Data” Analytics System
 
Lecture-1-2-+(1).pdf
Lecture-1-2-+(1).pdfLecture-1-2-+(1).pdf
Lecture-1-2-+(1).pdf
 
Lecture-1-2-+(1).pdf
Lecture-1-2-+(1).pdfLecture-1-2-+(1).pdf
Lecture-1-2-+(1).pdf
 
IRJET- Object Detection in an Image using Convolutional Neural Network
IRJET- Object Detection in an Image using Convolutional Neural NetworkIRJET- Object Detection in an Image using Convolutional Neural Network
IRJET- Object Detection in an Image using Convolutional Neural Network
 
MachinaFiesta: A Vision into Machine Learning 🚀
MachinaFiesta: A Vision into Machine Learning 🚀MachinaFiesta: A Vision into Machine Learning 🚀
MachinaFiesta: A Vision into Machine Learning 🚀
 
An introduction to Machine Learning with scikit-learn (October 2018)
An introduction to Machine Learning with scikit-learn (October 2018)An introduction to Machine Learning with scikit-learn (October 2018)
An introduction to Machine Learning with scikit-learn (October 2018)
 
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
 
Machine Learning for Speech
Machine Learning for Speech Machine Learning for Speech
Machine Learning for Speech
 
How to test an AI application
How to test an AI applicationHow to test an AI application
How to test an AI application
 
Integrating R with the CDK: Enhanced Chemical Data Mining
Integrating R with the CDK: Enhanced Chemical Data MiningIntegrating R with the CDK: Enhanced Chemical Data Mining
Integrating R with the CDK: Enhanced Chemical Data Mining
 
How to implement artificial intelligence solutions
How to implement artificial intelligence solutionsHow to implement artificial intelligence solutions
How to implement artificial intelligence solutions
 
Apple Machine Learning
Apple Machine LearningApple Machine Learning
Apple Machine Learning
 
Final Year Project Guidance
Final Year Project GuidanceFinal Year Project Guidance
Final Year Project Guidance
 

More from Albert Orriols-Puig (20)

Lecture22
Lecture22Lecture22
Lecture22
 
Lecture21
Lecture21Lecture21
Lecture21
 
Lecture20
Lecture20Lecture20
Lecture20
 
Lecture19
Lecture19Lecture19
Lecture19
 
Lecture18
Lecture18Lecture18
Lecture18
 
Lecture17
Lecture17Lecture17
Lecture17
 
Lecture16 - Advances topics on association rules PART III
Lecture16 - Advances topics on association rules PART IIILecture16 - Advances topics on association rules PART III
Lecture16 - Advances topics on association rules PART III
 
Lecture15 - Advances topics on association rules PART II
Lecture15 - Advances topics on association rules PART IILecture15 - Advances topics on association rules PART II
Lecture15 - Advances topics on association rules PART II
 
Lecture14 - Advanced topics in association rules
Lecture14 - Advanced topics in association rulesLecture14 - Advanced topics in association rules
Lecture14 - Advanced topics in association rules
 
Lecture13 - Association Rules
Lecture13 - Association RulesLecture13 - Association Rules
Lecture13 - Association Rules
 
Lecture12 - SVM
Lecture12 - SVMLecture12 - SVM
Lecture12 - SVM
 
Lecture11 - neural networks
Lecture11 - neural networksLecture11 - neural networks
Lecture11 - neural networks
 
Lecture10 - Naïve Bayes
Lecture10 - Naïve BayesLecture10 - Naïve Bayes
Lecture10 - Naïve Bayes
 
Lecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-TheoryLecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-Theory
 
Lecture8 - From CBR to IBk
Lecture8 - From CBR to IBkLecture8 - From CBR to IBk
Lecture8 - From CBR to IBk
 
Lecture6 - C4.5
Lecture6 - C4.5Lecture6 - C4.5
Lecture6 - C4.5
 
Lecture5 - C4.5
Lecture5 - C4.5Lecture5 - C4.5
Lecture5 - C4.5
 
Lecture4 - Machine Learning
Lecture4 - Machine LearningLecture4 - Machine Learning
Lecture4 - Machine Learning
 
Lecture3 - Machine Learning
Lecture3 - Machine LearningLecture3 - Machine Learning
Lecture3 - Machine Learning
 
Lecture2 - Machine Learning
Lecture2 - Machine LearningLecture2 - Machine Learning
Lecture2 - Machine Learning
 

Recently uploaded

Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...Pooja Nehwal
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...anjaliyadav012327
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxShobhayan Kirtania
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 

Recently uploaded (20)

Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptx
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 

Artificial Data Sets based on Knowledge Generators

  • 1. Artificial Data Sets based on Knowledge Generators: Analysis of g y Learning Algorithms Efficiency Joaquin Rios-Boutin Rios Boutin Albert Orriols-Puig Josep Maria Garrell Guiu Josep-Maria Garrell-Guiu Grup de Recerca en Sistemes Intel·ligents Enginyeria i Arquitectura La Salle Universitat Ramon Llull Salle, {jrios, aorriols, josepmg}@salle.url.edu
  • 2. Motivation What is the Holy Grail of Machine Learning? – Find the right Learning Algorithm to every Problem – Real Problems are black boxes • We don’t know which knowledge is contained DI • We can’t answer: – When to stop training? – How much efficient is the learning process? – Artificial Problems: DI K • Knowledge-driven • Property-driven Complex.Met. Enginyeria i Arquitectura la Salle Slide 2 GRSI
  • 3. Framework Machine Learning as a Communication System Communication Chanel Learning Environment. Algorithm. Al ith Data Set Knowledge Learned to be learned Knowledge Enginyeria i Arquitectura la Salle Slide 3 GRSI
  • 4. Outline 1. Algorithm Evaluation Methodology Definition 1 Al ih E l i M hdl D fi i i 2. 2 Methodology Implementation 3. Experiment Description 4. Results and Analysis 5. Conclusions and Further Work Enginyeria i Arquitectura la Salle Slide 4 GRSI
  • 5. 1 Algorithm Evaluation Process g Process Execution and Control DB Problem Sampling Sampling Algorithm Data Set Method Size Parameters Accuracy DI Learning Generation Algorithm 1.2 DS1kMulplx6m1 1 0.8 0.6 0.4 0.2 0 0 2000 4000 6000 8000 10000 Knowledge Comparison 1.2 DS1kMulplx6m1 1 Optimal p 0.8 0.6 Population 0.4 0.2 0 0 2000 4000 6000 8000 10000 Enginyeria i Arquitectura la Salle Slide 5 GRSI
  • 6. 1 Algorithm Evaluation Process Dimensions 100000 10000 mpling g 1000 Size 100 Sam S Apn A 10 Alg.P 1 AP1 aram SRS SIS RRS RIS . Sampling Methods To each Problem Enginyeria i Arquitectura la Salle Slide 6 GRSI
  • 7. Outline 1. Algorithm Evaluation Methodology Definition 1 Al ih E l i M hdl D fi i i 2. 2 Methodology Implementation 3. Experiment Description 4. Results and Analysis 5. Conclusions and Further Work Enginyeria i Arquitectura la Salle Slide 7 GRSI
  • 8. 2 Knowledge Representation g p Condition Class/Action a11 a12 a1j a1m C1 Rule1 ai1 ai2 aij aim Ci Rule Set an1 an2 anj anm Cn aij={0,1, #} CiєN {0,1, Enginyeria i Arquitectura la Salle Slide 8 GRSI
  • 9. 2 Sampling Methods pg SRS Sequential Rule Selection SIS Sequential Instance Selection Sequential # substitution 1st 2nd Random # substitution 2nd 1st RRS Random Rule Selection RIS Random Instance Selection Random # substitution Sequential # 2nd 1st substitution 2nd 1st Enginyeria i Arquitectura la Salle Slide 9 GRSI
  • 10. 2 Problems to learn and Learning Algorithm Mux6 Mux11 Parity5 0 0 # # # 0 0 0 0 0 0 0 0 0 0 # # # 1 1 0 0 0 0 1 1 0 1 # # 0 # 0 0 0 0 1 0 1 0 1 # # 1 # 1 0 0 0 1 1 0 1 1 0 # # # 0 1 1 1 1 0 0 XCS 1 1 1 # # # 1 1 1 1 1 1 1 Parity5-3 Position5 Position11 0 0 0 0 0 0 0 0 0 0 0 # # # 0 0 0 0 0 1 1 0 0 0 0 1 # # # 1 0 0 0 1 # 2 0 0 0 1 0 # # # 1 0 0 1 # # 3 0 0 0 1 1 # # # 0 1 # # # # 5 1 1 1 1 0 # # # 0 1 1 1 1 1 # # # 1 Enginyeria i Arquitectura la Salle Slide 10 GRSI
  • 11. 2 Problem Properties p Optimal Rule Sets – Complete – Non overlapped – Irreducible Why? – Simple structure of knowledge complexity –V Very k known artificial problems tifi i l bl Enginyeria i Arquitectura la Salle Slide 11 GRSI
  • 12. Outline 1. Algorithm Evaluation Methodology Definition 1 Al ih E l i M hdl D fi i i 2. 2 Methodology Implementation 3. Experiment Description 4. Results and Analysis 5. Conclusions and Further Work Enginyeria i Arquitectura la Salle Slide 12 GRSI
  • 13. 3 Sampling and Learning Iteration pg g { {Sampling Iteration} Problem {Training Iteration} pg } { g } Sampling Sampling Algorithm Data Set Method Size Parameters Accuracy DI Learning Genaration Algorithm 1.2 DS1kMulplx6m1 1 0.8 0.6 0.4 0.2 0 0 2000 4000 6000 8000 10000 Knowledge Comparison 1.2 DS1kMulplx6m1 1 Optimal 0.8 Population P l ti 0.6 0.4 0.2 0 0 2000 4000 6000 8000 10000 Enginyeria i Arquitectura la Salle Slide 13 GRSI
  • 14. 3 Output Results and Iteration Reduction p Output Results – 2 Plots to every Problem Sampling Method Sampling Size and Problem, Method, Algorithm Parameters. 1.2 DS1kMulplx6m1 • Optimal Population 1 • Accuracy 0.8 Iteration R d ti It ti Reduction 0.6 – SIS Pure sequential 0.4 • No Sampling Iteration Needed 0.2 – Problems without “don’t care” 0 • SRS=SIS and RRS=RIS 0 2000 4000 6000 8000 10000 Slide 14 GRSI
  • 15. 3 Experimental Parameters p Number of Problems = 6 Number f Sampling M th d = 4 N b of S li Methods Number of different Sampling Sizes = 4 Number of different Algorithms Parameters Sets = 2 Number f Sampling It ti N b of S li Iterations = 10 Number of Training Iterations = 10 Number of Data Sets Generated = 744 Number of Training Process = 14880 Slide 15 GRSI
  • 16. Outline 1. Algorithm Evaluation Methodology Definition 1 Al ih E l i M hdl D fi i i 2. 2 Methodology Implementation 3. Experiment Description 4. Results and Analysis 5. Conclusions and Further Work Enginyeria i Arquitectura la Salle Slide 16 GRSI
  • 17. Problem Dimension Sampling M. = RIS Sampling Size = 1000 Learning Alg. Param. = pDNC 0.2 M Alg Param 02 1.2 Mux6 1.1 DS1kMulplx6m4 DS1kParity5m4 Parity5 1 1 0.9 0.8 0.8 0.6 0.7 0.6 0.4 0.5 0.2 0.4 0 0.3 -0.2 0.2 0 2000 4000 6000 8000 10000 0 2000 4000 6000 8000 10000 1.05 DS1kMulplx6m4 1.05 DS1kParity5m4 1 1 0.95 0.9 0.95 0.85 0.9 0.8 0.75 0.85 0.7 0.65 0.8 0.6 0.75 0.55 0 2000 4000 6000 8000 10000 0 2000 4000 6000 8000 10000 Slide 17 GRSI
  • 18. Sampling Method Dimension pg Problem = Position5 Sampling Size = 1000 Learning Alg. Param. = pDNC 0.2 Alg Param 02 SRS Sequential Rule Selection RIS Random Instance Selection 1.2 0.9 DS1kPosition5m1 DS1kPosition5m4 0.8 1 0.7 0.6 0.8 0.5 0.6 0.4 0.3 0.4 0.2 0.1 0.2 0 0 -0.1 0 2000 4000 6000 8000 10000 0 2000 4000 6000 8000 10000 1.1 1.1 DS1kPosition5m1 DS1kPosition5m4 1 1 0.9 0.9 0.8 0.8 0.7 0.7 0.6 0.6 0.5 0.5 0.4 0.4 0.3 0.3 0 2000 4000 6000 8000 10000 0 2000 4000 6000 8000 10000 Slide 18 GRSI
  • 19. Sampling Size Dimension pg Problem = Parity5 Sampling M.= RIS Learning Alg. Param. = pDNC 0.2 M= Alg Param 02 1.1 1.1 DS100Parity5m4 100 DS10kParity5m4 10000 1 1 0.9 0.9 0.8 0.8 0.7 0.7 0.6 0.6 0.5 0.5 0.4 0.4 0.3 0.3 0.2 0.1 0.2 0 2000 4000 6000 8000 10000 0 2000 4000 6000 8000 10000 1.05 1.05 DS10kParity5m4 DS100Parity5m4 1 1 0.95 0.95 0.9 0.9 0.85 0.85 0.8 0.8 0.75 0.7 0.75 0.65 0.7 0.6 0.65 0.55 0 2000 4000 6000 8000 10000 0.6 0 2000 4000 6000 8000 10000 Slide 19 GRSI
  • 20. Parameter Algorithm Dimension g Problem = Mux6 Sampling M. = RIS Sampling Size = 1000 M 1 1.2 DS1kMulplx6m4 DS1kMulplx6m4 pDNC 0.8 0.9 pDNC 0.2 1 0.8 0.7 0.8 0.6 0.6 0.5 0.4 0.4 0.3 0.2 0.2 0.1 0 0 -0.1 -0.2 0 2000 4000 6000 8000 10000 0 2000 4000 6000 8000 10000 1.05 DS1kMulplx6m4 1.05 DS1kMulplx6m4 1 0.95 1 0.9 0.85 0.95 0.8 0.9 0.75 0.7 0.85 0.65 0.6 0.8 0.55 0.5 0.75 0 2000 4000 6000 8000 10000 0 2000 4000 6000 8000 10000 Slide 20 GRSI
  • 21. Outline 1. Algorithm Evaluation Methodology Definition 1 Al ih E l i M hdl D fi i i 2. 2 Methodology Implementation 3. Experiment Description 4. Results and Analysis 5. Conclusions and Further Work Enginyeria i Arquitectura la Salle Slide 21 GRSI
  • 22. Conclusions and Further Work Conclusions – Automatic Learning Algorithm Analyzer based on Artificial Data Sets – Four dimensions comparisons – Methodology Implementation, Experiment and Results Analysis Further Work – Non ORS Problems – R l Att ib t Real Attributes – Sampling Methods based on distance or transition matrix – Multi Step Problems p – Different Learning Algorithms – Different Knowledge representations – Knowledge Covering Metrics – Applying Data Set Complexity Metrics Suite Slide 22 GRSI
  • 23. GRSI Artificial Data Sets based on Knowledge Generators: Analysis of Learning Algorithms Efficiency y gg y Joaquin Rios Boutin, Albert Orriols-Puig, Josep-Maria Garrell-Guiu {j {jrios, aorriols, josepmg}@salle.url.edu j p g}@ GRSI (Grup de Recerca en Sistemes Intel·ligents) http://www.salle.url.edu/GRSI • http://www salle url edu/GRSI – Oriented to: • CBR (Computer Based Reasoning) Algorithms • Evolutive Computation Algorithms • Data Mining Technology Transfer Enginyeria i Arquitectura la Salle Slide 23 GRSI