SlideShare a Scribd company logo
1 of 2
In a big number of situations where a supervised classifier is employed, we have to face the problem of
labelling data at a huge cost. In such cases we can apply active learning techniques where the model
which is learning, is allowed to select the next instance or set of instances to be labelled. Hence, the
model will try to choose the best instance to be labelled in a way it can learn a good classifier using
fewer labels than would be required in the traditional paradigm. Diverse approaches have been
proposed for applying active learning. Most of them are thought of for binary classification problems.
One of these approaches is the one exposed by [Cohn 1] that consists of choosing instances within the
uncertainty region so the region gets smaller and smaller in every iteration. This approach could be too
expensive due to the necessity of recalculating explicitly the uncertainty region in every iteration. Other
approaches have been exposed where the uncertainty region does not need to be calculated in a
explicitly way. One of them is Uncertainty Sampling proposed by [Lewis] and consists of choosing the
instance which presents the biggest uncertainty which is measured using the current classifier model.
This is only applicable to some kinds of learning probabilistic models which return a membership
probability of an instance to every possible label. A general purpose approach is Query By Committee
which was proposed by [QBC]. The main idea of this method is to keep a committee of classifiers in
order to infer the label of every candidate instance. Finally, the instance whose disagreement in the
committee is biggest is chosen. There are other papers which apply active learning to other kinds of
problems. The next step is the multiclass problem. Some criterions have been proposed for this kind of
problem, such as the ones we can find in [multiclass] and [video multiclass]. Active learning has been
applied to regression problems with successful results. [Cohn 2] studied several ways for valuing the
information of instances in this kind of problem such as the variance or the KL-divergence. Active
learning has also been applied to other kinds of problems such as class based ranking ones. One
example of this is [bootstrap-LV], which uses a committee where the criterion for choosing a new
instance is based on the variance.



We face two different kinds of situations in our system:



• Learning the weights of the metaheuristics: we must train a model in order to infer what weight is
adequate for every instance and every metaheuristic. This value is in the range [0, 1] and the sum of the
weights of all metaheuristics for an instance is 1. We can not class this problem as a pure regression
one or a ranking problem but it keeps several properties of both of them.



• Learning the parameters for every metaheuristic: When an instance is given, we need to infer what
parameters are the best for every metaheuristic. Hence, it is a set of classification problems where the
labels are the possible values a parameter can take. It is important to consider that more than 2 labels
can exist to be chosen.
In the first situation we have chosen to make a committee strategy. In this way, in every iteration, the
variance of the distribution that results when an instance is inferred by all the members of the
committee is calculated.Therefore we get a variance vector of length n for every instance, being n the
number of metaheuristics considered. Finally we select the instance whose sum of variances is the
biggest, so that it is the way of measuring the disagreement between the members of the committee.



In the second situation we have chosen to make another committee for every parameter. However, in
this case, we have used another selection criterion. Now it is not needed to infer the weights but to
decide between a set of labels, so we are interested in choosing the instance whose uncertainty is the
biggest. [multiclass] proposes a method which has reported very good scores in multiclass problems. It
consists of choosing the instance which minimizes the difference between the committee’s output for
the most and the second most popular class label. That is the option we have chosen and we have got
very good results.



It is important to point out that in the first situation we can not use the last approach because the
problem is not a classification one. For example, if in the first situation we have an instance whose
margin between the most and the second most popular label is pretty low, it does not mean that this
class has a lot of uncertainty because it is possible that these estimated values are very close to the real
ones.

More Related Content

Similar to activeLearning

Assignment oprations research luv
Assignment oprations research luvAssignment oprations research luv
Assignment oprations research luvAshok Sharma
 
Introduction to decision making methods
Introduction to decision making methodsIntroduction to decision making methods
Introduction to decision making methodsDr Ghaiath Hussein
 
1645 track2 brandenburger_lempola
1645 track2 brandenburger_lempola1645 track2 brandenburger_lempola
1645 track2 brandenburger_lempolaRising Media, Inc.
 
Presentation on supervised learning
Presentation on supervised learningPresentation on supervised learning
Presentation on supervised learningTonmoy Bhagawati
 
Research trends in data warehousing and data mining
Research trends in data warehousing and data miningResearch trends in data warehousing and data mining
Research trends in data warehousing and data miningEr. Nawaraj Bhandari
 
Binary Class and Multi Class Strategies for Machine Learning
Binary Class and Multi Class Strategies for Machine LearningBinary Class and Multi Class Strategies for Machine Learning
Binary Class and Multi Class Strategies for Machine LearningPaxcel Technologies
 
Heuristics for the Maximal Diversity Selection Problem
Heuristics for the Maximal Diversity Selection ProblemHeuristics for the Maximal Diversity Selection Problem
Heuristics for the Maximal Diversity Selection ProblemIJMER
 
ABOU-NAOUM_AMANE_ROUGUI_Article
ABOU-NAOUM_AMANE_ROUGUI_ArticleABOU-NAOUM_AMANE_ROUGUI_Article
ABOU-NAOUM_AMANE_ROUGUI_ArticleAnthony Abou Naoum
 
Multi label text classification
Multi label text classificationMulti label text classification
Multi label text classificationraghavr186
 
Multi-Cluster Based Approach for skewed Data in Data Mining
Multi-Cluster Based Approach for skewed Data in Data MiningMulti-Cluster Based Approach for skewed Data in Data Mining
Multi-Cluster Based Approach for skewed Data in Data MiningIOSR Journals
 
Figure 1
Figure 1Figure 1
Figure 1butest
 
Adapted Branch-and-Bound Algorithm Using SVM With Model Selection
Adapted Branch-and-Bound Algorithm Using SVM With Model SelectionAdapted Branch-and-Bound Algorithm Using SVM With Model Selection
Adapted Branch-and-Bound Algorithm Using SVM With Model SelectionIJECEIAES
 
International Journal of Mathematics and Statistics Invention (IJMSI)
International Journal of Mathematics and Statistics Invention (IJMSI)International Journal of Mathematics and Statistics Invention (IJMSI)
International Journal of Mathematics and Statistics Invention (IJMSI)inventionjournals
 
simple discriminant
simple discriminantsimple discriminant
simple discriminantneha singh
 
DIY market segmentation 20170125
DIY market segmentation 20170125DIY market segmentation 20170125
DIY market segmentation 20170125Displayr
 

Similar to activeLearning (20)

Assignment oprations research luv
Assignment oprations research luvAssignment oprations research luv
Assignment oprations research luv
 
Introduction to decision making methods
Introduction to decision making methodsIntroduction to decision making methods
Introduction to decision making methods
 
AJSR_23_01
AJSR_23_01AJSR_23_01
AJSR_23_01
 
U0 vqmtq2otq=
U0 vqmtq2otq=U0 vqmtq2otq=
U0 vqmtq2otq=
 
Angelie
AngelieAngelie
Angelie
 
1645 track2 brandenburger_lempola
1645 track2 brandenburger_lempola1645 track2 brandenburger_lempola
1645 track2 brandenburger_lempola
 
Presentation on supervised learning
Presentation on supervised learningPresentation on supervised learning
Presentation on supervised learning
 
Research trends in data warehousing and data mining
Research trends in data warehousing and data miningResearch trends in data warehousing and data mining
Research trends in data warehousing and data mining
 
Binary Class and Multi Class Strategies for Machine Learning
Binary Class and Multi Class Strategies for Machine LearningBinary Class and Multi Class Strategies for Machine Learning
Binary Class and Multi Class Strategies for Machine Learning
 
Heuristics for the Maximal Diversity Selection Problem
Heuristics for the Maximal Diversity Selection ProblemHeuristics for the Maximal Diversity Selection Problem
Heuristics for the Maximal Diversity Selection Problem
 
ABOU-NAOUM_AMANE_ROUGUI_Article
ABOU-NAOUM_AMANE_ROUGUI_ArticleABOU-NAOUM_AMANE_ROUGUI_Article
ABOU-NAOUM_AMANE_ROUGUI_Article
 
Ar03402580261
Ar03402580261Ar03402580261
Ar03402580261
 
Multi label text classification
Multi label text classificationMulti label text classification
Multi label text classification
 
Multi-Cluster Based Approach for skewed Data in Data Mining
Multi-Cluster Based Approach for skewed Data in Data MiningMulti-Cluster Based Approach for skewed Data in Data Mining
Multi-Cluster Based Approach for skewed Data in Data Mining
 
Figure 1
Figure 1Figure 1
Figure 1
 
Adapted Branch-and-Bound Algorithm Using SVM With Model Selection
Adapted Branch-and-Bound Algorithm Using SVM With Model SelectionAdapted Branch-and-Bound Algorithm Using SVM With Model Selection
Adapted Branch-and-Bound Algorithm Using SVM With Model Selection
 
International Journal of Mathematics and Statistics Invention (IJMSI)
International Journal of Mathematics and Statistics Invention (IJMSI)International Journal of Mathematics and Statistics Invention (IJMSI)
International Journal of Mathematics and Statistics Invention (IJMSI)
 
simple discriminant
simple discriminantsimple discriminant
simple discriminant
 
DIY market segmentation 20170125
DIY market segmentation 20170125DIY market segmentation 20170125
DIY market segmentation 20170125
 
Attitude scales
Attitude scalesAttitude scales
Attitude scales
 

More from ingles garantizado murcia

2000 frases murcia examen ingles first certificate cambridge .soy juez del ex...
2000 frases murcia examen ingles first certificate cambridge .soy juez del ex...2000 frases murcia examen ingles first certificate cambridge .soy juez del ex...
2000 frases murcia examen ingles first certificate cambridge .soy juez del ex...ingles garantizado murcia
 
11th_Diagnostic_Data_Sufficiency_Explanations
11th_Diagnostic_Data_Sufficiency_Explanations11th_Diagnostic_Data_Sufficiency_Explanations
11th_Diagnostic_Data_Sufficiency_Explanationsingles garantizado murcia
 
United States Citizen and Immigration Services
United States Citizen and Immigration ServicesUnited States Citizen and Immigration Services
United States Citizen and Immigration Servicesingles garantizado murcia
 

More from ingles garantizado murcia (20)

2000 frases murcia examen ingles first certificate cambridge .soy juez del ex...
2000 frases murcia examen ingles first certificate cambridge .soy juez del ex...2000 frases murcia examen ingles first certificate cambridge .soy juez del ex...
2000 frases murcia examen ingles first certificate cambridge .soy juez del ex...
 
academia ingles murcia
academia ingles murciaacademia ingles murcia
academia ingles murcia
 
toefl6
toefl6toefl6
toefl6
 
el libro de Seth Green EdU.
el libro de Seth Green EdU.el libro de Seth Green EdU.
el libro de Seth Green EdU.
 
esl position murcia teach english
esl position murcia teach english esl position murcia teach english
esl position murcia teach english
 
Flash idiomas Murcia
Flash idiomas MurciaFlash idiomas Murcia
Flash idiomas Murcia
 
toefl3
toefl3toefl3
toefl3
 
teacher cv
teacher cvteacher cv
teacher cv
 
TOPIC ONE
TOPIC ONETOPIC ONE
TOPIC ONE
 
11th_Diagnostic_Data_Sufficiency_Explanations
11th_Diagnostic_Data_Sufficiency_Explanations11th_Diagnostic_Data_Sufficiency_Explanations
11th_Diagnostic_Data_Sufficiency_Explanations
 
ingles garantizado murcia
ingles garantizado murciaingles garantizado murcia
ingles garantizado murcia
 
Copy of Untitled
Copy of UntitledCopy of Untitled
Copy of Untitled
 
www.inglesgarantizado.com
www.inglesgarantizado.comwww.inglesgarantizado.com
www.inglesgarantizado.com
 
Charles O'Hear CV
Charles O'Hear CVCharles O'Hear CV
Charles O'Hear CV
 
ingles clases murcia
ingles clases murciaingles clases murcia
ingles clases murcia
 
ingles garantizado murcia
ingles garantizado murciaingles garantizado murcia
ingles garantizado murcia
 
Test 7 Writing
Test 7 WritingTest 7 Writing
Test 7 Writing
 
United States Citizen and Immigration Services
United States Citizen and Immigration ServicesUnited States Citizen and Immigration Services
United States Citizen and Immigration Services
 
Doble nacionalidad
Doble nacionalidadDoble nacionalidad
Doble nacionalidad
 
ingles murcia application
ingles murcia applicationingles murcia application
ingles murcia application
 

Recently uploaded

Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...jaredbarbolino94
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,Virag Sontakke
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitolTechU
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
CELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxCELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxJiesonDelaCerna
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfadityarao40181
 
MARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupMARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupJonathanParaisoCruz
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentInMediaRes1
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 

Recently uploaded (20)

Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptx
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
CELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxCELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptx
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdf
 
MARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupMARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized Group
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media Component
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 

activeLearning

  • 1. In a big number of situations where a supervised classifier is employed, we have to face the problem of labelling data at a huge cost. In such cases we can apply active learning techniques where the model which is learning, is allowed to select the next instance or set of instances to be labelled. Hence, the model will try to choose the best instance to be labelled in a way it can learn a good classifier using fewer labels than would be required in the traditional paradigm. Diverse approaches have been proposed for applying active learning. Most of them are thought of for binary classification problems. One of these approaches is the one exposed by [Cohn 1] that consists of choosing instances within the uncertainty region so the region gets smaller and smaller in every iteration. This approach could be too expensive due to the necessity of recalculating explicitly the uncertainty region in every iteration. Other approaches have been exposed where the uncertainty region does not need to be calculated in a explicitly way. One of them is Uncertainty Sampling proposed by [Lewis] and consists of choosing the instance which presents the biggest uncertainty which is measured using the current classifier model. This is only applicable to some kinds of learning probabilistic models which return a membership probability of an instance to every possible label. A general purpose approach is Query By Committee which was proposed by [QBC]. The main idea of this method is to keep a committee of classifiers in order to infer the label of every candidate instance. Finally, the instance whose disagreement in the committee is biggest is chosen. There are other papers which apply active learning to other kinds of problems. The next step is the multiclass problem. Some criterions have been proposed for this kind of problem, such as the ones we can find in [multiclass] and [video multiclass]. Active learning has been applied to regression problems with successful results. [Cohn 2] studied several ways for valuing the information of instances in this kind of problem such as the variance or the KL-divergence. Active learning has also been applied to other kinds of problems such as class based ranking ones. One example of this is [bootstrap-LV], which uses a committee where the criterion for choosing a new instance is based on the variance. We face two different kinds of situations in our system: • Learning the weights of the metaheuristics: we must train a model in order to infer what weight is adequate for every instance and every metaheuristic. This value is in the range [0, 1] and the sum of the weights of all metaheuristics for an instance is 1. We can not class this problem as a pure regression one or a ranking problem but it keeps several properties of both of them. • Learning the parameters for every metaheuristic: When an instance is given, we need to infer what parameters are the best for every metaheuristic. Hence, it is a set of classification problems where the labels are the possible values a parameter can take. It is important to consider that more than 2 labels can exist to be chosen.
  • 2. In the first situation we have chosen to make a committee strategy. In this way, in every iteration, the variance of the distribution that results when an instance is inferred by all the members of the committee is calculated.Therefore we get a variance vector of length n for every instance, being n the number of metaheuristics considered. Finally we select the instance whose sum of variances is the biggest, so that it is the way of measuring the disagreement between the members of the committee. In the second situation we have chosen to make another committee for every parameter. However, in this case, we have used another selection criterion. Now it is not needed to infer the weights but to decide between a set of labels, so we are interested in choosing the instance whose uncertainty is the biggest. [multiclass] proposes a method which has reported very good scores in multiclass problems. It consists of choosing the instance which minimizes the difference between the committee’s output for the most and the second most popular class label. That is the option we have chosen and we have got very good results. It is important to point out that in the first situation we can not use the last approach because the problem is not a classification one. For example, if in the first situation we have an instance whose margin between the most and the second most popular label is pretty low, it does not mean that this class has a lot of uncertainty because it is possible that these estimated values are very close to the real ones.