SlideShare a Scribd company logo
Concept Learning and
the General-to-Specific Ordering
이 종우
자연언어처리연구실
Concept Learning
• Concepts or Categories
– “birds”
– “car”
– “situations in which I should study more in
order to pass the exam”
– Concept
• some subset of objects or events defined over a
larger set, or a boolean valued function defined over
this larger set.
– Learning
• inducing general functions from specific training
examples
– Concept Learning
• acquiring the definition of a general category given
a sample of positive and negative training examples
of the category
A Concept Learning Task
• Target Concept
– “days on which Aldo enjoys water sport”
• Hypothesis
– vector of 6 constraints (Sky, AirTemp, Humidity,
Wind, Water, Forecast, EnjoySport )
– Each attribute (“?”, single value or “0”)
– e.g. <?, Cold, High, ?, ?, ?>
Instance Sky AirTemp Humidity Wind Water Forecast EnjoySport
A Sunny Warm Normal Strong Warm Same No
B Sumny Warm High Strong Warm Same Yes
C Rainy Cold High Strong Warm Change No
D Sunny Warm High Strong Cool Change Yes
Training examples for the target concept EnjoySport
• Given :
– instances (X): set of iterms over which the concept
is defined.
– target concept (c) : c : X → {0, 1}
– training examples (positive/negative) : <x,c(x)>
– training set D: available training examples
– set of all possible hypotheses: H
• Determine :
– to find h(x) = c(x) (for all x in X)
Inductive Learning Hypothesis
• Inductive Learning Hypothesis
– Any good hypothesis over a sufficiently large
set of training examples will also approximate
the target function. well over unseen examples.
Concept Learning as Search
• Issue of Search
– to find training examples hypothesis that best fits
training examples
• Kinds of Space in EnjoySport
– 3*2*2*2*2 = 96: instant space
– 5*4*4*4*4 = 5120: syntactically distinct
hypotheses within H
– 1+4*3*3*3*3 = 973: semantically distinct
hypotheses
• Search Problem
– efficient search in hypothesis
space(finite/infinite)
General-to-Specific Ordering of
Hypotheses
• Hypotheses의 General-to-Specific Ordering
– x satisfies h ⇔ h(x)=1
– more_general_than_or_equal_to relations
• <Sunny,?,?,Strong,?,?> ≦ g <Sunny,?,?,?,?,?>
– more_general_than_or_equal_to relations
)]
1
)
(
(
)
1
)[(
( 





 x
h
h
X
x
h
h j
k
k
g
j
)
(
)
( j
g
k
k
g
j
k
g
j h
h
h
h
h
h 





– partial order (reflexive,antisymmetric,transitive)
Concept Learning as Search
Find-S: Finding a Maximally
Specific Hypothesis
• algorithm
• 1. Initialize h to the most specific hypothesis in H
• 2. For each positive training example x
• For each attribute constraint ai in h
– If the constraint ai is satisfied by x
– then do nothing
– else replace ai in h by the next more general constraint that is
satisfied by x
• 3. Output hypothesis h
• Property
• guaranteed to output the most specific hypothesis
• no way to determine unique hypothesis
• not cope with inconsistent errors or noises
Find-S:Finding a Maximally
Specific Hypothesis(2)
Version Spaces and the Candidate-
Elimination Algorithm
– output all hypotheses consistent with the training
examples.
– perform poorly with noisy training data.
• Representation
– Consistent(h,D) ⇔(∀<x,c(x)>  D) h(x) = c(x)
– VSH,D ⇔ {h  H | Consistent(h,D)}
• List-Then-Eliminate Algorithm
– lists all hypotheses -> remove inconsistent ones.
– Appliable to finite H
Version Spaces and the Candidate-
Elimination Algorithm(2)
• More Compact Representation for Version Spaces
– general boundary G
– specific boundary S
– Version Space redefined with S and G
)]}
,
'
(
)
'
)[
'
(
)
,
(
|
{ D
s
Consistent
g
g
H
g
D
s
Consistent
H
g
G g 






)]}
,
'
(
)
'
)[(
'
(
)
,
(
|
{ D
s
Consistent
s
s
H
s
D
s
Consistent
H
s
S g 






)}
)(
)(
(
|
{
, S
h
g
G
g
S
s
H
h
VS g
g
D
H 







Version Spaces and the Candidate-
Elimination Algorithm(3)
Version Spaces and the Candidate-
Elimination Algorithm(4)
• Condidate-Elimination Learning Algorithm
• Initialize G to the set of maximally general hypotheses in H
• Initialize S to the set of maximally specific hypotheses in H
• For each training example d, do
• If d is a positive example
• Remove from G any hypothesis inconsistent with d
• For each hypothesis s in S that is not consistent with d
• Remove s from S
• Add to S all minimal generalizations h of s such that
• h is consistent with d, and some member of G is more general
• than h
• Remove from S any hypothesis that is more general than another
• hypothesis in S
Version Spaces and the Candidate-
Elimination Algorithm(5)
• If d is a negative example
• Remove from S any hypothesis inconsistent with d
• For each hypothesis g in G that is not consistent with d
• Remove g from G
• Add to G all minimal specializations h of g such that
• h is consistent with d, and some member of S is more specific
than h
• Remove from G any hypothesis that is less general than another
hypothesis in G
Version Spaces and the Candidate-
Elimination Algorithm(6)
• Illustrative Example
Version Spaces and the Candidate-
Elimination Algorithm(7)
Version Spaces and the Candidate-
Elimination Algorithm(8)
Version Spaces and the Candidate-
Elimination Algorithm(9)
Remarks on Version Spaces and
Candidate-Elimination
• Will the Candidate-Elimination Algorithm
Converge to the Correct Hypothesis?
– Prerequisite
– 1. No error in training examples
– 2. Hypothesis exists which correctly describes c(x).
– S and G boundary sets converge to an empty set =>
no hypothesis in H consistent with observed examples.
• What Training Example Should the Learner
Request Next?
– Negative one specifies G , positive one generalizes S.
– optimal query satisfy half the hypotheses.
Remarks on Version Spaces and
Candidate-Elimination(2)
• How Can Partially Learned Concepts Be Used?
Instance Sky AirTemp Humidity Wind Water Forecast EnjoySport
A Sunny Warm Normal Strong Cool Change ?
B Rainy Cold Normal Light Warm Same ?
C Sunny Warm Normal Light Warm Same ?
D Sunny Cold Normal Strong Warm Same ?
A : classified to positive
B : classified to negative
C : 3 positive , 3 negative
D : 2 positive, 4 negative
Inductive Bias
• A Biased Hypothesis Space
Example Sky AirTemp Humidity Wind Water Forecast EnjoySport
1 Sunny Warm Normal Strong Cool Change Yes
2 Cloudy Warm Normal Strong Cool Change Yes
3 Rainy Warm Normal Strong Cool Change No
- zero hypothesis in the version space
- caused by only conjunctive hypothesis
Inductive Bias(2)
• An Unbiased Learner
– Power set of X : set of all subsets of a set X
• number of size of power set : 2|X|
– e.g. <Sunny,?,?,?,?,?> ∨ <Cloudy,?,?,?,?,?>
– new problem : unable to generalize beyond the
observed examples.
• Observed examples are only unambiguously classified.
• Voting results in no majority or minority.
Inductive Bias(3)
• The Futility of Bias-Free Learning
– no inductive bias => cannot classify unseen data
reasonably
– inductive bias of L : any minimal set of assertions B
such that
– inductive bias of Candidate-Elimination algorithm
• c ∈ H
– advantage of introducing inductive bias
• generalizing beyond the observed data
• allows comparison of different learners
)]
,
(
)
)[(
( c
i
i
c
i D
x
L
x
D
B
X
x 




Inductive Bias(4)
• e.g
– Rote-learner : no inductive bias
– Candidate-Elimination algo : c ∈ H => more strong
– Find-S : c ∈ H and that all are negative unless not
proved positive
Inductive Bias(5)
Summary
• Concept learning can be cast as a problem of searching
through a large predefined space of potential hypotheses.
• General-to-specific partial ordering of hypotheses provides
a useful structure for search.
• Find-S algorithm performs specific-to-general search to
find the most specific hypothesis.
• Candidate-Elimination algorithm computes version space
by incrementally computing the sets of maximally specific
(S) and maximally general (G) hypotheses.
• S and G delimit the entire set of hypotheses consistent with
the data.
• Version spaces and Candidate-Elimination algorithm
provide a useful conceptual framework for studying
concept learning.
• Candidate-Elimination algorithm not robust to noisy data
or to situations where the unknown target concept is not
expressible in the provided hypothesis space.
• Inductive bias in Candidate-Elimination algorithm is that
target concept exists in H
• If hypothesis space be enriched so that there is a every
possible hypothesis, that would remove the ability to
classify any instance beyond the observed examples.

More Related Content

Similar to ML02.ppt

M18 learning
M18 learningM18 learning
M18 learning
rajshreeshrivastva
 
Inductive bias
Inductive biasInductive bias
Inductive bias
swapnac12
 
ML-Inductive Bias.pdf
ML-Inductive Bias.pdfML-Inductive Bias.pdf
ML-Inductive Bias.pdf
ShivaVemula2
 
Lecture 7
Lecture 7Lecture 7
Lecture 7
butest
 
Lecture 7
Lecture 7Lecture 7
Lecture 7
butest
 
Poggi analytics - concepts - 1a
Poggi   analytics - concepts - 1aPoggi   analytics - concepts - 1a
Poggi analytics - concepts - 1a
Gaston Liberman
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
butest
 
AI Lesson 34
AI Lesson 34AI Lesson 34
AI Lesson 34
Assistant Professor
 
Computational Learning Theory
Computational Learning TheoryComputational Learning Theory
Computational Learning Theory
butest
 
Machine learning Lecture 2
Machine learning Lecture 2Machine learning Lecture 2
Machine learning Lecture 2
Srinivasan R
 
Lec 17
Lec  17Lec  17
Lec 17
Nilt1234
 
L1 intro2 supervised_learning
L1 intro2 supervised_learningL1 intro2 supervised_learning
L1 intro2 supervised_learning
Yogendra Singh
 
Domain Adaptation
Domain AdaptationDomain Adaptation
Domain Adaptation
Mark Chang
 
lecture_mooney.ppt
lecture_mooney.pptlecture_mooney.ppt
lecture_mooney.ppt
butest
 
Generalization abstraction
Generalization abstractionGeneralization abstraction
Generalization abstraction
Edward Blurock
 
"Let us talk about output features! by Florence d’Alché-Buc, LTCI & Full Prof...
"Let us talk about output features! by Florence d’Alché-Buc, LTCI & Full Prof..."Let us talk about output features! by Florence d’Alché-Buc, LTCI & Full Prof...
"Let us talk about output features! by Florence d’Alché-Buc, LTCI & Full Prof...
Paris Women in Machine Learning and Data Science
 
Model Selection and Validation
Model Selection and ValidationModel Selection and Validation
Model Selection and Validation
gmorishita
 
-BayesianLearning in machine Learning 12
-BayesianLearning in machine Learning 12-BayesianLearning in machine Learning 12
-BayesianLearning in machine Learning 12
Kumari Naveen
 
Bayesian Learning- part of machine learning
Bayesian Learning-  part of machine learningBayesian Learning-  part of machine learning
Bayesian Learning- part of machine learning
kensaleste
 
Machine learning
Machine learningMachine learning
Machine learning
Sukhwinder Singh
 

Similar to ML02.ppt (20)

M18 learning
M18 learningM18 learning
M18 learning
 
Inductive bias
Inductive biasInductive bias
Inductive bias
 
ML-Inductive Bias.pdf
ML-Inductive Bias.pdfML-Inductive Bias.pdf
ML-Inductive Bias.pdf
 
Lecture 7
Lecture 7Lecture 7
Lecture 7
 
Lecture 7
Lecture 7Lecture 7
Lecture 7
 
Poggi analytics - concepts - 1a
Poggi   analytics - concepts - 1aPoggi   analytics - concepts - 1a
Poggi analytics - concepts - 1a
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
 
AI Lesson 34
AI Lesson 34AI Lesson 34
AI Lesson 34
 
Computational Learning Theory
Computational Learning TheoryComputational Learning Theory
Computational Learning Theory
 
Machine learning Lecture 2
Machine learning Lecture 2Machine learning Lecture 2
Machine learning Lecture 2
 
Lec 17
Lec  17Lec  17
Lec 17
 
L1 intro2 supervised_learning
L1 intro2 supervised_learningL1 intro2 supervised_learning
L1 intro2 supervised_learning
 
Domain Adaptation
Domain AdaptationDomain Adaptation
Domain Adaptation
 
lecture_mooney.ppt
lecture_mooney.pptlecture_mooney.ppt
lecture_mooney.ppt
 
Generalization abstraction
Generalization abstractionGeneralization abstraction
Generalization abstraction
 
"Let us talk about output features! by Florence d’Alché-Buc, LTCI & Full Prof...
"Let us talk about output features! by Florence d’Alché-Buc, LTCI & Full Prof..."Let us talk about output features! by Florence d’Alché-Buc, LTCI & Full Prof...
"Let us talk about output features! by Florence d’Alché-Buc, LTCI & Full Prof...
 
Model Selection and Validation
Model Selection and ValidationModel Selection and Validation
Model Selection and Validation
 
-BayesianLearning in machine Learning 12
-BayesianLearning in machine Learning 12-BayesianLearning in machine Learning 12
-BayesianLearning in machine Learning 12
 
Bayesian Learning- part of machine learning
Bayesian Learning-  part of machine learningBayesian Learning-  part of machine learning
Bayesian Learning- part of machine learning
 
Machine learning
Machine learningMachine learning
Machine learning
 

More from ssuserec53e73

iotshoe for elderly people monitoring..
iotshoe  for elderly people monitoring..iotshoe  for elderly people monitoring..
iotshoe for elderly people monitoring..
ssuserec53e73
 
Threats in network that can be noted in security
Threats in network that can be noted in securityThreats in network that can be noted in security
Threats in network that can be noted in security
ssuserec53e73
 
Lsn21_NumPy in data science using python
Lsn21_NumPy in data science using pythonLsn21_NumPy in data science using python
Lsn21_NumPy in data science using python
ssuserec53e73
 
OpenSecure socket layerin cyber security
OpenSecure socket layerin cyber securityOpenSecure socket layerin cyber security
OpenSecure socket layerin cyber security
ssuserec53e73
 
Hash functions, digital signatures and hmac
Hash functions, digital signatures and hmacHash functions, digital signatures and hmac
Hash functions, digital signatures and hmac
ssuserec53e73
 
Asian Elephant Adaptations - Chelsea P..pptx
Asian Elephant Adaptations - Chelsea P..pptxAsian Elephant Adaptations - Chelsea P..pptx
Asian Elephant Adaptations - Chelsea P..pptx
ssuserec53e73
 
Module 10-Introduction to OOP.pptx
Module 10-Introduction to OOP.pptxModule 10-Introduction to OOP.pptx
Module 10-Introduction to OOP.pptx
ssuserec53e73
 
unit-1-l3.ppt
unit-1-l3.pptunit-1-l3.ppt
unit-1-l3.ppt
ssuserec53e73
 
AI.ppt
AI.pptAI.ppt
50134147-Knowledge-Representation-Using-Rules.ppt
50134147-Knowledge-Representation-Using-Rules.ppt50134147-Knowledge-Representation-Using-Rules.ppt
50134147-Knowledge-Representation-Using-Rules.ppt
ssuserec53e73
 
Dr Jose Reena K.pdf
Dr Jose Reena K.pdfDr Jose Reena K.pdf
Dr Jose Reena K.pdf
ssuserec53e73
 
Enumeration.pptx
Enumeration.pptxEnumeration.pptx
Enumeration.pptx
ssuserec53e73
 
footscan.PPT
footscan.PPTfootscan.PPT
footscan.PPT
ssuserec53e73
 
UNIT II.pptx
UNIT II.pptxUNIT II.pptx
UNIT II.pptx
ssuserec53e73
 
Unit 1 iot.pptx
Unit 1 iot.pptxUnit 1 iot.pptx
Unit 1 iot.pptx
ssuserec53e73
 
IoT Reference Architecture.pptx
IoT Reference Architecture.pptxIoT Reference Architecture.pptx
IoT Reference Architecture.pptx
ssuserec53e73
 
patent ppt.pptx
patent ppt.pptxpatent ppt.pptx
patent ppt.pptx
ssuserec53e73
 
Introduction to measurement.pptx
Introduction to measurement.pptxIntroduction to measurement.pptx
Introduction to measurement.pptx
ssuserec53e73
 
ML-DecisionTrees.ppt
ML-DecisionTrees.pptML-DecisionTrees.ppt
ML-DecisionTrees.ppt
ssuserec53e73
 
ML_Lecture_7.ppt
ML_Lecture_7.pptML_Lecture_7.ppt
ML_Lecture_7.ppt
ssuserec53e73
 

More from ssuserec53e73 (20)

iotshoe for elderly people monitoring..
iotshoe  for elderly people monitoring..iotshoe  for elderly people monitoring..
iotshoe for elderly people monitoring..
 
Threats in network that can be noted in security
Threats in network that can be noted in securityThreats in network that can be noted in security
Threats in network that can be noted in security
 
Lsn21_NumPy in data science using python
Lsn21_NumPy in data science using pythonLsn21_NumPy in data science using python
Lsn21_NumPy in data science using python
 
OpenSecure socket layerin cyber security
OpenSecure socket layerin cyber securityOpenSecure socket layerin cyber security
OpenSecure socket layerin cyber security
 
Hash functions, digital signatures and hmac
Hash functions, digital signatures and hmacHash functions, digital signatures and hmac
Hash functions, digital signatures and hmac
 
Asian Elephant Adaptations - Chelsea P..pptx
Asian Elephant Adaptations - Chelsea P..pptxAsian Elephant Adaptations - Chelsea P..pptx
Asian Elephant Adaptations - Chelsea P..pptx
 
Module 10-Introduction to OOP.pptx
Module 10-Introduction to OOP.pptxModule 10-Introduction to OOP.pptx
Module 10-Introduction to OOP.pptx
 
unit-1-l3.ppt
unit-1-l3.pptunit-1-l3.ppt
unit-1-l3.ppt
 
AI.ppt
AI.pptAI.ppt
AI.ppt
 
50134147-Knowledge-Representation-Using-Rules.ppt
50134147-Knowledge-Representation-Using-Rules.ppt50134147-Knowledge-Representation-Using-Rules.ppt
50134147-Knowledge-Representation-Using-Rules.ppt
 
Dr Jose Reena K.pdf
Dr Jose Reena K.pdfDr Jose Reena K.pdf
Dr Jose Reena K.pdf
 
Enumeration.pptx
Enumeration.pptxEnumeration.pptx
Enumeration.pptx
 
footscan.PPT
footscan.PPTfootscan.PPT
footscan.PPT
 
UNIT II.pptx
UNIT II.pptxUNIT II.pptx
UNIT II.pptx
 
Unit 1 iot.pptx
Unit 1 iot.pptxUnit 1 iot.pptx
Unit 1 iot.pptx
 
IoT Reference Architecture.pptx
IoT Reference Architecture.pptxIoT Reference Architecture.pptx
IoT Reference Architecture.pptx
 
patent ppt.pptx
patent ppt.pptxpatent ppt.pptx
patent ppt.pptx
 
Introduction to measurement.pptx
Introduction to measurement.pptxIntroduction to measurement.pptx
Introduction to measurement.pptx
 
ML-DecisionTrees.ppt
ML-DecisionTrees.pptML-DecisionTrees.ppt
ML-DecisionTrees.ppt
 
ML_Lecture_7.ppt
ML_Lecture_7.pptML_Lecture_7.ppt
ML_Lecture_7.ppt
 

Recently uploaded

A Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptxA Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptx
thanhdowork
 
Pride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School DistrictPride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School District
David Douglas School District
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
tarandeep35
 
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
IreneSebastianRueco1
 
Top five deadliest dog breeds in America
Top five deadliest dog breeds in AmericaTop five deadliest dog breeds in America
Top five deadliest dog breeds in America
Bisnar Chase Personal Injury Attorneys
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
 
MARY JANE WILSON, A “BOA MÃE” .
MARY JANE WILSON, A “BOA MÃE”           .MARY JANE WILSON, A “BOA MÃE”           .
MARY JANE WILSON, A “BOA MÃE” .
Colégio Santa Teresinha
 
DRUGS AND ITS classification slide share
DRUGS AND ITS classification slide shareDRUGS AND ITS classification slide share
DRUGS AND ITS classification slide share
taiba qazi
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
Scholarhat
 
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
National Information Standards Organization (NISO)
 
How to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP ModuleHow to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP Module
Celine George
 
World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024
ak6969907
 
How to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRMHow to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRM
Celine George
 
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdfবাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
eBook.com.bd (প্রয়োজনীয় বাংলা বই)
 
Advanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docxAdvanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docx
adhitya5119
 
The basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptxThe basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptx
heathfieldcps1
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
Levi Shapiro
 
Aficamten in HCM (SEQUOIA HCM TRIAL 2024)
Aficamten in HCM (SEQUOIA HCM TRIAL 2024)Aficamten in HCM (SEQUOIA HCM TRIAL 2024)
Aficamten in HCM (SEQUOIA HCM TRIAL 2024)
Ashish Kohli
 
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective UpskillingYour Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Excellence Foundation for South Sudan
 

Recently uploaded (20)

A Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptxA Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptx
 
Pride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School DistrictPride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School District
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
 
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
 
Top five deadliest dog breeds in America
Top five deadliest dog breeds in AmericaTop five deadliest dog breeds in America
Top five deadliest dog breeds in America
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
MARY JANE WILSON, A “BOA MÃE” .
MARY JANE WILSON, A “BOA MÃE”           .MARY JANE WILSON, A “BOA MÃE”           .
MARY JANE WILSON, A “BOA MÃE” .
 
DRUGS AND ITS classification slide share
DRUGS AND ITS classification slide shareDRUGS AND ITS classification slide share
DRUGS AND ITS classification slide share
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
 
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
 
How to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP ModuleHow to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP Module
 
World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024
 
How to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRMHow to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRM
 
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdfবাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
 
Advanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docxAdvanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docx
 
The basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptxThe basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptx
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
 
Aficamten in HCM (SEQUOIA HCM TRIAL 2024)
Aficamten in HCM (SEQUOIA HCM TRIAL 2024)Aficamten in HCM (SEQUOIA HCM TRIAL 2024)
Aficamten in HCM (SEQUOIA HCM TRIAL 2024)
 
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective UpskillingYour Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective Upskilling
 

ML02.ppt

  • 1. Concept Learning and the General-to-Specific Ordering 이 종우 자연언어처리연구실
  • 2. Concept Learning • Concepts or Categories – “birds” – “car” – “situations in which I should study more in order to pass the exam” – Concept • some subset of objects or events defined over a larger set, or a boolean valued function defined over this larger set.
  • 3. – Learning • inducing general functions from specific training examples – Concept Learning • acquiring the definition of a general category given a sample of positive and negative training examples of the category
  • 4. A Concept Learning Task • Target Concept – “days on which Aldo enjoys water sport” • Hypothesis – vector of 6 constraints (Sky, AirTemp, Humidity, Wind, Water, Forecast, EnjoySport ) – Each attribute (“?”, single value or “0”) – e.g. <?, Cold, High, ?, ?, ?>
  • 5. Instance Sky AirTemp Humidity Wind Water Forecast EnjoySport A Sunny Warm Normal Strong Warm Same No B Sumny Warm High Strong Warm Same Yes C Rainy Cold High Strong Warm Change No D Sunny Warm High Strong Cool Change Yes Training examples for the target concept EnjoySport
  • 6. • Given : – instances (X): set of iterms over which the concept is defined. – target concept (c) : c : X → {0, 1} – training examples (positive/negative) : <x,c(x)> – training set D: available training examples – set of all possible hypotheses: H • Determine : – to find h(x) = c(x) (for all x in X)
  • 7. Inductive Learning Hypothesis • Inductive Learning Hypothesis – Any good hypothesis over a sufficiently large set of training examples will also approximate the target function. well over unseen examples.
  • 8. Concept Learning as Search • Issue of Search – to find training examples hypothesis that best fits training examples • Kinds of Space in EnjoySport – 3*2*2*2*2 = 96: instant space – 5*4*4*4*4 = 5120: syntactically distinct hypotheses within H – 1+4*3*3*3*3 = 973: semantically distinct hypotheses
  • 9. • Search Problem – efficient search in hypothesis space(finite/infinite)
  • 10. General-to-Specific Ordering of Hypotheses • Hypotheses의 General-to-Specific Ordering – x satisfies h ⇔ h(x)=1 – more_general_than_or_equal_to relations • <Sunny,?,?,Strong,?,?> ≦ g <Sunny,?,?,?,?,?> – more_general_than_or_equal_to relations )] 1 ) ( ( ) 1 )[( (        x h h X x h h j k k g j ) ( ) ( j g k k g j k g j h h h h h h      
  • 11. – partial order (reflexive,antisymmetric,transitive) Concept Learning as Search
  • 12. Find-S: Finding a Maximally Specific Hypothesis • algorithm • 1. Initialize h to the most specific hypothesis in H • 2. For each positive training example x • For each attribute constraint ai in h – If the constraint ai is satisfied by x – then do nothing – else replace ai in h by the next more general constraint that is satisfied by x • 3. Output hypothesis h • Property • guaranteed to output the most specific hypothesis • no way to determine unique hypothesis • not cope with inconsistent errors or noises
  • 14. Version Spaces and the Candidate- Elimination Algorithm – output all hypotheses consistent with the training examples. – perform poorly with noisy training data. • Representation – Consistent(h,D) ⇔(∀<x,c(x)>  D) h(x) = c(x) – VSH,D ⇔ {h  H | Consistent(h,D)} • List-Then-Eliminate Algorithm – lists all hypotheses -> remove inconsistent ones. – Appliable to finite H
  • 15. Version Spaces and the Candidate- Elimination Algorithm(2) • More Compact Representation for Version Spaces – general boundary G – specific boundary S – Version Space redefined with S and G )]} , ' ( ) ' )[ ' ( ) , ( | { D s Consistent g g H g D s Consistent H g G g        )]} , ' ( ) ' )[( ' ( ) , ( | { D s Consistent s s H s D s Consistent H s S g        )} )( )( ( | { , S h g G g S s H h VS g g D H        
  • 16. Version Spaces and the Candidate- Elimination Algorithm(3)
  • 17. Version Spaces and the Candidate- Elimination Algorithm(4) • Condidate-Elimination Learning Algorithm • Initialize G to the set of maximally general hypotheses in H • Initialize S to the set of maximally specific hypotheses in H • For each training example d, do • If d is a positive example • Remove from G any hypothesis inconsistent with d • For each hypothesis s in S that is not consistent with d • Remove s from S • Add to S all minimal generalizations h of s such that • h is consistent with d, and some member of G is more general • than h • Remove from S any hypothesis that is more general than another • hypothesis in S
  • 18. Version Spaces and the Candidate- Elimination Algorithm(5) • If d is a negative example • Remove from S any hypothesis inconsistent with d • For each hypothesis g in G that is not consistent with d • Remove g from G • Add to G all minimal specializations h of g such that • h is consistent with d, and some member of S is more specific than h • Remove from G any hypothesis that is less general than another hypothesis in G
  • 19. Version Spaces and the Candidate- Elimination Algorithm(6) • Illustrative Example
  • 20. Version Spaces and the Candidate- Elimination Algorithm(7)
  • 21. Version Spaces and the Candidate- Elimination Algorithm(8)
  • 22. Version Spaces and the Candidate- Elimination Algorithm(9)
  • 23. Remarks on Version Spaces and Candidate-Elimination • Will the Candidate-Elimination Algorithm Converge to the Correct Hypothesis? – Prerequisite – 1. No error in training examples – 2. Hypothesis exists which correctly describes c(x). – S and G boundary sets converge to an empty set => no hypothesis in H consistent with observed examples. • What Training Example Should the Learner Request Next? – Negative one specifies G , positive one generalizes S. – optimal query satisfy half the hypotheses.
  • 24. Remarks on Version Spaces and Candidate-Elimination(2) • How Can Partially Learned Concepts Be Used? Instance Sky AirTemp Humidity Wind Water Forecast EnjoySport A Sunny Warm Normal Strong Cool Change ? B Rainy Cold Normal Light Warm Same ? C Sunny Warm Normal Light Warm Same ? D Sunny Cold Normal Strong Warm Same ? A : classified to positive B : classified to negative C : 3 positive , 3 negative D : 2 positive, 4 negative
  • 25. Inductive Bias • A Biased Hypothesis Space Example Sky AirTemp Humidity Wind Water Forecast EnjoySport 1 Sunny Warm Normal Strong Cool Change Yes 2 Cloudy Warm Normal Strong Cool Change Yes 3 Rainy Warm Normal Strong Cool Change No - zero hypothesis in the version space - caused by only conjunctive hypothesis
  • 26. Inductive Bias(2) • An Unbiased Learner – Power set of X : set of all subsets of a set X • number of size of power set : 2|X| – e.g. <Sunny,?,?,?,?,?> ∨ <Cloudy,?,?,?,?,?> – new problem : unable to generalize beyond the observed examples. • Observed examples are only unambiguously classified. • Voting results in no majority or minority.
  • 27. Inductive Bias(3) • The Futility of Bias-Free Learning – no inductive bias => cannot classify unseen data reasonably – inductive bias of L : any minimal set of assertions B such that – inductive bias of Candidate-Elimination algorithm • c ∈ H – advantage of introducing inductive bias • generalizing beyond the observed data • allows comparison of different learners )] , ( ) )[( ( c i i c i D x L x D B X x     
  • 28. Inductive Bias(4) • e.g – Rote-learner : no inductive bias – Candidate-Elimination algo : c ∈ H => more strong – Find-S : c ∈ H and that all are negative unless not proved positive
  • 30. Summary • Concept learning can be cast as a problem of searching through a large predefined space of potential hypotheses. • General-to-specific partial ordering of hypotheses provides a useful structure for search. • Find-S algorithm performs specific-to-general search to find the most specific hypothesis. • Candidate-Elimination algorithm computes version space by incrementally computing the sets of maximally specific (S) and maximally general (G) hypotheses. • S and G delimit the entire set of hypotheses consistent with the data.
  • 31. • Version spaces and Candidate-Elimination algorithm provide a useful conceptual framework for studying concept learning. • Candidate-Elimination algorithm not robust to noisy data or to situations where the unknown target concept is not expressible in the provided hypothesis space. • Inductive bias in Candidate-Elimination algorithm is that target concept exists in H • If hypothesis space be enriched so that there is a every possible hypothesis, that would remove the ability to classify any instance beyond the observed examples.

Editor's Notes

  1. 4
  2. 23
  3. 24