SlideShare a Scribd company logo
1 of 29
1
Machine Learning
It is a field of study that gives computers the ability to learn without being explicitly
programmed.
Forms of Machine learning
• Supervised Learning
⮚ Prior knowledge about class label
⮚ Common examples are Random Forest, Decision Tree, Naïve Bayes etc.
• Unsupervised Learning
⮚ No prior knowledge about class label
⮚ Common examples are K-means, Apriori etc.
Reinforcement Learning
⮚ Based on reward or penalty
⮚ Agent is able to perceive and interpret its environment, take actions and learn through
trial and error.
⮚ Common examples are Q-learning etc.
Decision Tree Learning
• Decision tree learning is a method for approximating discrete-valued target functions.
• The learned function is represented by a decision tree.
⮚ A learned decision tree can also be represented as a set of if-then rules.
• Decision tree learning is one of the most widely used and practical methods for
inductive inference.
• It is robust to noisy data and capable of learning disjunctive expressions.
• Decision tree learning method searches a completely expressive hypothesis.
⮚ Avoids the difficulties of restricted hypothesis spaces.
⮚ Its inductive bias is a preference for small trees over large trees.
• The decision tree algorithms such as ID3, C4.5 are very popular inductive inference
algorithms, and they successfully applied to many leaning tasks.
Decision Tree
• Decision Tree represents a disjunction of conjunctions of constraints on the
attributes values of instances.
• Each path from the tree root to a leaf corresponds to a conjuction of attribute
tests
• The tree itself is a disjunction of these conjunctions.
(Outlook = Sunny ˄ Humidity
= Normal)
˅ (Outlook = Overcast)
˅ (Outlook = Rain ˄ Wind
= Weak )
Decision Tree
• Decision tree classify instances by sorting them down the tree from the root to some
leaf node, which provides the classification of the instance.
• Each node in the tree specifies a test of some attributes of the instance.
• Each branch descending from a node corresponds to one of the possible values for the
attribute.
• Each leaf node assigns a classification.
• The instance
(Outlook = Sunny, Temperature = Hot, Humidity = High, Wind =
Strong) is classified as a negative instance.
4
Which Attribute is “best”?
• We would like to select the attribute that is most useful for classifying examples.
• Information Gain measures how well a given attribute separates the training examples
according to their target classification.
• ID3 uses this information gain measure to select among the candidate attribute at
each step while growing the tree.
• In order to define information gain precisely, we use a measure commonly used in
information theory, called entropy
• Entropy characterizes the (im)purity of an arbitrary collection of examples.
5
•
6
•
7
Which attribute is best classifier?
8
ID3 Training examples – [9+, 5-]
9
ID3 Selecting Next Attribute
10
ID3 Selecting Next Attribute
11
ID3 Selecting Next Attribute
12
Best Attribute - Outlook
13
•
14
ID3 - Result
15
Converting Decision Tree into Rules
16
Split Information
17
Linear Models
A strong high-bias assumption is linear separability:
– in 2 dimensions, can separate classes by a line
– in higher dimensions, need hyperplanes
A linear model is a model that assumes the data is linearly separable
18
A linear model in n-dimensional space (i.e. n features) is define by n+1
weights:
In two dimensions, a line:
In three dimensions, a plane:
In m-dimensions, a hyperplane
19
(where b = -
a)
Artificial Neural Network (ANN)
• Artificial Neural Network (ANNs) are programs designed to solve any problem by
trying to mimic the structure and the function of our nervous system.
• Neural networks are based on simulated neuron, which are joined together in a
variety of ways to form networks.
• Neural network resembles human brain in the following two ways
⮚ A neural network acquires knowledge through learning
⮚ A neural network’s knowledge is stored within the interconnection strengths known as
synaptic weights.
20
•
21
•
22
23
24
25
Backpropagation Algorithm
• The backpropagation algorithm (Rumelhart and McClelland,1986) is used in
layered feed-forward Artificial Neural Networks.
• Backpropagation is a multi-layer feed forward, supervised learning network based
on gradient descent learning rule.
• We provide the algorithm with examples of the inputs and outputs we want the
network to compute, and then the error (difference between actual and expected
results) is calculated.
• The idea of the backpropagation algorithm is to reduce this error, until the
Artificial Neural Network learns the training data.
26
27
• The backpropogation algorithm now calculates the error depends on the output,
inputs and weights.
• The adjustment of each weight(Δwji) will be the negative of a constant eta (η)
multiplied by the dependence of the “wji” previous weights on the error of the
network.
• First, we need to calculate how much the error depends on the output
• Next, how much the output depends on the activation, which in turn depends
weights
• And so, the adjustment to each weight will be
28
29

More Related Content

Similar to Lec 18-19.pptx

Similar to Lec 18-19.pptx (20)

Demystifying Machine Learning
Demystifying Machine LearningDemystifying Machine Learning
Demystifying Machine Learning
 
Machine learning and decision trees
Machine learning and decision treesMachine learning and decision trees
Machine learning and decision trees
 
AI Unit 5 machine learning
AI Unit 5 machine learning AI Unit 5 machine learning
AI Unit 5 machine learning
 
Artificial Neural Networks for data mining
Artificial Neural Networks for data miningArtificial Neural Networks for data mining
Artificial Neural Networks for data mining
 
Artificial Neural Networks for Data Mining
Artificial Neural Networks for Data MiningArtificial Neural Networks for Data Mining
Artificial Neural Networks for Data Mining
 
c23_ml1.ppt
c23_ml1.pptc23_ml1.ppt
c23_ml1.ppt
 
7 decision tree
7 decision tree7 decision tree
7 decision tree
 
Machine learning
Machine learningMachine learning
Machine learning
 
Artificial_Neural_Network.pdf
Artificial_Neural_Network.pdfArtificial_Neural_Network.pdf
Artificial_Neural_Network.pdf
 
Machine Learning
Machine Learning Machine Learning
Machine Learning
 
AI -learning and machine learning.pptx
AI  -learning and machine learning.pptxAI  -learning and machine learning.pptx
AI -learning and machine learning.pptx
 
module_3_1.pptx
module_3_1.pptxmodule_3_1.pptx
module_3_1.pptx
 
module_3_1.pptx
module_3_1.pptxmodule_3_1.pptx
module_3_1.pptx
 
Unit-V Machine Learning.ppt
Unit-V Machine Learning.pptUnit-V Machine Learning.ppt
Unit-V Machine Learning.ppt
 
Artificial Neural Network Learning Algorithm.ppt
Artificial Neural Network Learning Algorithm.pptArtificial Neural Network Learning Algorithm.ppt
Artificial Neural Network Learning Algorithm.ppt
 
Decision Tree.pptx
Decision Tree.pptxDecision Tree.pptx
Decision Tree.pptx
 
Introduction to-machine-learning
Introduction to-machine-learningIntroduction to-machine-learning
Introduction to-machine-learning
 
Supervised and unsupervised learning
Supervised and unsupervised learningSupervised and unsupervised learning
Supervised and unsupervised learning
 
Artificial Neural Network (ANN
Artificial Neural Network (ANNArtificial Neural Network (ANN
Artificial Neural Network (ANN
 
SC1.pptx
SC1.pptxSC1.pptx
SC1.pptx
 

Recently uploaded

What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx959SahilShah
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxbritheesh05
 
Electronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfElectronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfme23b1001
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AIabhishek36461
 
Introduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHIntroduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHC Sai Kiran
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
pipeline in computer architecture design
pipeline in computer architecture  designpipeline in computer architecture  design
pipeline in computer architecture designssuser87fa0c1
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfAsst.prof M.Gokilavani
 
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)dollysharma2066
 
DATA ANALYTICS PPT definition usage example
DATA ANALYTICS PPT definition usage exampleDATA ANALYTICS PPT definition usage example
DATA ANALYTICS PPT definition usage examplePragyanshuParadkar1
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxKartikeyaDwivedi3
 
Churning of Butter, Factors affecting .
Churning of Butter, Factors affecting  .Churning of Butter, Factors affecting  .
Churning of Butter, Factors affecting .Satyam Kumar
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSCAESB
 
complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...asadnawaz62
 

Recently uploaded (20)

What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptx
 
Electronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfElectronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdf
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AI
 
Introduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHIntroduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECH
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
pipeline in computer architecture design
pipeline in computer architecture  designpipeline in computer architecture  design
pipeline in computer architecture design
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
 
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
 
DATA ANALYTICS PPT definition usage example
DATA ANALYTICS PPT definition usage exampleDATA ANALYTICS PPT definition usage example
DATA ANALYTICS PPT definition usage example
 
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptx
 
Churning of Butter, Factors affecting .
Churning of Butter, Factors affecting  .Churning of Butter, Factors affecting  .
Churning of Butter, Factors affecting .
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentation
 
complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...
 

Lec 18-19.pptx

  • 1. 1 Machine Learning It is a field of study that gives computers the ability to learn without being explicitly programmed. Forms of Machine learning • Supervised Learning ⮚ Prior knowledge about class label ⮚ Common examples are Random Forest, Decision Tree, Naïve Bayes etc. • Unsupervised Learning ⮚ No prior knowledge about class label ⮚ Common examples are K-means, Apriori etc. Reinforcement Learning ⮚ Based on reward or penalty ⮚ Agent is able to perceive and interpret its environment, take actions and learn through trial and error. ⮚ Common examples are Q-learning etc.
  • 2. Decision Tree Learning • Decision tree learning is a method for approximating discrete-valued target functions. • The learned function is represented by a decision tree. ⮚ A learned decision tree can also be represented as a set of if-then rules. • Decision tree learning is one of the most widely used and practical methods for inductive inference. • It is robust to noisy data and capable of learning disjunctive expressions. • Decision tree learning method searches a completely expressive hypothesis. ⮚ Avoids the difficulties of restricted hypothesis spaces. ⮚ Its inductive bias is a preference for small trees over large trees. • The decision tree algorithms such as ID3, C4.5 are very popular inductive inference algorithms, and they successfully applied to many leaning tasks.
  • 3. Decision Tree • Decision Tree represents a disjunction of conjunctions of constraints on the attributes values of instances. • Each path from the tree root to a leaf corresponds to a conjuction of attribute tests • The tree itself is a disjunction of these conjunctions. (Outlook = Sunny ˄ Humidity = Normal) ˅ (Outlook = Overcast) ˅ (Outlook = Rain ˄ Wind = Weak )
  • 4. Decision Tree • Decision tree classify instances by sorting them down the tree from the root to some leaf node, which provides the classification of the instance. • Each node in the tree specifies a test of some attributes of the instance. • Each branch descending from a node corresponds to one of the possible values for the attribute. • Each leaf node assigns a classification. • The instance (Outlook = Sunny, Temperature = Hot, Humidity = High, Wind = Strong) is classified as a negative instance. 4
  • 5. Which Attribute is “best”? • We would like to select the attribute that is most useful for classifying examples. • Information Gain measures how well a given attribute separates the training examples according to their target classification. • ID3 uses this information gain measure to select among the candidate attribute at each step while growing the tree. • In order to define information gain precisely, we use a measure commonly used in information theory, called entropy • Entropy characterizes the (im)purity of an arbitrary collection of examples. 5
  • 8. Which attribute is best classifier? 8
  • 9. ID3 Training examples – [9+, 5-] 9
  • 10. ID3 Selecting Next Attribute 10
  • 11. ID3 Selecting Next Attribute 11
  • 12. ID3 Selecting Next Attribute 12
  • 13. Best Attribute - Outlook 13
  • 16. Converting Decision Tree into Rules 16
  • 18. Linear Models A strong high-bias assumption is linear separability: – in 2 dimensions, can separate classes by a line – in higher dimensions, need hyperplanes A linear model is a model that assumes the data is linearly separable 18
  • 19. A linear model in n-dimensional space (i.e. n features) is define by n+1 weights: In two dimensions, a line: In three dimensions, a plane: In m-dimensions, a hyperplane 19 (where b = - a)
  • 20. Artificial Neural Network (ANN) • Artificial Neural Network (ANNs) are programs designed to solve any problem by trying to mimic the structure and the function of our nervous system. • Neural networks are based on simulated neuron, which are joined together in a variety of ways to form networks. • Neural network resembles human brain in the following two ways ⮚ A neural network acquires knowledge through learning ⮚ A neural network’s knowledge is stored within the interconnection strengths known as synaptic weights. 20
  • 23. 23
  • 24. 24
  • 25. 25
  • 26. Backpropagation Algorithm • The backpropagation algorithm (Rumelhart and McClelland,1986) is used in layered feed-forward Artificial Neural Networks. • Backpropagation is a multi-layer feed forward, supervised learning network based on gradient descent learning rule. • We provide the algorithm with examples of the inputs and outputs we want the network to compute, and then the error (difference between actual and expected results) is calculated. • The idea of the backpropagation algorithm is to reduce this error, until the Artificial Neural Network learns the training data. 26
  • 27. 27
  • 28. • The backpropogation algorithm now calculates the error depends on the output, inputs and weights. • The adjustment of each weight(Δwji) will be the negative of a constant eta (η) multiplied by the dependence of the “wji” previous weights on the error of the network. • First, we need to calculate how much the error depends on the output • Next, how much the output depends on the activation, which in turn depends weights • And so, the adjustment to each weight will be 28
  • 29. 29