Introduction to Machine Learning

Artificial Intelligence, Lecture 15: Introduction to Machine Learning

Overview
- Machine Learning
- ID3 Decision Tree Algorithm
- Discussions

Machine Learning
- Supervised Learning
  - Training examples consist of pairs of input vectors and desired outputs
- Unsupervised Learning
  - Training examples do not contain hints about correct outputs
  - Usually used to identify unusual structures in data

Inductive Learning Basics
- Inferring a boolean or real-valued function from training examples
- A training example is a pair (x, f(x))
  - x is the input
  - f(x) is the output of the function applied to x
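To make the (x, f(x)) notation concrete, here is a minimal sketch. The target function (logical AND), the two candidate hypotheses `h_and` and `h_or`, and the helper `is_consistent` are hypothetical names chosen for illustration; none of them come from the lecture.

```python
from typing import Callable, List, Tuple

# A training example is a pair (x, f(x)): an input vector and the desired output.
Example = Tuple[Tuple[int, int], bool]

# Hypothetical target: f(x) is true only when both inputs are 1 (logical AND).
training_examples: List[Example] = [
    ((0, 0), False),
    ((0, 1), False),
    ((1, 0), False),
    ((1, 1), True),
]

def is_consistent(h: Callable[[Tuple[int, int]], bool],
                  examples: List[Example]) -> bool:
    """Return True if hypothesis h reproduces f(x) on every training example."""
    return all(h(x) == fx for x, fx in examples)

def h_and(x):
    # Candidate hypothesis: both inputs must be 1.
    return x[0] == 1 and x[1] == 1

def h_or(x):
    # Candidate hypothesis: at least one input must be 1.
    return x[0] == 1 or x[1] == 1

print(is_consistent(h_and, training_examples))  # True
print(is_consistent(h_or, training_examples))   # False
```

Here `h_or` is rejected because it disagrees with the example ((0, 1), False), while `h_and` agrees with all four pairs.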

Hypothesis
- Any function that approximates the given set of examples
- Bias: a preference for one hypothesis over another, beyond mere consistency with the examples
[Figure: four panels, (a) to (d)]

Hypothesis Space
- The set of all hypotheses under consideration, denoted H = {H_1, H_2, …, H_n}
- Inductive learning is a search for a good hypothesis in the hypothesis space
- Occam's razor: prefer the simplest hypothesis consistent with the data
- Inductive Learning Assumption: any hypothesis found to approximate the target function well over a sufficiently large set of training examples will also approximate the target function well over other, unobserved examples.
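Occam's razor can be sketched as a two-step selection: keep the hypotheses that agree with the data, then take the simplest one. The tiny hypothesis space below, the training examples, and the use of "number of attribute tests" as the simplicity measure are all assumptions made for illustration.

```python
# Training examples (x, f(x)) for a hypothetical target over two boolean inputs.
examples = [((0, 0), False), ((0, 1), False), ((1, 0), True), ((1, 1), True)]

# A tiny, hand-picked hypothesis space H = {H_1, ..., H_n}.
# Each entry: (description, number of attribute tests used, function).
H = [
    ("always true", 0, lambda x: True),
    ("x0", 1, lambda x: x[0] == 1),
    ("x1", 1, lambda x: x[1] == 1),
    ("x0 and (x1 or not x1)", 3,
     lambda x: x[0] == 1 and (x[1] == 1 or x[1] == 0)),
]

def consistent(h, data):
    """True if h matches the desired output on every example."""
    return all(h(x) == fx for x, fx in data)

# Step 1: keep only the hypotheses consistent with the data.
candidates = [(size, name) for name, size, h in H if consistent(h, examples)]

# Step 2: among those, prefer the one with the fewest tests.
size, best = min(candidates)
print(best)  # x0
```

Two hypotheses fit the data equally well here; the preference for the smaller one is exactly the kind of bias that goes beyond mere consistency.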

Decision Tree for PlayTennis

Outlook   Temperature  Humidity  Wind  PlayTennis
Sunny     Hot          High      Weak  ?
Overcast  Hot          High      Weak  ?

Decision Tree Representation
- Decision Tree
  - Each internal node tests an attribute
  - Each branch corresponds to an attribute value
  - Each leaf node predicts a class label
- A tree represents a disjunction of conjunctions of attribute-value tests
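A small sketch of this representation follows. The tree used is the commonly cited PlayTennis tree from Mitchell's textbook, which is an assumption here because the lecture text does not spell the tree out; the nested-dictionary layout and the names `classify` and `positive_paths` are likewise illustrative.

```python
# Assumed tree (standard PlayTennis example, not given in the lecture text).
# Internal node: {"attribute": name, "branches": {value: subtree}}.
# Leaf node: a class label string.
tree = {
    "attribute": "Outlook",
    "branches": {
        "Sunny": {"attribute": "Humidity",
                  "branches": {"High": "No", "Normal": "Yes"}},
        "Overcast": "Yes",
        "Rain": {"attribute": "Wind",
                 "branches": {"Strong": "No", "Weak": "Yes"}},
    },
}

def classify(node, instance):
    """Follow the branch matching each tested attribute until a leaf is reached."""
    while isinstance(node, dict):
        node = node["branches"][instance[node["attribute"]]]
    return node

def positive_paths(node, path=()):
    """Yield each root-to-leaf path ending in "Yes" as a conjunction of tests."""
    if not isinstance(node, dict):
        if node == "Yes":
            yield path
        return
    for value, child in node["branches"].items():
        yield from positive_paths(child, path + ((node["attribute"], value),))

sunny = {"Outlook": "Sunny", "Temperature": "Hot",
         "Humidity": "High", "Wind": "Weak"}
overcast = {"Outlook": "Overcast", "Temperature": "Hot",
            "Humidity": "High", "Wind": "Weak"}

print(classify(tree, sunny))     # No
print(classify(tree, overcast))  # Yes

# The tree as a disjunction (one line per path) of conjunctions (tests on a path).
for path in positive_paths(tree):
    print(" AND ".join(f"{a}={v}" for a, v in path))
```

Under this assumed tree, the concept reads (Outlook=Sunny AND Humidity=Normal) OR (Outlook=Overcast) OR (Outlook=Rain AND Wind=Weak).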

Hypothesis Space Search by ID3
- Hypothesis space is complete
  - The target function is surely in there …
- Outputs a single hypothesis (which one?)
  - Cannot determine how many alternative trees are consistent with the data
- No backtracking
  - Risk of converging to local minima …
- Uses statistical properties of all training data at each step of the search
  - Robust to noisy data
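The greedy search described above can be sketched as follows. This follows the usual textbook description of ID3 (entropy-based information gain, one attribute chosen per node, no backtracking) rather than any code from the lecture. The six-row dataset is a made-up toy, and the function names are illustrative.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())

def information_gain(rows, attribute, target):
    """Expected entropy reduction from splitting rows on attribute."""
    remainder = 0.0
    for value in {row[attribute] for row in rows}:
        subset = [row[target] for row in rows if row[attribute] == value]
        remainder += len(subset) / len(rows) * entropy(subset)
    return entropy([row[target] for row in rows]) - remainder

def id3(rows, attributes, target):
    """Greedy top-down induction: pick the best attribute, recurse, never backtrack."""
    labels = [row[target] for row in rows]
    if len(set(labels)) == 1:      # pure node: stop with that label
        return labels[0]
    if not attributes:             # no tests left: fall back to the majority label
        return Counter(labels).most_common(1)[0][0]
    # Every row at this node contributes to the statistics used for the choice.
    best = max(attributes, key=lambda a: information_gain(rows, a, target))
    remaining = [a for a in attributes if a != best]
    branches = {}
    for value in sorted({row[best] for row in rows}):
        subset = [row for row in rows if row[best] == value]
        branches[value] = id3(subset, remaining, target)
    return {"attribute": best, "branches": branches}

# Hypothetical toy data, invented for this sketch.
rows = [
    {"Outlook": "Sunny",    "Wind": "Weak",   "PlayTennis": "No"},
    {"Outlook": "Sunny",    "Wind": "Strong", "PlayTennis": "No"},
    {"Outlook": "Overcast", "Wind": "Weak",   "PlayTennis": "Yes"},
    {"Outlook": "Overcast", "Wind": "Strong", "PlayTennis": "Yes"},
    {"Outlook": "Rain",     "Wind": "Weak",   "PlayTennis": "Yes"},
    {"Outlook": "Rain",     "Wind": "Strong", "PlayTennis": "No"},
]

learned = id3(rows, ["Outlook", "Wind"], "PlayTennis")
print(learned["attribute"])  # Outlook
```

Once `best` is chosen at a node, the choice is never revisited, which is why the search returns a single tree and can settle on a locally rather than globally best one.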

Inductive Bias in ID3
- Note that the hypothesis space H is the power set of the instances X
- Unbiased? Not really.
  - Preference for short trees, and for trees that place high-information-gain attributes near the root
- The bias is a preference for some hypotheses rather than a restriction of the hypothesis space H (a restriction would mean, e.g., that the target concept might not be in H)
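To get a feel for how large the power set of X is, the short calculation below counts instances and boolean concepts. The attribute value counts are an assumption taken from the standard PlayTennis example (Outlook 3, Temperature 3, Humidity 2, Wind 2); the lecture text does not state them.

```python
from math import prod

# Assumed number of distinct values per attribute (not stated in the lecture).
value_counts = {"Outlook": 3, "Temperature": 3, "Humidity": 2, "Wind": 2}

# |X|: number of distinct attribute-value combinations.
num_instances = prod(value_counts.values())

# |H|: every subset of X is a possible boolean target concept.
num_concepts = 2 ** num_instances

print(num_instances)  # 36
print(num_concepts)   # 68719476736
```

With a space this large and complete, nothing is ruled out in advance, so whatever generalization ID3 achieves must come from its preference ordering over trees.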

Summary
- Machine Learning
- ID3 Decision Tree Algorithm
- Discussions
