SlideShare a Scribd company logo
1 of 16
Download to read offline
Machine Learning types Dr. Mostafa A. Elhosseini
YouTube Channel
ML Jargons
≡ Training set: the examples that the system uses to learn
≡ Training instance: Training sample -> each training example
≡ Label: spam or not
≡ feature = attribute + value
≡ Predictors: features
≡ Ex. Price of a car
▪ Set of features (mileage, age, brand) called predictors
▪ Feature : Mileage = 15,000
≡ Performance measure P:
≡ Ratio of correctly classified emails
Machine learning definition
≡ Machine Learning is the science (and art) of programming
computers so they can learn from data
≡ The field of study that gives computers the ability to learn without
being explicitly programmed – Arthur Samuel, 1959
≡ A computer program is said to learn from experience E with respect
to some task T and some performance measure P, if its performance
on T, as measured by P, improves with experience E. Tom Mitchell,
1997
ML types
≡ Whether or not they are trained with human supervision
▪ supervised,
▪ unsupervised,
▪ semisupervised, and
▪ Reinforcement Learning
≡ Whether or not they can learn incrementally on the fly
▪ online versus batch learning
≡ Whether they work by simply comparing new data points to known
data points, or instead detect patterns in the training data and build
a predictive model, much like scientists do
▪ instance-based versus model-based learning
Supervised / Unsupervised
≡ In supervised learning, the training data you feed to the algorithm
includes the desired solutions, called labels
▪ Classification
▪ Regression
≡ Classification
▪ The spam filter is a good example of this: it is trained with many example emails along with their
class (spam or ham), and it must learn how to classify new
Supervised / Unsupervised
≡ Regression
▪ predict a target numeric value, such
as the price of a car, given a set of
features (mileage, age, brand, etc.)
called predictors.
≡ some regression algorithms can
be used for classification as well
▪ Logistic Regression is commonly used
for classification, as it can output a
value that corresponds to the
probability of belonging to a given
class
Supervised / Unsupervised
≡ The training data is unlabeled.
≡ The system tries to learn
without a teacher.
≡ Most important unsupervised
Applications
▪ Clustering
▪ Blog visitors
▪ Visualization and dimensionality
reduction
▪ Association rule learning
Unsupervised// Dimensionality Reduction
≡ Simplify the data without losing too much information
≡ One way to do this is to merge several correlated features into one
▪ car’s mileage may be very correlated with its age, so the dimensionality
reduction algorithm will merge them into one feature that represents the
car’s wear and tear. This is called feature extraction
≡ It is often a good idea to try to reduce the dimension of your
training data
▪ It will run much faster,
▪ The data will take up less disk and memory space, and in some cases
▪ It may also perform better
Unsupervised // anomaly detection
≡ Detecting unusual credit card
transactions to prevent fraud,
catching manufacturing defects, or
automatically removing outliers
from a dataset before feeding it to
another learning algorithm.
≡ The system is trained with normal
instances, and when it sees a new
instance it can tell whether it looks
like a normal one or whether it is
likely an anomaly
Unsupervised // Association rule learning
≡ The goal is to dig into large amounts of
data and discover interesting relations
between attributes.
≡ For example, suppose you own a
supermarket. Running an association rule
on your sales logs may reveal that
people who purchase barbecue sauce
and potato chips also tend to buy steak.
Thus, you may want to place these items
close to each other.
Batch/ Online learning
≡ Incapable of learning incrementally
≡ Trained using all the available data
≡ Take a lot of time and computing resources
≡ Done offline
≡ Ex. If a new type of spam -> you need to train a new version of the
system from scratch on the full dataset
Batch/ Online learning
≡ you train the system incrementally by feeding it data instances
sequentially, either individually or by small groups called mini-
batches.
≡ Each learning step is fast and cheap, so the system can learn about
new data on the fly, as it arrives
≡ This whole process is usually done offline (i.e., not on the live
system), so online learning can be a confusing name. Think of it as
incremental learning.
≡ Learning rate
▪ How fast they should adapt to changing data
Instance-based // Model-based
≡One more way to categorize Machine Learning systems is by how they
generalize
▪ Needs to be able to generalize to examples it has never seen before
≡ Having a good performance measure on the training data is good,
but insufficient; the true goal is to perform well on new instances
≡ There are two main approaches to generalization: instance-based
learning and model-based learning
Instance-based // Model-based
≡Simply to learn by heart
▪ For Spam filter: just flag all emails that are identical to emails that have
already been flagged by users
▪ Not the worst solution, but certainly not the best.
≡ Instead of just flagging emails that are identical to known spam
emails, your spam filter could be Programmed to also flag emails that
are very similar to known spam emails.
≡ This requires a measure of Similarity between two emails
≡ The system would flag an email as spam if it has many words in
common with a known spam email
Instance-based // Model-based
Instance-based // Model-based
≡ Another way to
generalize from a
set of examples is
to build a model
of these
examples, then
use that model to
make predictions.
This is called
model-based
learning

More Related Content

Similar to Lecture 02 ml supervised and unsupervised

Introduction To Machine Learning
Introduction To Machine LearningIntroduction To Machine Learning
Introduction To Machine LearningKnoldus Inc.
 
Supervised learning techniques and applications
Supervised learning techniques and applicationsSupervised learning techniques and applications
Supervised learning techniques and applicationsBenjaminlapid1
 
Machine Learning an Research Overview
Machine Learning an Research OverviewMachine Learning an Research Overview
Machine Learning an Research OverviewKathirvel Ayyaswamy
 
Machine learning it is time...
Machine learning it is time...Machine learning it is time...
Machine learning it is time...Sandip Chatterjee
 
It's Machine Learning Basics -- For You!
It's Machine Learning Basics -- For You!It's Machine Learning Basics -- For You!
It's Machine Learning Basics -- For You!To Sum It Up
 
End-to-End Machine Learning Project
End-to-End Machine Learning ProjectEnd-to-End Machine Learning Project
End-to-End Machine Learning ProjectEng Teong Cheah
 
Machine Learning
Machine LearningMachine Learning
Machine LearningAmit Kumar
 
Unit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptxUnit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptxChitrachitrap
 
Machine Learning in the Financial Industry
Machine Learning in the Financial IndustryMachine Learning in the Financial Industry
Machine Learning in the Financial IndustrySubrat Panda, PhD
 
Machine Learning in NutShell
Machine Learning in NutShellMachine Learning in NutShell
Machine Learning in NutShellAshwin Shiv
 
Knowledge discovery claudiad amato
Knowledge discovery claudiad amatoKnowledge discovery claudiad amato
Knowledge discovery claudiad amatoSSSW
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningSujith Jayaprakash
 
An Introduction to Machine Learning
An Introduction to Machine LearningAn Introduction to Machine Learning
An Introduction to Machine LearningVedaj Padman
 
Machine Learning SPPU Unit 1
Machine Learning SPPU Unit 1Machine Learning SPPU Unit 1
Machine Learning SPPU Unit 1Amruta Aphale
 
Unit 1 - ML - Introduction to Machine Learning.pptx
Unit 1 - ML - Introduction to Machine Learning.pptxUnit 1 - ML - Introduction to Machine Learning.pptx
Unit 1 - ML - Introduction to Machine Learning.pptxjawad184956
 

Similar to Lecture 02 ml supervised and unsupervised (20)

Introduction To Machine Learning
Introduction To Machine LearningIntroduction To Machine Learning
Introduction To Machine Learning
 
Supervised learning techniques and applications
Supervised learning techniques and applicationsSupervised learning techniques and applications
Supervised learning techniques and applications
 
Machine Learning an Research Overview
Machine Learning an Research OverviewMachine Learning an Research Overview
Machine Learning an Research Overview
 
Machine learning it is time...
Machine learning it is time...Machine learning it is time...
Machine learning it is time...
 
It's Machine Learning Basics -- For You!
It's Machine Learning Basics -- For You!It's Machine Learning Basics -- For You!
It's Machine Learning Basics -- For You!
 
Lec 1.pptx
Lec 1.pptxLec 1.pptx
Lec 1.pptx
 
End-to-End Machine Learning Project
End-to-End Machine Learning ProjectEnd-to-End Machine Learning Project
End-to-End Machine Learning Project
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Unit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptxUnit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptx
 
Machine Learning in the Financial Industry
Machine Learning in the Financial IndustryMachine Learning in the Financial Industry
Machine Learning in the Financial Industry
 
Machine Learning in NutShell
Machine Learning in NutShellMachine Learning in NutShell
Machine Learning in NutShell
 
Knowledge discovery claudiad amato
Knowledge discovery claudiad amatoKnowledge discovery claudiad amato
Knowledge discovery claudiad amato
 
Machine learning
Machine learningMachine learning
Machine learning
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
An Introduction to Machine Learning
An Introduction to Machine LearningAn Introduction to Machine Learning
An Introduction to Machine Learning
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Machine Learning SPPU Unit 1
Machine Learning SPPU Unit 1Machine Learning SPPU Unit 1
Machine Learning SPPU Unit 1
 
Machine learning
Machine learningMachine learning
Machine learning
 
Unit 1 - ML - Introduction to Machine Learning.pptx
Unit 1 - ML - Introduction to Machine Learning.pptxUnit 1 - ML - Introduction to Machine Learning.pptx
Unit 1 - ML - Introduction to Machine Learning.pptx
 

More from Mostafa El-Hosseini

More from Mostafa El-Hosseini (17)

why now Deep Neural Networks?
why now Deep Neural Networks?why now Deep Neural Networks?
why now Deep Neural Networks?
 
Activation functions types
Activation functions typesActivation functions types
Activation functions types
 
Why activation function
Why activation functionWhy activation function
Why activation function
 
Logistic Regression (Binary Classification)
Logistic Regression (Binary Classification)Logistic Regression (Binary Classification)
Logistic Regression (Binary Classification)
 
Model validation and_early_stopping_-_shooting
Model validation and_early_stopping_-_shootingModel validation and_early_stopping_-_shooting
Model validation and_early_stopping_-_shooting
 
Lecture 01 _perceptron_intro
Lecture 01 _perceptron_introLecture 01 _perceptron_intro
Lecture 01 _perceptron_intro
 
Lecture 19 chapter_4_regularized_linear_models
Lecture 19 chapter_4_regularized_linear_modelsLecture 19 chapter_4_regularized_linear_models
Lecture 19 chapter_4_regularized_linear_models
 
Svm rbf kernel
Svm rbf kernelSvm rbf kernel
Svm rbf kernel
 
Lecture 24 support vector machine kernel
Lecture 24  support vector machine kernelLecture 24  support vector machine kernel
Lecture 24 support vector machine kernel
 
Lecture 23 support vector classifier
Lecture 23  support vector classifierLecture 23  support vector classifier
Lecture 23 support vector classifier
 
Lecture 12 binary classifier confusion matrix
Lecture 12 binary classifier confusion matrixLecture 12 binary classifier confusion matrix
Lecture 12 binary classifier confusion matrix
 
Lecture 11 linear regression
Lecture 11 linear regressionLecture 11 linear regression
Lecture 11 linear regression
 
Numpy 02
Numpy 02Numpy 02
Numpy 02
 
Naive bayes classifier python session
Naive bayes classifier  python sessionNaive bayes classifier  python session
Naive bayes classifier python session
 
Ga
GaGa
Ga
 
Numpy 01
Numpy 01Numpy 01
Numpy 01
 
Lecture 08 prepare the data for ml algorithm
Lecture 08 prepare the data for ml algorithmLecture 08 prepare the data for ml algorithm
Lecture 08 prepare the data for ml algorithm
 

Recently uploaded

Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...Call Girls in Nagpur High Profile
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations120cr0395
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdfKamal Acharya
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...ranjana rawat
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdfankushspencer015
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 

Recently uploaded (20)

Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 

Lecture 02 ml supervised and unsupervised

  • 1. Machine Learning types Dr. Mostafa A. Elhosseini YouTube Channel
  • 2. ML Jargons ≡ Training set: the examples that the system uses to learn ≡ Training instance: Training sample -> each training example ≡ Label: spam or not ≡ feature = attribute + value ≡ Predictors: features ≡ Ex. Price of a car ▪ Set of features (mileage, age, brand) called predictors ▪ Feature : Mileage = 15,000 ≡ Performance measure P: ≡ Ratio of correctly classified emails
  • 3. Machine learning definition ≡ Machine Learning is the science (and art) of programming computers so they can learn from data ≡ The field of study that gives computers the ability to learn without being explicitly programmed – Arthur Samuel, 1959 ≡ A computer program is said to learn from experience E with respect to some task T and some performance measure P, if its performance on T, as measured by P, improves with experience E. Tom Mitchell, 1997
  • 4. ML types ≡ Whether or not they are trained with human supervision ▪ supervised, ▪ unsupervised, ▪ semisupervised, and ▪ Reinforcement Learning ≡ Whether or not they can learn incrementally on the fly ▪ online versus batch learning ≡ Whether they work by simply comparing new data points to known data points, or instead detect patterns in the training data and build a predictive model, much like scientists do ▪ instance-based versus model-based learning
  • 5. Supervised / Unsupervised ≡ In supervised learning, the training data you feed to the algorithm includes the desired solutions, called labels ▪ Classification ▪ Regression ≡ Classification ▪ The spam filter is a good example of this: it is trained with many example emails along with their class (spam or ham), and it must learn how to classify new
  • 6. Supervised / Unsupervised ≡ Regression ▪ predict a target numeric value, such as the price of a car, given a set of features (mileage, age, brand, etc.) called predictors. ≡ some regression algorithms can be used for classification as well ▪ Logistic Regression is commonly used for classification, as it can output a value that corresponds to the probability of belonging to a given class
  • 7. Supervised / Unsupervised ≡ The training data is unlabeled. ≡ The system tries to learn without a teacher. ≡ Most important unsupervised Applications ▪ Clustering ▪ Blog visitors ▪ Visualization and dimensionality reduction ▪ Association rule learning
  • 8. Unsupervised// Dimensionality Reduction ≡ Simplify the data without losing too much information ≡ One way to do this is to merge several correlated features into one ▪ car’s mileage may be very correlated with its age, so the dimensionality reduction algorithm will merge them into one feature that represents the car’s wear and tear. This is called feature extraction ≡ It is often a good idea to try to reduce the dimension of your training data ▪ It will run much faster, ▪ The data will take up less disk and memory space, and in some cases ▪ It may also perform better
  • 9. Unsupervised // anomaly detection ≡ Detecting unusual credit card transactions to prevent fraud, catching manufacturing defects, or automatically removing outliers from a dataset before feeding it to another learning algorithm. ≡ The system is trained with normal instances, and when it sees a new instance it can tell whether it looks like a normal one or whether it is likely an anomaly
  • 10. Unsupervised // Association rule learning ≡ The goal is to dig into large amounts of data and discover interesting relations between attributes. ≡ For example, suppose you own a supermarket. Running an association rule on your sales logs may reveal that people who purchase barbecue sauce and potato chips also tend to buy steak. Thus, you may want to place these items close to each other.
  • 11. Batch/ Online learning ≡ Incapable of learning incrementally ≡ Trained using all the available data ≡ Take a lot of time and computing resources ≡ Done offline ≡ Ex. If a new type of spam -> you need to train a new version of the system from scratch on the full dataset
  • 12. Batch/ Online learning ≡ you train the system incrementally by feeding it data instances sequentially, either individually or by small groups called mini- batches. ≡ Each learning step is fast and cheap, so the system can learn about new data on the fly, as it arrives ≡ This whole process is usually done offline (i.e., not on the live system), so online learning can be a confusing name. Think of it as incremental learning. ≡ Learning rate ▪ How fast they should adapt to changing data
  • 13. Instance-based // Model-based ≡One more way to categorize Machine Learning systems is by how they generalize ▪ Needs to be able to generalize to examples it has never seen before ≡ Having a good performance measure on the training data is good, but insufficient; the true goal is to perform well on new instances ≡ There are two main approaches to generalization: instance-based learning and model-based learning
  • 14. Instance-based // Model-based ≡Simply to learn by heart ▪ For Spam filter: just flag all emails that are identical to emails that have already been flagged by users ▪ Not the worst solution, but certainly not the best. ≡ Instead of just flagging emails that are identical to known spam emails, your spam filter could be Programmed to also flag emails that are very similar to known spam emails. ≡ This requires a measure of Similarity between two emails ≡ The system would flag an email as spam if it has many words in common with a known spam email
  • 16. Instance-based // Model-based ≡ Another way to generalize from a set of examples is to build a model of these examples, then use that model to make predictions. This is called model-based learning