INTRODUCTION TO MACHINE LEARNING
CHILD LEARNING
Child: Daddy, what is danger?
Dad: The possibility of suffering harm or injury.
Child: Daddy, what is an injury?
Dad: An instance of being injured.
Child: Daddy, what is an instance?
Dad: An example or single occurrence of something.
Child: Daddy, does it bother you that I’m asking so many questions?
Dad: Not at all. If you don't ask, you will never know.
Dad: Let me give you some examples…
Child: Now I understand, everything is dangerous.
Dad: No, there are things that aren't dangerous.
Child: And what are those?
And there is the most natural mode of learning:
Action                     Reaction             Lesson
Touching hot stove         Aching hand          Do not touch again
Playing with toys          Fun                  Continue playing
Running into the road      Screaming parent     Don’t run into roads
Running in the house       Fun                  Run in the house
Eating chocolate           Fun                  Search for chocolate
Eating too much chocolate  Stomach ache         Don’t eat too much
Saying bla bla             No reaction          Try variations
Saying daddy               Overexcited parents  Do that again
SO, HOW DO CHILDREN LEARN?
1. From explanation
2. From examples
3. Reinforcement Learning
ABOUT US
Algorithms Technology
Business
AGENDA
• What is Machine Learning
• Typical Machine Learning Tasks
• Supervised Learning
• Unsupervised Learning
• How to Get Started
• Summary
WHAT IS MACHINE LEARNING?
We say that a computer program learns a task if its
performance on that task improves as it processes more
experience.
WHAT IS MACHINE LEARNING?
[Figure labels: Machine Learning; Statistics; Databases & Big Data; Decision Theory; Artificial Intelligence; Optimization. A second version of the slide adds the label Data Science.]
TYPICAL MACHINE LEARNING TASKS
No two Machine Learning tasks are identical. Yet, we often use
the following categories:
• Supervised Learning
• Unsupervised Learning
• Reinforcement Learning
SUPERVISED LEARNING
Estimate or predict an unknown result, given explicit values of some
explaining features.
Learning draws on a history of observations for which both the
explaining features and the results are known.
Experience = supervised examples (exactly as in inferring what is dangerous
from examples)
We call the dataset that describes this experience the training set
When the unknown result is numeric, we call the task Regression
When the unknown result is categorical, we call the task Classification
SUPERVISED LEARNING
Example 1: What will be the annual spend of a new customer,
given a set of explaining features (e.g., demographics, first
purchases, first deposit, etc.)?
Task qualifications: Prediction, Regression
Training set: a file in which each row represents a customer. For
each such customer we extract the explaining features as of
the prediction point, as well as the annual spend (measured a year later).
SUPERVISED LEARNING
Example 2: What is the activity currently performed by a user
who is wearing a smart watch with inertial sensors?
Task qualifications: Assessment, Classification
Input: A set of sensor-based signals, along with an annotation of
the activity during each signal.
Requires a significant amount of pre-processing in order to
produce the training set.
SUPERVISED LEARNING
[Figure: supervised tasks arranged along two axes, Prediction vs. Assessment and Classification vs. Regression]
UNSUPERVISED LEARNING
Given a specific set of records, described by a given set of
features, either:
1. Extract interesting patterns that appear in the data
2. Provide insightful representation of the distribution of the
data
Experience: the more records we have, the more significant the
patterns we can extract, or the more accurate the
representation
UNSUPERVISED LEARNING
Example: Market Segmentation
Input data: Customers’ descriptions
Objective: Provide an insightful representation of the market
(what types of customers are there?)
Also known as cluster analysis
REINFORCEMENT LEARNING
Learning how to best react to a situation through trial and error.
Simple Example: Multiple A/B testing
More Typical: Robot Navigation
Designing an RL system requires solving two difficult challenges:
• The exploration – exploitation dilemma
• Attributing delayed rewards
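The exploration – exploitation dilemma can be illustrated with the multiple A/B testing example. The sketch below is one simple strategy (epsilon-greedy), not the only one; the function name, the conversion rates and the value of epsilon are all hypothetical choices made for illustration.

```python
import random

def epsilon_greedy(true_rates, steps=5000, epsilon=0.1, seed=0):
    """Simulate an agent that repeatedly chooses one of several page variants.

    true_rates holds the (hypothetical) conversion rate of each variant;
    the agent does not see these, it only observes rewards.
    """
    rng = random.Random(seed)
    counts = [0] * len(true_rates)    # how often each variant was shown
    values = [0.0] * len(true_rates)  # running mean reward per variant
    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: try a random variant to keep learning about it.
            arm = rng.randrange(len(true_rates))
        else:
            # Exploit: show the variant that looks best so far.
            arm = max(range(len(true_rates)), key=lambda a: values[a])
        reward = 1.0 if rng.random() < true_rates[arm] else 0.0
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]
    return counts, values

counts, values = epsilon_greedy([0.05, 0.10, 0.20])
print(counts, [round(v, 3) for v in values])
```

With epsilon = 0 the agent never explores and may lock onto a poor variant; with epsilon = 1 it never uses what it has learned. Rewards here are immediate, so this sketch does not touch the second challenge, attributing delayed rewards.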
UNSTRUCTURED INPUTS
The input data often come in an unstructured form, such as:
• Free text
• Speech
• Images
• Video
• Sensors
• Networks
SUPERVISED LEARNING
X1       X2       X3       …   Xn-2       Xn-1       Xn       Y
x1,1     x2,1     x3,1     …   xn-2,1     xn-1,1     xn,1     y1
x1,2     x2,2     x3,2     …   xn-2,2     xn-1,2     xn,2     y2
…        …        …        …   …          …          …        …
x1,m-1   x2,m-1   x3,m-1   …   xn-2,m-1   xn-1,m-1   xn,m-1   ym-1
x1,m     x2,m     x3,m     …   xn-2,m     xn-1,m     xn,m     ym
$Y = f(X_1, X_2, \ldots, X_n)$
LEARNING THE CONCEPT OF A BIRD
An alien asks you: “What is a bird?”
You can try and define a bird, but the alien does not understand
Why don’t you give an example…
LEARNING THE CONCEPT OF A BIRD
Example #   Color   Can Fly?   Is Bird?
1           Black   Yes        Yes
What do you say about the following classification model:
“If Color = Black and Can_Fly = Yes then Bird
Else Not_Bird”?
LEARNING THE CONCEPT OF A BIRD
Example #   Color   Can Fly?   Is Bird?
1           Black   Yes        Yes
2           Grey    Yes        Yes
What do you say about the following classification model:
“If Can_Fly = Yes then Bird Else Not_Bird”?
LEARNING THE CONCEPT OF A BIRD
Example #   Color   Can Fly?   Is Bird?
1           Black   Yes        Yes
2           Grey    Yes        Yes
3           Black   Yes        No
Supervised Learning means generalizing from given
observations.
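The bird examples can be replayed in a few lines. This is a small sketch that scores the two candidate rules from the slides against the examples as they arrive; the dictionary keys and function names are illustrative only.

```python
# The three observations from the slides, in the order they were given.
examples = [
    {"color": "Black", "can_fly": True, "is_bird": True},   # example 1
    {"color": "Grey",  "can_fly": True, "is_bird": True},   # example 2
    {"color": "Black", "can_fly": True, "is_bird": False},  # example 3
]

def rule_black_and_flies(e):
    """If Color = Black and Can_Fly = Yes then Bird, else Not_Bird."""
    return e["color"] == "Black" and e["can_fly"]

def rule_flies(e):
    """If Can_Fly = Yes then Bird, else Not_Bird."""
    return e["can_fly"]

def errors(rule, data):
    """Count the observations that the rule classifies incorrectly."""
    return sum(1 for e in data if rule(e) != e["is_bird"])

for n in (1, 2, 3):
    seen = examples[:n]
    print(n, errors(rule_black_and_flies, seen), errors(rule_flies, seen))
```

The first rule fits example 1 perfectly but fails on example 2; the second fits examples 1 and 2 but fails on example 3. With only these two features no rule can separate examples 1 and 3, which is why the choice of explaining features matters.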
GENERALIZATION VS. SPECIFICATION
• A general concept is built based on the explaining features. The
right set of explaining features is crucial for learning
• Being overly specific means memorizing, not learning
• Being too general means being too coarse and missing some of
the details
• Finding the sweet spot between generalization and specificity is
hard
GENERALIZATION VS. SPECIFICATION
Let us find a function that estimates Y=f(X)
[Figure, three panels: Too General / Too Simple / Underfitted; Too Specific / Too Complex / Overfitted; A nice solution to the trade-off]
OVER FITTING & UNDER FITTING
• We search for $Y = f(X_1, X_2, \ldots, X_n)$
• We know that in addition to the functional dependency (the
signal), the actual Y values are also affected by noise
• We want the model to learn the signal, but not to be affected by
the noise
• A model that is too simple to learn the signal is called underfitted
(high bias)
• A model so complex that it adapts itself to the noise is called
overfitted (high variance)
The more complexity you add to the model, the better you can
fit it to the training observations.
This is not always good practice!
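A minimal sketch of this effect, under stated assumptions: the data are synthetic (a hypothetical relation Y = 2X plus Gaussian noise), and the three models are stand-ins for "too simple", "reasonable" and "too complex". The memorizer is a lookup table, the most extreme form of over fitting.

```python
import random

def mse(preds, ys):
    """Mean squared error between predictions and true values."""
    return sum((p - y) ** 2 for p, y in zip(preds, ys)) / len(ys)

def fit_constant(xs, ys):
    """Too simple: always predict the mean of the training results."""
    mean_y = sum(ys) / len(ys)
    return lambda x: mean_y

def fit_line(xs, ys):
    """Least-squares straight line."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    slope = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
             / sum((x - mx) ** 2 for x in xs))
    return lambda x: my + slope * (x - mx)

def fit_memorizer(xs, ys):
    """Too complex: store every training result and replay it."""
    table = dict(zip(xs, ys))
    return lambda x: table[x]

rng = random.Random(0)
xs = list(range(30))
y_train = [2 * x + rng.gauss(0, 3) for x in xs]  # one noisy sample
y_test = [2 * x + rng.gauss(0, 3) for x in xs]   # fresh noise, same signal

results = {}
for name, fit in [("constant", fit_constant), ("line", fit_line),
                  ("memorizer", fit_memorizer)]:
    model = fit(xs, y_train)
    preds = [model(x) for x in xs]
    results[name] = (mse(preds, y_train), mse(preds, y_test))
    print(f"{name:10s} train MSE={results[name][0]:8.2f}  "
          f"test MSE={results[name][1]:8.2f}")
```

The memorizer reaches zero error on the training observations because it has also stored their noise; its error on the fresh sample is what reveals the problem.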
A PARTIAL LIST OF SUPERVISED
LEARNING METHODS
• K-Nearest Neighbor
• SVM (Optimal Margin Linear Separation)
• Decision Trees
• Naïve Bayes
• Linear Regression
• Logistic Regression
• (Deep) Neural Networks
K-NN
[Figure: scatter plot; axes: Recipients, Email Length]
Given a new observation, find the K closest available observations and:
• In regression, use the average result of these K observations
• In classification, use voting amongst these K observations
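The two K-NN variants above can be sketched as follows. The Euclidean distance, the spam/ham labels and the sample emails are assumptions made for illustration; the slides only name the two axes (recipients and email length).

```python
import math
from collections import Counter

def k_nearest(train, query, k):
    """Return the k training rows closest to query (Euclidean distance)."""
    return sorted(train, key=lambda row: math.dist(row[0], query))[:k]

def knn_classify(train, query, k=3):
    """Classification: majority vote among the k nearest observations."""
    labels = [label for _, label in k_nearest(train, query, k)]
    return Counter(labels).most_common(1)[0][0]

def knn_regress(train, query, k=3):
    """Regression: average result of the k nearest observations."""
    values = [value for _, value in k_nearest(train, query, k)]
    return sum(values) / len(values)

# Hypothetical emails: (number of recipients, length in words) -> label
emails = [((1, 120), "ham"), ((2, 200), "ham"), ((1, 90), "ham"),
          ((40, 30), "spam"), ((55, 25), "spam"), ((60, 45), "spam")]
print(knn_classify(emails, (50, 35), k=3))

# Hypothetical one-feature regression data: (x,) -> numeric result
readings = [((0,), 0.0), ((1,), 10.0), ((2,), 20.0), ((10,), 100.0)]
print(knn_regress(readings, (1,), k=3))
```

Note that the features are used on their raw scales here, so the larger-valued feature dominates the distance; in practice features are usually rescaled first.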
K-NN
[Figure: scatter plot; axes: Recipients, Email Length; K=3]
A few concerns:
• What should k be?
• Which distance measure should be used?
• Computation
LINEAR SEPARATORS
How would you classify this data?
[Figure: scatter plot; axes: X1, X2]
In SVM we search for the linear separator that has the maximal
margin.
Using a mathematical trick called the kernel trick, SVMs can also
find non-linear separators.
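To make "maximal margin" concrete, the sketch below measures the margin of two candidate separating lines on a tiny hypothetical data set. It does not train an SVM; an SVM solver searches over all separators for the one with the largest such margin. The points and both candidate lines are made up for illustration.

```python
import math

def margin(w, b, points):
    """Smallest signed distance from the labeled points to the line w.x + b = 0.

    Each point is ((x1, x2), label) with label +1 or -1. A positive result
    means the line separates the two classes; a larger value means a wider
    margin.
    """
    norm = math.hypot(*w)
    return min(label * (w[0] * x1 + w[1] * x2 + b) / norm
               for (x1, x2), label in points)

points = [((1, 1), -1), ((2, 1), -1), ((4, 4), +1), ((5, 4), +1)]

margin_a = margin((1, 0), -3, points)    # vertical line x1 = 3
margin_b = margin((1, 1), -5.5, points)  # diagonal line x1 + x2 = 5.5
print(round(margin_a, 3), round(margin_b, 3))
```

Both lines classify every point correctly, but the diagonal one keeps the closest points farther away, so a maximal-margin method would prefer it over the vertical one.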
DECISION TREES
Example: classify new customers into one of two groups:
Standard and VIP.
Training set: a list of customers that were once new, along with
an annotation that reflects whether these customers should have been
identified as VIP (this annotation is made only after some time).
Let us say that we have 1,000 VIP and 4,000 Standard new
customers.
DECISION TREES
The population is a mix of different types. What if we could find a
splitting criterion that creates two (or more) purer
sub-populations?
1,000 V, 4,000 S
    Self Employed: 600 V, 800 S
    Employees: 400 V, 3,200 S
DECISION TREES
Now, we can take each sub-population and split it recursively,
until some stopping criteria are met.
DECISION TREES
• Decision trees are the result of a recursive splitting mechanism
• Each split is chosen so as to maximize the purity of the
sub-populations that result from the split
• There are a few ways to model node purity. Often the concept of
minimal entropy (or a variation of minimal entropy) is used
• Each split is made according to the values of one of the
explaining features
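As a worked illustration of entropy-based purity, the sketch below scores the Self Employed / Employees split from the earlier slide (1,000 VIP and 4,000 Standard customers). Using Shannon entropy and information gain is one common choice, assumed here; the function names are illustrative.

```python
import math

def entropy(counts):
    """Shannon entropy (in bits) of a node, given its class counts."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)

def information_gain(parent, children):
    """Entropy of the parent minus the size-weighted entropy of its children."""
    total = sum(parent)
    weighted = sum(sum(child) / total * entropy(child) for child in children)
    return entropy(parent) - weighted

parent = (1000, 4000)                 # (VIP, Standard) before the split
children = [(600, 800), (400, 3200)]  # Self Employed, Employees

gain = information_gain(parent, children)
print(round(entropy(parent), 3), round(gain, 3))
```

The parent node has an entropy of about 0.72 bits and the split lowers the weighted entropy by about 0.08 bits. A tree learner would compute this gain for every candidate split and pick the largest.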
LINEAR REGRESSION
[Figure: scatter plot of House Price ($1000s), 0 to 450, against Square Feet, 0 to 3000]
SUPERVISED LEARNING EVALUATION
Since Supervised Learning is all about generalization, a good
model is a model that can be applied successfully to new
observations
In classification tasks, we are often interested in the probability
that the model will predict the true outcome. This probability is
called the model accuracy
In regression tasks, we are often interested in the typical
deviation between the outcome of the model and the true
outcome. A common measure of this deviation is the root mean
squared error (RMSE)
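Both measures are short to compute. The sketch below uses made-up predictions purely to show the arithmetic; RMSE stands for root mean squared error.

```python
import math

def accuracy(predicted, actual):
    """Share of observations for which the predicted class is the true one."""
    return sum(p == a for p, a in zip(predicted, actual)) / len(actual)

def rmse(predicted, actual):
    """Root mean squared error between predicted and true numeric outcomes."""
    return math.sqrt(
        sum((p - a) ** 2 for p, a in zip(predicted, actual)) / len(actual))

# Hypothetical classification results: 3 of 4 predictions are correct.
acc = accuracy(["Red", "Blue", "Blue", "Red"], ["Red", "Blue", "Red", "Red"])

# Hypothetical house prices in $1000s: errors of -10, +10 and 0.
err = rmse([200.0, 310.0, 150.0], [210.0, 300.0, 150.0])

print(acc, round(err, 2))
```

Because the errors are squared before averaging, RMSE penalizes a few large misses more heavily than many small ones.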
SUPERVISED LEARNING EVALUATION
It is always possible to build an overfitted model, so the quality
of the model on the training set says very little about its capability
to generalize to new observations.
Therefore, never evaluate a model using the training set
Instead:
• Use an independent (randomly selected) test set
• Use cross validation
SUPERVISED LEARNING EVALUATION
Confusion Matrix
            Classified As
Actual      Blue    Red
Blue        7       1
Red         0       5
Accuracy (on test set) = (7+5)/(7+5+1+0) = 12/13 ≈ 0.92
CROSS VALIDATION
Randomly break the training set into k mutually exclusive,
collectively exhaustive sets of similar size (often k=10).
For i = 1, 2, …, k:
    Train a model using all the sets except the i-th set.
    Evaluate the trained model over the i-th set.
You end up with k evaluation measures. Evaluate the entire
model as the average of these k results.
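The procedure above can be sketched as follows. The `fit` and `score` callables are placeholders for whatever model and evaluation measure are in use; the toy mean-predictor and the data are hypothetical and serve only to make the example run.

```python
import random

def k_fold_indices(n, k, seed=0):
    """Randomly split indices 0..n-1 into k disjoint folds of similar size."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

def cross_validate(xs, ys, fit, score, k=10, seed=0):
    """Average the score obtained on each held-out fold."""
    folds = k_fold_indices(len(xs), k, seed)
    scores = []
    for i, held_out in enumerate(folds):
        # Train on every fold except the i-th one.
        train = [j for f, fold in enumerate(folds) if f != i for j in fold]
        model = fit([xs[j] for j in train], [ys[j] for j in train])
        # Evaluate on the i-th fold only.
        scores.append(score(model,
                            [xs[j] for j in held_out],
                            [ys[j] for j in held_out]))
    return sum(scores) / k

def fit_mean(xs, ys):
    """Toy model: always predict the mean training result."""
    return sum(ys) / len(ys)

def mse_of_mean(model, xs, ys):
    """Mean squared error of the constant prediction on held-out data."""
    return sum((model - y) ** 2 for y in ys) / len(ys)

xs = list(range(50))
ys = [float(x % 5) for x in xs]
cv_mse = cross_validate(xs, ys, fit_mean, mse_of_mean, k=10)
print(round(cv_mse, 3))
```

Every observation is used for evaluation exactly once and is never part of the training data of the model that is evaluated on it.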
SUPERVISED LEARNING SUMMARY
• Two sub problems: classification and regression
• Supervised Learning is all about generalizing from a given
training set
• There is an inherent, hard-to-solve trade-off between
generalization and over-specification
• The more complexity you add to your model, the better it
can fit the training set. You may end up with an overfitted model
• Therefore, never evaluate a model on the training set that
was used to induce it
• Instead, use either an independent test set or cross
validation
SUPERVISED LEARNING SUMMARY
• We also got familiar with 4 SL methods: K-NN, SVM, Decision
trees and Linear regression
UNSUPERVISED LEARNING
X1       X2       X3       …   Xn-2       Xn-1       Xn
x1,1     x2,1     x3,1     …   xn-2,1     xn-1,1     xn,1
x1,2     x2,2     x3,2     …   xn-2,2     xn-1,2     xn,2
…        …        …        …   …          …          …
x1,m-1   x2,m-1   x3,m-1   …   xn-2,m-1   xn-1,m-1   xn,m-1
x1,m     x2,m     x3,m     …   xn-2,m     xn-1,m     xn,m
Extract interesting patterns from the input set, or
provide an insightful representation of the input space
UNSUPERVISED LEARNING
Unsupervised Learning tasks:
• Cluster Analysis
• Association Rules Mining
• Hidden Markov Models
• Dimensionality Reduction
• Self-Organising Maps
CLUSTER ANALYSIS
Data points that share a cluster need to be similar
Data points in different clusters need to be different
Similarity = Low distance; Difference = High distance?
CLUSTER ANALYSIS
K-Means:
Initialize: place k cluster centroids in the feature space
Repeat until some stopping criteria are met:
    Assign each data point to the closest centroid
    Move each centroid to the center of the points that are
    assigned to it
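A minimal sketch of these steps, assuming Euclidean distance, caller-supplied initial centroids, and "no point changed cluster" as the stopping criterion. The data points and starting centroids are hypothetical.

```python
import math

def k_means(points, centroids, max_iter=100):
    """Run K-Means from the given initial centroids.

    Returns the final centroids and, for each point, the index of the
    cluster it was assigned to.
    """
    centroids = list(centroids)
    k = len(centroids)
    assignment = None
    for _ in range(max_iter):
        # Step 1: assign each data point to the closest centroid.
        new_assignment = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        if new_assignment == assignment:  # stopping criterion: nothing moved
            break
        assignment = new_assignment
        # Step 2: move each centroid to the center of its assigned points.
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = tuple(sum(v) / len(members)
                                     for v in zip(*members))
    return centroids, assignment

points = [(1, 1), (1, 2), (2, 1), (9, 9), (9, 10), (10, 9)]
centroids, assignment = k_means(points, centroids=[(1, 1), (9, 9)])
print(assignment, centroids)
```

The result depends on where the centroids start, which is why implementations often run the algorithm several times from different starting positions.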
CLUSTER ANALYSIS
Does distance mean similarity?
What distance?
For example, let us look at similarity in monthly salary.
Mr. X earns $2,500 a month
Mrs. Y earns $250,000 a month
Mr. Z earns $100,000 a month. Is he more similar (in terms of
salary) to X or to Y?
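The answer depends on the distance measure. The sketch below compares an absolute distance with a distance on a logarithmic scale. It assumes Mr. Z earns $100,000 a month (the figure on the slide is ambiguous), and the choice of these two measures is illustrative.

```python
import math

# Monthly salaries in dollars; Z's value is an assumption (see above).
salaries = {"X": 2_500, "Y": 250_000, "Z": 100_000}

def absolute_distance(a, b):
    """Difference in dollars."""
    return abs(a - b)

def log_distance(a, b):
    """Difference in orders of magnitude (base-10 logarithm)."""
    return abs(math.log10(a) - math.log10(b))

for other in ("X", "Y"):
    print(other,
          absolute_distance(salaries["Z"], salaries[other]),
          round(log_distance(salaries["Z"], salaries[other]), 2))
```

In dollars Z is closer to X (97,500 against 150,000), yet on the logarithmic scale Z is closer to Y (about 0.4 against 1.6 orders of magnitude). Which notion of similarity is right depends on the business question.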
How should we compute a multi-dimensional distance?
Player Name          Height  Position  Age  Plays in  Goals this year  Annual Wages  Country of Birth
Lionel Andrés Messi  169 cm  Forward   30   Spain     41               36 M EUR      Argentina
Cristiano Ronaldo    185 cm  Forward   31   Spain     27               17 M EUR      Portugal
HOW TO GET STARTED
• Maintaining and manipulating ever more data is becoming
ever more affordable
• Machine Learning offers a very rich set of boxes.
Selecting the right boxes and building a business solution
requires a lot of experience
• Training the right models, tuning the parameters, evaluating
performance and implementation all require some level of
expertise, but these should not be your first concerns
• The prediction is not in the box
HOW TO GET STARTED
[Figure labels: Business Definition; Machine Learning; Implement; Business Value; CRISP-DM]
HOW TO GET STARTED
A recommended checklist, before you even start:
1. What am I trying to achieve, business-wise?
2. What data does it require? Do I have this data? Am I allowed to
use it?
3. What will be the output of a machine learning model?
4. Can my operations use that output? How?
5. What machine learning task am I trying to solve?
6. What are the success criteria?
7. Who will be the ones to run the project?
8. How long will it take? How much will it cost?
SUMMARY
• Machine learning = designing machines that learn from
experience
• Three typical tasks:
• Supervised Learning
• Unsupervised Learning
• Reinforcement Learning
• Supervised Learning:
• Learning means generalization
• Generalization vs. Specification, Over fitting and Under fitting
• Classification vs. Regression
SUMMARY
• Supervised Learning algorithms:
• K-NN
• SVM
• Decision Trees
• Linear Regression
• More
• Unsupervised Learning
• Cluster analysis: similarity and distance
• Association rules
• Reinforcement Learning
• The big data challenge of Machine Learning
• CRISP-DM
 
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service LucknowAminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknowmakika9823
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Digi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxDigi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxTanveerAhmed817946
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationPredicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationBoston Institute of Analytics
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 

Recently uploaded (20)

FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service LucknowAminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Digi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxDigi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptx
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationPredicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project Presentation
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 

Introduction to Machine Learning

  • 2. CHILD LEARNING
      Child: Daddy, what is danger?
      Dad: The possibility of suffering harm or injury.
      Child: Daddy, what is an injury?
      Dad: An instance of being injured.
      Child: Daddy, what is an instance?
      Dad: An example or single occurrence of something.
      Child: Daddy, does it bother you that I’m asking so many questions?
      Dad: Not at all. If you don't ask, you will never know.
  • 3. CHILD LEARNING Dad: Let me give you some examples…
  • 4. CHILD LEARNING Child: Now I understand, everything is dangerous. Dad: No, there are things that aren't dangerous.
  • 6. CHILD LEARNING And there is the most natural mode of learning
      Action                    | Reaction            | Lesson
      Touching hot stove        | Aching hand         | Do not touch again
      Playing with toys         | Fun                 | Continue playing
      Running into the road     | Screaming parent    | Don’t run to roads
      Running in the house      | Fun                 | Run in the house
      Eating chocolate          | Fun                 | Search for chocolate
      Eating too much chocolate | Stomach ache        | Don’t eat too much
      Saying bla bla            | No reaction         | Try variations
      Saying daddy              | Overexcited parents | Do that again
  • 7. SO, HOW DO CHILDREN LEARN? 1. From explanation 2. From examples 3. Reinforcement Learning
  • 10. AGENDA • What is Machine Learning • Typical Machine Learning Tasks • Supervised Learning • Unsupervised Learning • How to Get Started • Summary
  • 12. WHAT IS MACHINE LEARNING? We say that a computer program is learning a task, if its performance on that task is improving as more experience is processed
  • 13. WHAT IS MACHINE LEARNING? Machine Learning Statistics Databases & Big Data Decision Theory Artificial Intelligence Optimization
  • 14. WHAT IS MACHINE LEARNING? Machine Learning Statistics Databases & Big Data Decision Theory Artificial Intelligence Optimization Data Science
  • 16. TYPICAL MACHINE LEARNING TASKS No two Machine Learning tasks are identical. Yet, we often use the following categories: • Supervised Learning • Unsupervised Learning • Reinforcement Learning
  • 17. SUPERVISED LEARNING Estimate or predict an unknown result, given explicit values of some explaining features. The learning is based on a history of observations, for which both the explaining features and the results are known. Experience = supervised examples (exactly as in inferring what is dangerous from examples). We call the dataset that describes the experience the training set.
  • 18. SUPERVISED LEARNING Estimate or predict an unknown result, given explicit values of some explaining features. We call the dataset that describes the experience the training set. When the unknown result is numeric, we call the task Regression. When the unknown result is categorical, we call the task Classification.
  • 19. SUPERVISED LEARNING Example 1: What will be the annual spend of a new customer, given a set of explaining features (e.g., demographics, first purchases, first deposit etc.)? Task qualifications: Prediction, Regression Training set: a file in which each row represents a customer. For each such customer we extract the explaining features as they were at the prediction point, as well as the annual spend (measured a year later).
  • 20. SUPERVISED LEARNING Example 2: What is the activity currently performed by a user who is wearing a smart watch with inertial sensors? Task qualifications: Assessment, Classification Input: A set of sensor-based signals, along with an annotation of the activity during each signal. Requires a significant amount of pre-processing in order to produce the training set.
  • 22. UNSUPERVISED LEARNING Given a specific set of records, described by a given set of features, either: 1. Extract interesting patterns that appear in the data 2. Provide an insightful representation of the distribution of the data Experience: the more records we have, the more significant the patterns we can extract and the more accurate the representation
  • 23. UNSUPERVISED LEARNING Example: Market Segmentation Input data: Customers’ descriptions Objective: Provide an insightful representation of the market (what types of customers are there?) Also known as cluster analysis
  • 24. REINFORCEMENT LEARNING Learning how to best react to a situation through trial and error. Simple Example: Multiple A/B testing More Typical: Robot Navigation Designing an RL system requires solving two difficult challenges: • The exploration – exploitation dilemma • Attributing delayed rewards
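The "multiple A/B testing" example and the exploration versus exploitation dilemma can be sketched with an epsilon-greedy policy. This is a minimal illustration, not something taken from the slides: the conversion rates, the value of `epsilon`, and the function name `epsilon_greedy` are all assumptions, and rewards here are immediate, so the delayed-reward challenge is not covered.

```python
import random

def epsilon_greedy(true_rates, rounds=5000, epsilon=0.1, seed=0):
    """Simulate an epsilon-greedy policy over several variants (arms)."""
    rng = random.Random(seed)
    counts = [0] * len(true_rates)        # how often each variant was shown
    estimates = [0.0] * len(true_rates)   # running mean reward per variant
    for _ in range(rounds):
        if rng.random() < epsilon:
            # Explore: try a random variant.
            arm = rng.randrange(len(true_rates))
        else:
            # Exploit: show the variant that currently looks best.
            arm = max(range(len(true_rates)), key=lambda a: estimates[a])
        reward = 1.0 if rng.random() < true_rates[arm] else 0.0
        counts[arm] += 1
        # Incremental update of the mean reward of the chosen variant.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
    return counts, estimates

# Hypothetical conversion rates of three page variants.
counts, estimates = epsilon_greedy([0.2, 0.5, 0.8])
print(counts, [round(e, 2) for e in estimates])
```

A larger `epsilon` spends more traffic on exploration; a smaller one commits earlier to whatever looks best so far.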
  • 25. UNSTRUCTURED INPUTS The input data often come in an unstructured form, such as: • Free text • Speech • Images • Video • Sensors • Networks
  • 27. SUPERVISED LEARNING
      X1       X2       X3       …   Xn-2       Xn-1       Xn       | Y
      x1,1     x2,1     x3,1     …   xn-2,1     xn-1,1     xn,1     | y1
      x1,2     x2,2     x3,2     …   xn-2,2     xn-1,2     xn,2     | y2
      …        …        …        …   …          …          …        | …
      x1,m-1   x2,m-1   x3,m-1   …   xn-2,m-1   xn-1,m-1   xn,m-1   | ym-1
      x1,m     x2,m     x3,m     …   xn-2,m     xn-1,m     xn,m     | ym
      $Y = f(X_1, X_2, \ldots, X_n)$
  • 28. LEARNING THE CONCEPT OF A BIRD An alien asks you: “What is a bird?” You can try and define a bird, but the alien does not understand. Why don’t you give an example…
  • 29. LEARNING THE CONCEPT OF A BIRD
      Example # | Color | Can Fly? | Is Bird?
      1         | Black | Yes      | Yes
      What do you say about the following classification model: “If Color = Black and Can_Fly = Yes then Bird Else Not_Bird”?
  • 30. LEARNING THE CONCEPT OF A BIRD
      Example # | Color | Can Fly? | Is Bird?
      1         | Black | Yes      | Yes
      2         | Grey  | Yes      | Yes
      What do you say about the following classification model: “If Can_Fly = Yes then Bird Else Not_Bird”?
  • 31. LEARNING THE CONCEPT OF A BIRD
      Example # | Color | Can Fly? | Is Bird?
      1         | Black | Yes      | Yes
      2         | Grey  | Yes      | Yes
      3         | Black | Yes      | No
      Supervised Learning means generalizing from given observations.
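The bird example can be replayed in a few lines: each candidate rule is checked against the examples seen so far. The function names below are made up for this sketch; the three examples and the two rules are the ones quoted on the slides.

```python
# The three labelled examples from the slides: (color, can_fly, is_bird).
examples = [
    ("Black", True, True),
    ("Grey", True, True),
    ("Black", True, False),
]

def rule_black_and_flies(color, can_fly):
    """If Color = Black and Can_Fly = Yes then Bird else Not_Bird."""
    return color == "Black" and can_fly

def rule_flies(color, can_fly):
    """If Can_Fly = Yes then Bird else Not_Bird."""
    return can_fly

def errors(rule, data):
    """Count the examples on which the rule disagrees with the label."""
    return sum(rule(color, can_fly) != is_bird for color, can_fly, is_bird in data)

for n in (1, 2, 3):
    seen = examples[:n]
    print(n, errors(rule_black_and_flies, seen), errors(rule_flies, seen))
```

Examples 1 and 3 share the same feature values but carry different labels, so no rule built only on Color and Can Fly can classify all three correctly. That is one way to see why the choice of explaining features matters.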
  • 32. GENERALIZATION VS. SPECIFICATION • A general concept is built based on the explaining features. The right set of explaining features is crucial for learning • Being overly specific means memorizing and not learning • Being too general means being too coarse and missing some of the details • Finding the sweet spot between generalization and specificity is hard
  • 33. GENERALIZATION VS. SPECIFICATION Let us find a function that estimates Y=f(X)
  • 35. (Figure, three panels) Too General / Too Simple / Under fitted; Too Specific / Too Complex / Over fitted; A nice solution to the trade-off
  • 36. OVER FITTING & UNDER FITTING • We search for $Y = f(X_1, X_2, \ldots, X_n)$ • In addition to the functional dependency f (the signal), the actual Y values are also affected by noise • We want the model to learn the functional dependency, but not to be affected by the noise • A model that is too simple to learn the functional dependency is called under fitted (high bias) • A model so complex that it adapts itself to the noise is called over fitted (high variance) The more complexity you add to the model, the better you can fit it to the training observations. This is not always a good practice!
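The trade-off can be made concrete with a small experiment. The sketch below is an illustration under stated assumptions: the data are synthetic (a straight line plus alternating noise), and the three "models" (a constant, a least-squares line, and a memorizer that copies the nearest training point) are stand-ins chosen for this example, not methods prescribed by the slides.

```python
import math

# Synthetic data (an assumption for illustration): y = 2x plus alternating noise of +/-1.
train = [(float(x), 2.0 * x + (1.0 if x % 2 == 0 else -1.0)) for x in range(10)]
# Test points lie between the training points and carry no noise.
test = [(x + 0.25, 2.0 * (x + 0.25)) for x in range(9)]

def rmse(model, data):
    """Root mean squared error of a model (a function of x) on (x, y) pairs."""
    return math.sqrt(sum((model(x) - y) ** 2 for x, y in data) / len(data))

def fit_constant(data):
    """Too simple: always predict the mean of the training targets."""
    mean_y = sum(y for _, y in data) / len(data)
    return lambda x: mean_y

def fit_line(data):
    """Least-squares straight line through the training points."""
    n = len(data)
    mean_x = sum(x for x, _ in data) / n
    mean_y = sum(y for _, y in data) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in data)
             / sum((x - mean_x) ** 2 for x, _ in data))
    return lambda x: mean_y + slope * (x - mean_x)

def fit_memorizer(data):
    """Too complex: return the target of the closest training point."""
    return lambda x: min(data, key=lambda point: abs(point[0] - x))[1]

models = {
    "constant": fit_constant(train),
    "line": fit_line(train),
    "memorizer": fit_memorizer(train),
}
for name, model in models.items():
    print(f"{name:10s} train RMSE={rmse(model, train):.2f}  test RMSE={rmse(model, test):.2f}")
```

The memorizer reproduces every training observation exactly, noise included, yet it does worse than the straight line on the unseen points; the constant model does poorly on both sets.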
  • 37. A PARTIAL LIST OF SUPERVISED LEARNING METHODS • K- Nearest Neighbor • SVM (Optimal Margin Linear Separation) • Decision Trees • Naïve Bayes • Linear Regression • Logistic Regression • (Deep) Neural Networks
  • 39. K-NN (Figure: scatter plot with axes Recipients and Email Length) Given a new observation, find the K closest available observations and: • In regression, use the average result of these K observations • In classification, use voting amongst these K observations
  • 40. K-NN (Figure: scatter plot with axes Recipients and Email Length, K=3) A few concerns: • What should k be? • Which distance measure should be used? • Computation
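A minimal K-NN sketch follows, covering both the voting and the averaging variants. Euclidean distance is an assumption (the slides leave the distance measure open), and the email data and the "spam"/"ham" labels are hypothetical values invented to match the two axes of the figure.

```python
import math
from collections import Counter

def k_nearest(train, query, k):
    """Return the k training rows closest to `query` (Euclidean distance)."""
    return sorted(train, key=lambda row: math.dist(row[0], query))[:k]

def knn_classify(train, query, k=3):
    """Classification: majority vote amongst the k nearest neighbours."""
    votes = Counter(label for _, label in k_nearest(train, query, k))
    return votes.most_common(1)[0][0]

def knn_regress(train, query, k=3):
    """Regression: average target of the k nearest neighbours."""
    neighbours = k_nearest(train, query, k)
    return sum(target for _, target in neighbours) / len(neighbours)

# Hypothetical emails: (number of recipients, length in words) -> label.
emails = [((1, 120), "ham"), ((2, 200), "ham"), ((1, 90), "ham"),
          ((40, 30), "spam"), ((55, 20), "spam"), ((35, 45), "spam")]
print(knn_classify(emails, (45, 25), k=3))

# Hypothetical one-feature regression data.
points = [((0,), 0.0), ((1,), 10.0), ((2,), 20.0), ((10,), 100.0)]
print(knn_regress(points, (1,), k=3))
```

Every prediction scans the whole training set, which is the computation concern listed on the slide. The features are also used unscaled here, so the one with the larger numeric range dominates the distance.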
  • 41. LINEAR SEPARATORS How would you classify this data? (Figure: scatter plot with axes X1 and X2)
  • 44. LINEAR SEPARATORS (Figure: scatter plot with axes X1 and X2) In SVM we search for the linear separator that has the maximal margin. Using a mathematical trick, called the kernel trick, SVMs can also find non-linear separators.
  • 45. DECISION TREES Example: classify new customers into one of two groups: Standard and VIP. Training set: a list of customers that were once new, along with an annotation that reflects whether each customer should have been identified as VIP (this annotation is made only after some time). Let us say that we have 1,000 VIPs and 4,000 Standard new customers.
  • 46. DECISION TREES Let us say that we have 1,000 VIPs and 4,000 Standard new customers. (Tree root: 1,000 V, 4,000 S)
  • 47. DECISION TREES The population is a mix of different types. What if we could find a splitting criterion that creates two (or more) purer sub-populations? (Tree root: 1,000 V, 4,000 S)
  • 48. DECISION TREES The population is a mix of different types. What if we could find a splitting criterion that creates two (or more) purer sub-populations? (Tree root: 1,000 V, 4,000 S; split into Self Employed: 600 V, 800 S and Employees: 400 V, 3,200 S)
  • 49. DECISION TREES Now we can take each sub-population and split it recursively, until some stopping criteria are met. (Tree root: 1,000 V, 4,000 S; split into Self Employed: 600 V, 800 S and Employees: 400 V, 3,200 S)
  • 50. DECISION TREES • Decision trees are the result of a recursive splitting mechanism • Each split is chosen so as to maximize the purity of the sub-populations that result from the split • There are a few ways to model node purity. Often the concept of minimal entropy (or a variation of it) is used • Each split is made according to the values of one of the explaining features
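To see what "purity" means numerically, the sketch below scores the Self Employed / Employees split from the earlier slide using Shannon entropy and information gain. Entropy is one of several purity measures; the function names are chosen for this example.

```python
import math

def entropy(counts):
    """Shannon entropy (in bits) of a node, given its class counts."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)

def information_gain(parent, children):
    """Entropy of the parent minus the size-weighted entropy of the children."""
    total = sum(parent)
    weighted = sum(sum(child) / total * entropy(child) for child in children)
    return entropy(parent) - weighted

parent = (1000, 4000)                   # (VIP, Standard) before the split
children = [(600, 800), (400, 3200)]    # Self Employed, Employees

print(f"parent entropy:   {entropy(parent):.3f} bits")
print(f"information gain: {information_gain(parent, children):.3f} bits")
```

A tree learner would compute such a score for every candidate split and keep the one with the largest gain, then repeat the procedure inside each child node.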
  • 51. LINEAR REGRESSION (Figure: scatter plot of House Price ($1000s) against Square Feet)
  • 53. SUPERVISED LEARNING EVALUATION Since Supervised Learning is all about generalization, a good model is a model that can be applied successfully to new observations. In classification tasks, we are often interested in the probability that the model will predict the true outcome. This probability is called the model accuracy. In regression tasks, we are often interested in the typical deviation between the outcome of the model and the true outcome. A common measure of this deviation is the root mean squared error (RMSE).
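Both measures are short to write down. In the sketch below, accuracy is estimated as the fraction of correct predictions and RMSE as the square root of the mean squared deviation; the sample labels and values are hypothetical.

```python
import math

def accuracy(actual, predicted):
    """Fraction of observations whose predicted class equals the true class."""
    return sum(a == p for a, p in zip(actual, predicted)) / len(actual)

def rmse(actual, predicted):
    """Root mean squared error between true and predicted numeric outcomes."""
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))

# Hypothetical classification results (three of four are correct).
print(accuracy(["VIP", "Standard", "Standard", "VIP"],
               ["VIP", "Standard", "VIP", "VIP"]))

# Hypothetical regression results.
print(rmse([200.0, 310.0], [203.0, 306.0]))
```

Because the deviations are squared before averaging, RMSE penalizes a few large errors more heavily than many small ones.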
  • 55. SUPERVISED LEARNING EVALUATION It is always possible to build an over fitted model. So the quality of the model on the training set says very little about the capability of the model to generalize to new observations. Therefore, never evaluate a model using the training set. Instead: • Use an independent (randomly selected) test set • Use cross validation
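  • 57. SUPERVISED LEARNING EVALUATION Confusion Matrix (Actual vs. Classified As) on the test set, classes Blue and Red: 7 Blue and 5 Red observations are classified correctly; the two off-diagonal cells hold 1 and 0 observations. Accuracy (on test set) = (7+5)/(7+5+1+0)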
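The same arithmetic can be reproduced from label lists. The sketch below builds a confusion matrix as a counter of (actual, classified-as) pairs and reads the accuracy off its diagonal. The label lists are hypothetical and only reproduce the counts on the slide; which off-diagonal cell holds the single error is an assumption.

```python
from collections import Counter

def confusion_matrix(actual, predicted):
    """Count how often each (actual, classified-as) pair occurs."""
    return Counter(zip(actual, predicted))

def accuracy_from_matrix(matrix):
    """Sum of the diagonal cells divided by the total number of observations."""
    correct = sum(n for (a, p), n in matrix.items() if a == p)
    return correct / sum(matrix.values())

# Hypothetical test set: 7 + 5 correct, one Blue observation classified as Red.
actual = ["Blue"] * 8 + ["Red"] * 5
predicted = ["Blue"] * 7 + ["Red"] * 1 + ["Red"] * 5

matrix = confusion_matrix(actual, predicted)
print(dict(matrix))
print(f"accuracy = {accuracy_from_matrix(matrix):.3f}")
```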
  • 58. CROSS VALIDATION Randomly break the training set into k mutually exclusive, collectively exhaustive sets, of similar size (often k=10). For i=1,2,…k: Train a model using all the sets, except for the i-th set. Evaluate the trained model over the i-th set. You end up with k evaluation measures. Evaluate the entire model as the average of these k results.
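The procedure above can be sketched as follows. The helper names, the fixed random seed, and the toy "model" that predicts the mean target are assumptions made for this example; `train_fn` and `eval_fn` stand for whatever training and evaluation routines are actually in use.

```python
import random

def k_fold_indices(n, k, seed=0):
    """Shuffle 0..n-1 and deal the indices into k folds of similar size."""
    indices = list(range(n))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]

def cross_validate(rows, k, train_fn, eval_fn):
    """Train on k-1 folds, evaluate on the held-out fold, and average the k scores."""
    folds = k_fold_indices(len(rows), k)
    scores = []
    for i, held_out in enumerate(folds):
        train_rows = [rows[j] for f, fold in enumerate(folds) if f != i for j in fold]
        model = train_fn(train_rows)
        scores.append(eval_fn(model, [rows[j] for j in held_out]))
    return sum(scores) / k

# Toy usage: the "model" is just the mean target of the training rows.
rows = [(x, 2.0 * x) for x in range(20)]

def train_mean(data):
    return sum(y for _, y in data) / len(data)

def mean_abs_error(model, data):
    return sum(abs(model - y) for _, y in data) / len(data)

print(cross_validate(rows, k=5, train_fn=train_mean, eval_fn=mean_abs_error))
```

Every observation is used for evaluation exactly once and for training k-1 times, which gives a steadier estimate than a single train/test split when data are scarce.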
  • 59. SUPERVISED LEARNING SUMMARY • Two sub-problems: classification and regression • Supervised Learning is all about generalizing from a given training set • There is an inherent, hard to solve trade-off between generalization and over specification • The more complexity you add to your model, the better it can fit the training set. You may end up with an over fitted model • Therefore, you never evaluate a model on the training set that was used to induce it • Instead, use either an independent test set or cross validation
  • 60. SUPERVISED LEARNING SUMMARY • We also got familiar with 4 SL methods: K-NN, SVM, Decision trees and Linear regression
  • 62. UNSUPERVISED LEARNING
      X1       X2       X3       …   Xn-2       Xn-1       Xn
      x1,1     x2,1     x3,1     …   xn-2,1     xn-1,1     xn,1
      x1,2     x2,2     x3,2     …   xn-2,2     xn-1,2     xn,2
      …        …        …        …   …          …          …
      x1,m-1   x2,m-1   x3,m-1   …   xn-2,m-1   xn-1,m-1   xn,m-1
      x1,m     x2,m     x3,m     …   xn-2,m     xn-1,m     xn,m
      Extract interesting patterns from the input set, or provide an insightful representation of the input space
  • 63. UNSUPERVISED LEARNING Unsupervised Learning tasks: • Cluster Analysis • Association Rules Mining • Hidden Markov Models • Dimensionality Reduction • Self-Organising Maps
  • 64. CLUSTER ANALYSIS Data points that share a cluster need to be similar Data points in different clusters need to be different Similarity = Low distance Difference = High distance?
  • 68. CLUSTER ANALYSIS K-Means: Initialize: place k cluster centroids on the feature space Repeat until some stopping criteria are met: Associate each data point to the closest centroid Move each centroid to the center of the points that are associated to it
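The K-Means loop described above can be written in plain Python as follows. The initialisation (k randomly chosen data points), the Euclidean distance, the stopping rule (assignments no longer change) and the sample points are choices made for this sketch; the slides leave them open.

```python
import math
import random

def k_means(points, k, iterations=100, seed=0):
    """Plain k-means on tuples of numbers; returns (centroids, assignments)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)           # initialise on k distinct data points
    assignments = None
    for _ in range(iterations):
        # Step 1: associate each point with its closest centroid.
        new_assignments = [
            min(range(k), key=lambda c: math.dist(p, centroids[c])) for p in points
        ]
        if new_assignments == assignments:      # stopping criterion: nothing changed
            break
        assignments = new_assignments
        # Step 2: move each centroid to the mean of the points assigned to it.
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:                          # keep the old centroid if a cluster is empty
                centroids[c] = tuple(sum(dim) / len(members) for dim in zip(*members))
    return centroids, assignments

# Hypothetical two-dimensional data with two well separated groups.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]
centroids, assignments = k_means(points, k=2)
print(centroids, assignments)
```

On less clearly separated data the result depends on the initial centroids, so it is common to run the algorithm several times with different seeds and keep the best clustering.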
  • 69. CLUSTER ANALYSIS Does distance mean similarity? What distance?
  • 70. CLUSTER ANALYSIS Does distance mean similarity? What distance? For example, let us look at similarity in monthly salary. Mr. X earns $2,500 a month. Mrs. Y earns $250,000 a month. Mr. Z earns $100,00 a month. Is he more similar (in terms of salary) to X or to Y?
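The answer depends on how distance is measured. The sketch below compares an absolute difference with a difference in orders of magnitude. The salaries of X and Y are taken from the slide; the value used for Z is a hypothetical figure picked for this illustration, not the one on the slide.

```python
import math

def absolute_distance(a, b):
    """Plain difference in dollars."""
    return abs(a - b)

def log_ratio_distance(a, b):
    """Distance in orders of magnitude: 1.0 means a factor of ten apart."""
    return abs(math.log10(a / b))

x_salary, y_salary = 2_500, 250_000   # from the slide
z_salary = 25_000                     # hypothetical value chosen for this sketch

for name, dist in [("absolute", absolute_distance), ("log-ratio", log_ratio_distance)]:
    print(name, dist(z_salary, x_salary), dist(z_salary, y_salary))
```

With the absolute difference this Z is far closer to X; on the logarithmic scale he sits midway between X and Y. Neither answer is wrong, which is why the distance measure has to be chosen with the business question in mind.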
  • 71. CLUSTER ANALYSIS Does distance mean similarity? What distance? How should we compute a multi-dimensional distance?
      Player Name         | Height | Position | Age | Plays in | Goals this year | Annual Wages | Country of Birth
      Lionel Andrés Messi | 169 cm | Forward  | 30  | Spain    | 41              | 36 M EUR     | Argentina
      Cristiano Ronaldo   | 185 cm | Forward  | 31  | Spain    | 27              | 17 M EUR     | Portugal
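One possible answer, in the spirit of Gower's distance, is to bring every feature to a dissimilarity between 0 and 1 and average the results. In the sketch below numeric features are divided by a feature range and categorical features contribute 0 when equal and 1 otherwise. The ranges are assumptions made only for this example, and only a subset of the columns is used.

```python
def mixed_distance(a, b, numeric_ranges):
    """Average per-feature dissimilarity, each in [0, 1].

    Numeric features: absolute difference divided by an assumed feature range.
    Categorical features: 0 if the values are equal, 1 otherwise.
    """
    parts = []
    for feature, value_a in a.items():
        value_b = b[feature]
        if feature in numeric_ranges:
            parts.append(min(abs(value_a - value_b) / numeric_ranges[feature], 1.0))
        else:
            parts.append(0.0 if value_a == value_b else 1.0)
    return sum(parts) / len(parts)

# Feature values from the slide; the ranges are assumptions for this sketch.
messi = {"height_cm": 169, "position": "Forward", "age": 30, "plays_in": "Spain", "goals": 41}
ronaldo = {"height_cm": 185, "position": "Forward", "age": 31, "plays_in": "Spain", "goals": 27}
ranges = {"height_cm": 40, "age": 25, "goals": 50}

print(mixed_distance(messi, ronaldo, ranges))
```

Giving every feature the same weight is itself a modelling decision; a weighted average is a natural extension when some features matter more for the segmentation at hand.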
  • 73. HOW TO GET STARTED • Maintaining and manipulating more and more data is becoming more and more affordable • Machine Learning offers a very rich set of boxes. Selecting the right boxes and building a business solution requires lots of experience • Training the right models, tuning the parameters, evaluating performance and implementation all require some level of expertise, but this should not be your first concern • The prediction is not in the box
  • 74. HOW TO GET STARTED Business Value Implement Machine Learning Business Definition
  • 76. HOW TO GET STARTED A recommended checklist, before you even start: 1. What am I trying to achieve, businesswise? 2. What data does it require? Do I have this data? Am I allowed to use it? 3. What will be the output of a machine learning model? 4. Can my operations use that output? How? 5. What machine learning task am I trying to solve? 6. What are the success criteria? 7. Who will be the ones to run the project? 8. How long will it take? How much will it cost?
  • 78. SUMMARY • Machine learning = designing machines that learn from experience • Three typical tasks: • Supervised Learning • Unsupervised Learning • Reinforcement Learning • Supervised Learning: • Learning means generalization • Generalization vs. Specification, Over fitting and Under fitting • Classification vs. Regression
  • 79. SUMMARY • Supervised Learning algorithms: • K-NN • SVM • Decision Trees • Linear Regression • More • Unsupervised Learning • Cluster analysis: similarity and distance • Association rules • Reinforcement Learning • The big data challenge of Machine Learning • CRISP-DM