SlideShare a Scribd company logo
by Ilya Kuzovkin
ilya.kuzovkin@gmail.com
Mooncascade ML Camp
2016
Machine Learning
ESSENTIAL CONCEPTS
ONE MACHINE LEARNING USE CASE
Can we ask a computer to
create those patterns
automatically?
Can we ask a computer to
create those patterns
automatically?
Yes
Can we ask a computer to
create those patterns
automatically?
Yes
How?
Raw data
Instance
Raw data
Class (label)
A data sample:
ā€œ7ā€
Instance
Raw data
Class (label)
A data sample:
ā€œ7ā€
How to represent it in a machine-readable form?
Instance
Raw data
Class (label)
A data sample:
ā€œ7ā€
How to represent it in a machine-readable form?
Feature extraction
Instance
Raw data
Class (label)
A data sample:
ā€œ7ā€
How to represent it in a machine-readable form?
Feature extraction
28px
28 px
Instance
Raw data
Class (label)
A data sample:
ā€œ7ā€
28px
28 px
784 pixels in total
Feature vector
(0, 0, 0, ā€¦, 28, 65, 128, 255, 101, 38,ā€¦ 0, 0, 0)
How to represent it in a machine-readable form?
Feature extraction
Instance
Raw data
Class (label)
A data sample:
ā€œ7ā€
28px
28 px
784 pixels in total
Feature vector
(0, 0, 0, ā€¦, 28, 65, 128, 255, 101, 38,ā€¦ 0, 0, 0)
How to represent it in a machine-readable form?
Feature extraction
(0, 0, 0, ā€¦, 28, 65, 128, 255, 101, 38,ā€¦ 0, 0, 0)
(0, 0, 0, ā€¦, 13, 48, 102, 0, 46, 255,ā€¦ 0, 0, 0)
(0, 0, 0, ā€¦, 17, 34, 12, 43, 122, 70,ā€¦ 0, 7, 0)
(0, 0, 0, ā€¦, 98, 21, 255, 255, 231, 140,ā€¦ 0, 0, 0)
ā€œ7ā€
ā€œ2ā€
ā€œ8ā€
ā€œ2ā€
Instance
Raw data
Class (label)
A data sample:
ā€œ7ā€
28px
28 px
784 pixels in total
Feature vector
(0, 0, 0, ā€¦, 28, 65, 128, 255, 101, 38,ā€¦ 0, 0, 0)
How to represent it in a machine-readable form?
Feature extraction
(0, 0, 0, ā€¦, 28, 65, 128, 255, 101, 38,ā€¦ 0, 0, 0)
(0, 0, 0, ā€¦, 13, 48, 102, 0, 46, 255,ā€¦ 0, 0, 0)
(0, 0, 0, ā€¦, 17, 34, 12, 43, 122, 70,ā€¦ 0, 7, 0)
Dataset
(0, 0, 0, ā€¦, 98, 21, 255, 255, 231, 140,ā€¦ 0, 0, 0)
ā€œ7ā€
ā€œ2ā€
ā€œ8ā€
ā€œ2ā€
The data is in the right format ā€” whatā€™s next?
The data is in the right format ā€” whatā€™s next?
ā€¢ C4.5	
ā€¢ Random	forests	
ā€¢ Bayesian	networks	
ā€¢ Hidden	Markov	models	
ā€¢ Artificial	neural	network	
ā€¢ Data	clustering	
ā€¢ Expectation-maximization	
algorithm	
ā€¢ Self-organizing	map	
ā€¢ Radial	basis	function	network	
ā€¢ Vector	Quantization	
ā€¢ Generative	topographic	map	
ā€¢ Information	bottleneck	method	
ā€¢ IBSEAD	
ā€¢ Apriori	algorithm	
ā€¢ Eclat	algorithm	
ā€¢ FP-growth	algorithm	
ā€¢ Single-linkage	clustering	
ā€¢ Conceptual	clustering	
ā€¢ K-means	algorithm	
ā€¢ Fuzzy	clustering	
ā€¢ Temporal	difference	learning	
ā€¢ Q-learning	
ā€¢ Learning	Automata
ā€¢ AODE	
ā€¢ Artificial	neural	network	
ā€¢ Backpropagation	
ā€¢ Naive	Bayes	classifier	
ā€¢ Bayesian	network	
ā€¢ Bayesian	knowledge	base	
ā€¢ Case-based	reasoning	
ā€¢ Decision	trees	
ā€¢ Inductive	logic	
programming	
ā€¢ Gaussian	process	regression	
ā€¢ Gene	expression	
programming	
ā€¢ Group	method	of	data	
handling	(GMDH)	
ā€¢ Learning	Automata	
ā€¢ Learning	Vector	
Quantization	
ā€¢ Logistic	Model	Tree	
ā€¢ Decision	tree	
ā€¢ Decision	graphs	
ā€¢ Lazy	learning	
ā€¢ Monte	Carlo	Method	
ā€¢ SARSA
ā€¢ Instance-based	learning	
ā€¢ Nearest	Neighbor	Algorithm	
ā€¢ Analogical	modeling	
ā€¢ Probably	approximately	correct	learning	
(PACL)	
ā€¢ Symbolic	machine	learning	algorithms	
ā€¢ Subsymbolic	machine	learning	algorithms	
ā€¢ Support	vector	machines	
ā€¢ Random	Forest	
ā€¢ Ensembles	of	classifiers	
ā€¢ Bootstrap	aggregating	(bagging)	
ā€¢ Boosting	(meta-algorithm)	
ā€¢ Ordinal	classification	
ā€¢ Regression	analysis	
ā€¢ Information	fuzzy	networks	(IFN)	
ā€¢ Linear	classifiers	
ā€¢ Fisher's	linear	discriminant	
ā€¢ Logistic	regression	
ā€¢ Naive	Bayes	classifier	
ā€¢ Perceptron	
ā€¢ Support	vector	machines	
ā€¢ Quadratic	classifiers	
ā€¢ k-nearest	neighbor	
ā€¢ Boosting
Pick an algorithm
The data is in the right format ā€” whatā€™s next?
ā€¢ C4.5	
ā€¢ Random	forests	
ā€¢ Bayesian	networks	
ā€¢ Hidden	Markov	models	
ā€¢ Artificial	neural	network	
ā€¢ Data	clustering	
ā€¢ Expectation-maximization	
algorithm	
ā€¢ Self-organizing	map	
ā€¢ Radial	basis	function	network	
ā€¢ Vector	Quantization	
ā€¢ Generative	topographic	map	
ā€¢ Information	bottleneck	method	
ā€¢ IBSEAD	
ā€¢ Apriori	algorithm	
ā€¢ Eclat	algorithm	
ā€¢ FP-growth	algorithm	
ā€¢ Single-linkage	clustering	
ā€¢ Conceptual	clustering	
ā€¢ K-means	algorithm	
ā€¢ Fuzzy	clustering	
ā€¢ Temporal	difference	learning	
ā€¢ Q-learning	
ā€¢ Learning	Automata
ā€¢ AODE	
ā€¢ Artificial	neural	network	
ā€¢ Backpropagation	
ā€¢ Naive	Bayes	classifier	
ā€¢ Bayesian	network	
ā€¢ Bayesian	knowledge	base	
ā€¢ Case-based	reasoning	
ā€¢ Decision	trees	
ā€¢ Inductive	logic	
programming	
ā€¢ Gaussian	process	regression	
ā€¢ Gene	expression	
programming	
ā€¢ Group	method	of	data	
handling	(GMDH)	
ā€¢ Learning	Automata	
ā€¢ Learning	Vector	
Quantization	
ā€¢ Logistic	Model	Tree	
ā€¢ Decision	tree	
ā€¢ Decision	graphs	
ā€¢ Lazy	learning	
ā€¢ Monte	Carlo	Method	
ā€¢ SARSA
ā€¢ Instance-based	learning	
ā€¢ Nearest	Neighbor	Algorithm	
ā€¢ Analogical	modeling	
ā€¢ Probably	approximately	correct	learning	
(PACL)	
ā€¢ Symbolic	machine	learning	algorithms	
ā€¢ Subsymbolic	machine	learning	algorithms	
ā€¢ Support	vector	machines	
ā€¢ Random	Forest	
ā€¢ Ensembles	of	classifiers	
ā€¢ Bootstrap	aggregating	(bagging)	
ā€¢ Boosting	(meta-algorithm)	
ā€¢ Ordinal	classification	
ā€¢ Regression	analysis	
ā€¢ Information	fuzzy	networks	(IFN)	
ā€¢ Linear	classifiers	
ā€¢ Fisher's	linear	discriminant	
ā€¢ Logistic	regression	
ā€¢ Naive	Bayes	classifier	
ā€¢ Perceptron	
ā€¢ Support	vector	machines	
ā€¢ Quadratic	classifiers	
ā€¢ k-nearest	neighbor	
ā€¢ Boosting
Pick an algorithm
DECISION TREE
vs.
DECISION TREE
vs.
(0, ā€¦, 28, 65, ā€¦, 207, 101, 0, 0)
(0, ā€¦, 19, 34, ā€¦, 254, 54, 0, 0)
(0, ā€¦, 87, 59, ā€¦, 240, 52, 4, 0)
(0, ā€¦, 87, 52, ā€¦, 240, 19, 3, 0)
(0, ā€¦, 28, 64, ā€¦, 102, 101, 0, 0)
(0, ā€¦, 19, 23, ā€¦, 105, 54, 0, 0)
(0, ā€¦, 87, 74, ā€¦, 121, 51, 7, 0)
(0, ā€¦, 87, 112, ā€¦, 239, 52, 4, 0)
DECISION TREE
vs.
(0, ā€¦, 28, 65, ā€¦, 207, 101, 0, 0)
(0, ā€¦, 19, 34, ā€¦, 254, 54, 0, 0)
(0, ā€¦, 87, 59, ā€¦, 240, 52, 4, 0)
(0, ā€¦, 87, 52, ā€¦, 240, 19, 3, 0)
(0, ā€¦, 28, 64, ā€¦, 102, 101, 0, 0)
(0, ā€¦, 19, 23, ā€¦, 105, 54, 0, 0)
(0, ā€¦, 87, 74, ā€¦, 121, 51, 7, 0)
(0, ā€¦, 87, 112, ā€¦, 239, 52, 4, 0)
PIXEL
#417
DECISION TREE
vs.
(0, ā€¦, 28, 65, ā€¦, 207, 101, 0, 0)
(0, ā€¦, 19, 34, ā€¦, 254, 54, 0, 0)
(0, ā€¦, 87, 59, ā€¦, 240, 52, 4, 0)
(0, ā€¦, 87, 52, ā€¦, 240, 19, 3, 0)
(0, ā€¦, 28, 64, ā€¦, 102, 101, 0, 0)
(0, ā€¦, 19, 23, ā€¦, 105, 54, 0, 0)
(0, ā€¦, 87, 74, ā€¦, 121, 51, 7, 0)
(0, ā€¦, 87, 112, ā€¦, 239, 52, 4, 0)
PIXEL
#417
PIXEL
#417
>200 <200
DECISION TREE
vs.
(0, ā€¦, 28, 65, ā€¦, 207, 101, 0, 0)
(0, ā€¦, 19, 34, ā€¦, 254, 54, 0, 0)
(0, ā€¦, 87, 59, ā€¦, 240, 52, 4, 0)
(0, ā€¦, 87, 52, ā€¦, 240, 19, 3, 0)
(0, ā€¦, 28, 64, ā€¦, 102, 101, 0, 0)
(0, ā€¦, 19, 23, ā€¦, 105, 54, 0, 0)
(0, ā€¦, 87, 74, ā€¦, 121, 51, 7, 0)
(0, ā€¦, 87, 112, ā€¦, 239, 52, 4, 0)
PIXEL
#417
PIXEL
#417
>200 <200
DECISION TREE
vs.
(0, ā€¦, 28, 65, ā€¦, 207, 101, 0, 0)
(0, ā€¦, 19, 34, ā€¦, 254, 54, 0, 0)
(0, ā€¦, 87, 59, ā€¦, 240, 52, 4, 0)
(0, ā€¦, 87, 52, ā€¦, 240, 19, 3, 0)
(0, ā€¦, 28, 64, ā€¦, 102, 101, 0, 0)
(0, ā€¦, 19, 23, ā€¦, 105, 54, 0, 0)
(0, ā€¦, 87, 74, ā€¦, 121, 51, 7, 0)
(0, ā€¦, 87, 112, ā€¦, 239, 52, 4, 0)
PIXEL
#417
>200 <200
DECISION TREE
vs.
(0, ā€¦, 28, 65, ā€¦, 207, 101, 0, 0)
(0, ā€¦, 19, 34, ā€¦, 254, 54, 0, 0)
(0, ā€¦, 87, 59, ā€¦, 240, 52, 4, 0)
(0, ā€¦, 87, 52, ā€¦, 240, 19, 3, 0)
(0, ā€¦, 28, 64, ā€¦, 102, 101, 0, 0)
(0, ā€¦, 19, 23, ā€¦, 105, 54, 0, 0)
(0, ā€¦, 87, 74, ā€¦, 121, 51, 7, 0)
(0, ā€¦, 87, 112, ā€¦, 239, 52, 4, 0)
PIXEL
#417
>200 <200
PIXEL
#123
DECISION TREE
vs.
(0, ā€¦, 28, 65, ā€¦, 207, 101, 0, 0)
(0, ā€¦, 19, 34, ā€¦, 254, 54, 0, 0)
(0, ā€¦, 87, 59, ā€¦, 240, 52, 4, 0)
(0, ā€¦, 87, 52, ā€¦, 240, 19, 3, 0)
(0, ā€¦, 28, 64, ā€¦, 102, 101, 0, 0)
(0, ā€¦, 19, 23, ā€¦, 105, 54, 0, 0)
(0, ā€¦, 87, 74, ā€¦, 121, 51, 7, 0)
(0, ā€¦, 87, 112, ā€¦, 239, 52, 4, 0)
PIXEL
#417
>200 <200
PIXEL
#123
<100 >100
PIXEL
#123
DECISION TREE
vs.
(0, ā€¦, 28, 65, ā€¦, 207, 101, 0, 0)
(0, ā€¦, 19, 34, ā€¦, 254, 54, 0, 0)
(0, ā€¦, 87, 59, ā€¦, 240, 52, 4, 0)
(0, ā€¦, 87, 52, ā€¦, 240, 19, 3, 0)
(0, ā€¦, 28, 64, ā€¦, 102, 101, 0, 0)
(0, ā€¦, 19, 23, ā€¦, 105, 54, 0, 0)
(0, ā€¦, 87, 74, ā€¦, 121, 51, 7, 0)
(0, ā€¦, 87, 112, ā€¦, 239, 52, 4, 0)
PIXEL
#417
>200 <200
<100 >100
PIXEL
#123
DECISION TREE
DECISION TREE
ACCURACY
ACCURACY
Confusion matrix
Trueclass
Predicted class
ACCURACY
Confusion matrix
acc =
correctly classiļ¬ed
total number of samples
Trueclass
Predicted class
ACCURACY
Confusion matrix
acc =
correctly classiļ¬ed
total number of samples
Beware of an
imbalanced dataset!
Trueclass
Predicted class
ACCURACY
Confusion matrix
acc =
correctly classiļ¬ed
total number of samples
Beware of an
imbalanced dataset!
Consider the following model:
ā€œAlways predict 2ā€
Trueclass
Predicted class
ACCURACY
Confusion matrix
acc =
correctly classiļ¬ed
total number of samples
Beware of an
imbalanced dataset!
Consider the following model:
ā€œAlways predict 2ā€
Accuracy 0.9
Trueclass
Predicted class
DECISION TREE
DECISION TREE
ā€œYou said 100%
accurate?! Every 10th
digit your system
detects is wrong!ā€
Angry client
DECISION TREE
ā€œYou said 100%
accurate?! Every 10th
digit your system
detects is wrong!ā€
Angry client
Weā€™ve trained our system on the data the client gave us. But our
system has never seen the new data the client applied it to.
And in the real life ā€” it never willā€¦
OVERFITTING
Simulate the real-life situation ā€” split the dataset
OVERFITTING
Simulate the real-life situation ā€” split the dataset
OVERFITTING
Simulate the real-life situation ā€” split the dataset
OVERFITTING
Simulate the real-life situation ā€” split the dataset
Underļ¬tting!
ā€œToo stupidā€
OK
Overļ¬tting!
ā€œToo smartā€
OVERFITTING
Underļ¬tting!
ā€œToo stupidā€
OK
Overļ¬tting!
ā€œToo smartā€
OVERFITTING
Our current decision tree has too much capacity,
it just has memorized all of the data.
Letā€™s make it less complex.
You probably did not notice, but we are overļ¬tting again :(
TEST SET
20%
TRAINING SET
60%
THE WHOLE DATASET
VALIDATION SET
20%
TEST SET
20%
TRAINING SET
60%
THE WHOLE DATASET
VALIDATION SET
20%
Fit various models
and parameter
combinations on this
subset
TEST SET
20%
TRAINING SET
60%
THE WHOLE DATASET
VALIDATION SET
20%
Fit various models
and parameter
combinations on this
subset
ā€¢ Evaluate the
models created
with different
parameters
TEST SET
20%
TRAINING SET
60%
THE WHOLE DATASET
VALIDATION SET
20%
Fit various models
and parameter
combinations on this
subset
ā€¢ Evaluate the
models created
with different
parameters
!
ā€¢ Estimate overļ¬tting
TRA
VALI
TEST SET
20%
TRAINING SET
60%
THE WHOLE DATASET
VALIDATION SET
20%
Fit various models
and parameter
combinations on this
subset
ā€¢ Evaluate the
models created
with different
parameters
!
ā€¢ Estimate overļ¬tting
TRA
VALI
TRA
VALI
TEST SET
20%
TRAINING SET
60%
THE WHOLE DATASET
VALIDATION SET
20%
Fit various models
and parameter
combinations on this
subset
ā€¢ Evaluate the
models created
with different
parameters
!
ā€¢ Estimate overļ¬tting
TRA
VALI
TRA
VALI
TRA
VALI
TEST SET
20%
TRAINING SET
60%
THE WHOLE DATASET
VALIDATION SET
20%
Fit various models
and parameter
combinations on this
subset
ā€¢ Evaluate the
models created
with different
parameters
!
ā€¢ Estimate overļ¬tting
TRA
VALI
TRA
VALI
TRA
VALI
TRA
VALI
TEST SET
20%
TRAINING SET
60%
THE WHOLE DATASET
VALIDATION SET
20%
Fit various models
and parameter
combinations on this
subset
ā€¢ Evaluate the
models created
with different
parameters
!
ā€¢ Estimate overļ¬tting
TRA
VALI
TRA
VALI
TRA
VALI
TRA
VALI
TRA
VALI
TEST SET
20%
TRAINING SET
60%
THE WHOLE DATASET
VALIDATION SET
20%
Fit various models
and parameter
combinations on this
subset
ā€¢ Evaluate the
models created
with different
parameters
!
ā€¢ Estimate overļ¬tting
Use only once to get
the ļ¬nal performance
estimate
TRA
VALI
TRA
VALI
TRA
VALI
TRA
VALI
TRA
VALI
TEST SET
20%
TRAINING SET
60%
VALIDATION SET
20%
TEST SET
20%
TRAINING SET
60%
VALIDATION SET
20%
CROSS-VALIDATION
TRAINING SET 60%
THE WHOLE DATASET
VALIDATION SET 20%
CROSS-VALIDATION
TRAINING SET 60%
THE WHOLE DATASET
VALIDATION SET 20%
What if we got too
optimistic validation set?
CROSS-VALIDATION
TRAINING SET 60%
THE WHOLE DATASET
VALIDATION SET 20%
What if we got too
optimistic validation set?
TRAINING SET 80%
CROSS-VALIDATION
TRAINING SET 60%
THE WHOLE DATASET
VALIDATION SET 20%
What if we got too
optimistic validation set?
TRAINING SET 80%
Fix the parameter value you ned to evaluate, say msl=15
CROSS-VALIDATION
TRAINING SET 60%
THE WHOLE DATASET
VALIDATION SET 20%
What if we got too
optimistic validation set?
TRAINING SET 80%
Fix the parameter value you ned to evaluate, say msl=15
TRAINING VAL
TRAINING VAL
TRAININGVAL
Repeat 10 times
CROSS-VALIDATION
TRAINING SET 60%
THE WHOLE DATASET
VALIDATION SET 20%
What if we got too
optimistic validation set?
TRAINING SET 80%
Fix the parameter value you ned to evaluate, say msl=15
TRAINING VAL
TRAINING VAL
TRAININGVAL
Repeat 10 times
}
Take average
validation score
over 10 runs ā€”
it is a more
stable estimate.
MACHINE LEARNING PIPELINE
Take raw data Extract features
Split into TRAINING
and TEST
Pick an algorithm
and parameters
Train on the
TRAINING data
Evaluate on the
TRAINING data
with CV
Train on the
whole TRAINING
Fix the best
parameters
Evaluate on TEST
Report ļ¬nal
performance to
the client
Try our different algorithms
and parameters
MACHINE LEARNING PIPELINE
Take raw data Extract features
Split into TRAINING
and TEST
Pick an algorithm
and parameters
Train on the
TRAINING data
Evaluate on the
TRAINING data
with CV
Train on the
whole TRAINING
Fix the best
parameters
Evaluate on TEST
Report ļ¬nal
performance to
the client
Try our different algorithms
and parameters
ā€œSo it is ~87%ā€¦ermā€¦
Could you do better?ā€
MACHINE LEARNING PIPELINE
Take raw data Extract features
Split into TRAINING
and TEST
Pick an algorithm
and parameters
Train on the
TRAINING data
Evaluate on the
TRAINING data
with CV
Train on the
whole TRAINING
Fix the best
parameters
Evaluate on TEST
Report ļ¬nal
performance to
the client
Try our different algorithms
and parameters
ā€œSo it is ~87%ā€¦ermā€¦
Could you do better?ā€
Yes
ā€¢ C4.5	
ā€¢ Random	forests	
ā€¢ Bayesian	networks	
ā€¢ Hidden	Markov	models	
ā€¢ Artificial	neural	network	
ā€¢ Data	clustering	
ā€¢ Expectation-maximization	
algorithm	
ā€¢ Self-organizing	map	
ā€¢ Radial	basis	function	network	
ā€¢ Vector	Quantization	
ā€¢ Generative	topographic	map	
ā€¢ Information	bottleneck	method	
ā€¢ IBSEAD	
ā€¢ Apriori	algorithm	
ā€¢ Eclat	algorithm	
ā€¢ FP-growth	algorithm	
ā€¢ Single-linkage	clustering	
ā€¢ Conceptual	clustering	
ā€¢ K-means	algorithm	
ā€¢ Fuzzy	clustering	
ā€¢ Temporal	difference	learning	
ā€¢ Q-learning	
ā€¢ Learning	Automata
ā€¢ AODE	
ā€¢ Artificial	neural	network	
ā€¢ Backpropagation	
ā€¢ Naive	Bayes	classifier	
ā€¢ Bayesian	network	
ā€¢ Bayesian	knowledge	base	
ā€¢ Case-based	reasoning	
ā€¢ Decision	trees	
ā€¢ Inductive	logic	
programming	
ā€¢ Gaussian	process	regression	
ā€¢ Gene	expression	
programming	
ā€¢ Group	method	of	data	
handling	(GMDH)	
ā€¢ Learning	Automata	
ā€¢ Learning	Vector	
Quantization	
ā€¢ Logistic	Model	Tree	
ā€¢ Decision	tree	
ā€¢ Decision	graphs	
ā€¢ Lazy	learning	
ā€¢ Monte	Carlo	Method	
ā€¢ SARSA
ā€¢ Instance-based	learning	
ā€¢ Nearest	Neighbor	Algorithm	
ā€¢ Analogical	modeling	
ā€¢ Probably	approximately	correct	learning	
(PACL)	
ā€¢ Symbolic	machine	learning	algorithms	
ā€¢ Subsymbolic	machine	learning	algorithms	
ā€¢ Support	vector	machines	
ā€¢ Random	Forest	
ā€¢ Ensembles	of	classifiers	
ā€¢ Bootstrap	aggregating	(bagging)	
ā€¢ Boosting	(meta-algorithm)	
ā€¢ Ordinal	classification	
ā€¢ Regression	analysis	
ā€¢ Information	fuzzy	networks	(IFN)	
ā€¢ Linear	classifiers	
ā€¢ Fisher's	linear	discriminant	
ā€¢ Logistic	regression	
ā€¢ Naive	Bayes	classifier	
ā€¢ Perceptron	
ā€¢ Support	vector	machines	
ā€¢ Quadratic	classifiers	
ā€¢ k-nearest	neighbor	
ā€¢ Boosting
Pick another algorithm
ā€¢ C4.5	
ā€¢ Random	forests	
ā€¢ Bayesian	networks	
ā€¢ Hidden	Markov	models	
ā€¢ Artificial	neural	network	
ā€¢ Data	clustering	
ā€¢ Expectation-maximization	
algorithm	
ā€¢ Self-organizing	map	
ā€¢ Radial	basis	function	network	
ā€¢ Vector	Quantization	
ā€¢ Generative	topographic	map	
ā€¢ Information	bottleneck	method	
ā€¢ IBSEAD	
ā€¢ Apriori	algorithm	
ā€¢ Eclat	algorithm	
ā€¢ FP-growth	algorithm	
ā€¢ Single-linkage	clustering	
ā€¢ Conceptual	clustering	
ā€¢ K-means	algorithm	
ā€¢ Fuzzy	clustering	
ā€¢ Temporal	difference	learning	
ā€¢ Q-learning	
ā€¢ Learning	Automata
ā€¢ AODE	
ā€¢ Artificial	neural	network	
ā€¢ Backpropagation	
ā€¢ Naive	Bayes	classifier	
ā€¢ Bayesian	network	
ā€¢ Bayesian	knowledge	base	
ā€¢ Case-based	reasoning	
ā€¢ Decision	trees	
ā€¢ Inductive	logic	
programming	
ā€¢ Gaussian	process	regression	
ā€¢ Gene	expression	
programming	
ā€¢ Group	method	of	data	
handling	(GMDH)	
ā€¢ Learning	Automata	
ā€¢ Learning	Vector	
Quantization	
ā€¢ Logistic	Model	Tree	
ā€¢ Decision	tree	
ā€¢ Decision	graphs	
ā€¢ Lazy	learning	
ā€¢ Monte	Carlo	Method	
ā€¢ SARSA
ā€¢ Instance-based	learning	
ā€¢ Nearest	Neighbor	Algorithm	
ā€¢ Analogical	modeling	
ā€¢ Probably	approximately	correct	learning	
(PACL)	
ā€¢ Symbolic	machine	learning	algorithms	
ā€¢ Subsymbolic	machine	learning	algorithms	
ā€¢ Support	vector	machines	
ā€¢ Random	Forest	
ā€¢ Ensembles	of	classifiers	
ā€¢ Bootstrap	aggregating	(bagging)	
ā€¢ Boosting	(meta-algorithm)	
ā€¢ Ordinal	classification	
ā€¢ Regression	analysis	
ā€¢ Information	fuzzy	networks	(IFN)	
ā€¢ Linear	classifiers	
ā€¢ Fisher's	linear	discriminant	
ā€¢ Logistic	regression	
ā€¢ Naive	Bayes	classifier	
ā€¢ Perceptron	
ā€¢ Support	vector	machines	
ā€¢ Quadratic	classifiers	
ā€¢ k-nearest	neighbor	
ā€¢ Boosting
Pick another algorithm
RANDOM FOREST
RANDOM FOREST
Decision tree:
pick best out of all features
RANDOM FOREST
Decision tree:
pick best out of all features
Random forest:
pick best out of random
subset of features
RANDOM FOREST
RANDOM FOREST
pick best out of another
random subset of features
RANDOM FOREST
pick best out of another
random subset of features pick best out of yet another
random subset of features
RANDOM FOREST
RANDOM FOREST
RANDOM FOREST
class
instance
RANDOM FOREST
class
instance
RANDOM FOREST
class
instance
RANDOM FOREST
class
instance
Happy client
ALL OTHER USE CASES
Sound
Frequency
components
Genre
Bag of
words
Topic
Text
Pixel
values
Image
Cat or
dog
Video
Frame
pixels
Walking
or running
Database records Biometric data
Census
data
Average
salary
ā€¦
Dead or
alive
HANDS-ON SESSION
http://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestClassiļ¬er.html
Introduction to Machine Learning @ Mooncascade ML Camp

More Related Content

Viewers also liked

Mastering the game of Go with deep neural networks and tree search (article o...
Mastering the game of Go with deep neural networks and tree search (article o...Mastering the game of Go with deep neural networks and tree search (article o...
Mastering the game of Go with deep neural networks and tree search (article o...
Ilya Kuzovkin
Ā 
#48 Machine learning
#48 Machine learning#48 Machine learning
#48 Machine learning
Meridian Impactive Digital
Ā 
Machine Learning for Understanding and Managing Ecosystems
Machine Learning for Understanding and Managing EcosystemsMachine Learning for Understanding and Managing Ecosystems
Machine Learning for Understanding and Managing Ecosystems
diannepatricia
Ā 
Demystifying Machine Learning - How to give your business superpowers.
Demystifying Machine Learning - How to give your business superpowers.Demystifying Machine Learning - How to give your business superpowers.
Demystifying Machine Learning - How to give your business superpowers.
10x Nation
Ā 
Actividad 02
Actividad 02Actividad 02
Actividad 02
Alejandra Perez
Ā 
Machine Learning and Data Mining: 03 Data Representation
Machine Learning and Data Mining: 03 Data RepresentationMachine Learning and Data Mining: 03 Data Representation
Machine Learning and Data Mining: 03 Data Representation
Pier Luca Lanzi
Ā 
A Beginner's Guide to Machine Learning with Scikit-Learn
A Beginner's Guide to Machine Learning with Scikit-LearnA Beginner's Guide to Machine Learning with Scikit-Learn
A Beginner's Guide to Machine Learning with Scikit-Learn
Sarah Guido
Ā 
Machine learning the next revolution or just another hype
Machine learning   the next revolution or just another hypeMachine learning   the next revolution or just another hype
Machine learning the next revolution or just another hype
Jorge Ferrer
Ā 
Š’Š²ŠµŠ“ŠµŠ½ŠøŠµ Š² Š°Ń€Ń…ŠøтŠµŠŗтуры Š½ŠµŠ¹Ń€Š¾Š½Š½Ń‹Ń… сŠµŃ‚ŠµŠ¹ / HighLoad++ 2016
Š’Š²ŠµŠ“ŠµŠ½ŠøŠµ Š² Š°Ń€Ń…ŠøтŠµŠŗтуры Š½ŠµŠ¹Ń€Š¾Š½Š½Ń‹Ń… сŠµŃ‚ŠµŠ¹ / HighLoad++ 2016Š’Š²ŠµŠ“ŠµŠ½ŠøŠµ Š² Š°Ń€Ń…ŠøтŠµŠŗтуры Š½ŠµŠ¹Ń€Š¾Š½Š½Ń‹Ń… сŠµŃ‚ŠµŠ¹ / HighLoad++ 2016
Š’Š²ŠµŠ“ŠµŠ½ŠøŠµ Š² Š°Ń€Ń…ŠøтŠµŠŗтуры Š½ŠµŠ¹Ń€Š¾Š½Š½Ń‹Ń… сŠµŃ‚ŠµŠ¹ / HighLoad++ 2016
Grigory Sapunov
Ā 
A Nontechnical Introduction to Machine Learning
A Nontechnical Introduction to Machine LearningA Nontechnical Introduction to Machine Learning
A Nontechnical Introduction to Machine Learning
Sam Elshamy
Ā 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
Raveen Perera
Ā 
Machine Learning - Challenges, Learnings & Opportunities
Machine Learning - Challenges, Learnings & OpportunitiesMachine Learning - Challenges, Learnings & Opportunities
Machine Learning - Challenges, Learnings & Opportunities
CodePolitan
Ā 
Machine Learning in Pathology Diagnostics with Simagis Live
Machine Learning in Pathology Diagnostics with Simagis LiveMachine Learning in Pathology Diagnostics with Simagis Live
Machine Learning in Pathology Diagnostics with Simagis Live
khvatkov
Ā 
A brief history of machine learning
A brief history of  machine learningA brief history of  machine learning
A brief history of machine learning
Robert Colner
Ā 
Neural Turing Machines
Neural Turing MachinesNeural Turing Machines
Neural Turing Machines
Ilya Kuzovkin
Ā 
Machine Learning and Search -State of Search 2016
Machine Learning and Search -State of Search 2016 Machine Learning and Search -State of Search 2016
Machine Learning and Search -State of Search 2016
Eric Enge
Ā 
MLaaS - Machine Learning as a Service
MLaaS - Machine Learning as a ServiceMLaaS - Machine Learning as a Service
MLaaS - Machine Learning as a Service
Karl Seiler
Ā 
Focus Junior - 14 Maggio 2016
Focus Junior - 14 Maggio 2016Focus Junior - 14 Maggio 2016
Focus Junior - 14 Maggio 2016
Pier Luca Lanzi
Ā 
Natural Language Processing with Python
Natural Language Processing with PythonNatural Language Processing with Python
Natural Language Processing with Python
Benjamin Bengfort
Ā 
Introduction Machine Learning by MyLittleAdventure
Introduction Machine Learning by MyLittleAdventureIntroduction Machine Learning by MyLittleAdventure
Introduction Machine Learning by MyLittleAdventure
mylittleadventure
Ā 

Viewers also liked (20)

Mastering the game of Go with deep neural networks and tree search (article o...
Mastering the game of Go with deep neural networks and tree search (article o...Mastering the game of Go with deep neural networks and tree search (article o...
Mastering the game of Go with deep neural networks and tree search (article o...
Ā 
#48 Machine learning
#48 Machine learning#48 Machine learning
#48 Machine learning
Ā 
Machine Learning for Understanding and Managing Ecosystems
Machine Learning for Understanding and Managing EcosystemsMachine Learning for Understanding and Managing Ecosystems
Machine Learning for Understanding and Managing Ecosystems
Ā 
Demystifying Machine Learning - How to give your business superpowers.
Demystifying Machine Learning - How to give your business superpowers.Demystifying Machine Learning - How to give your business superpowers.
Demystifying Machine Learning - How to give your business superpowers.
Ā 
Actividad 02
Actividad 02Actividad 02
Actividad 02
Ā 
Machine Learning and Data Mining: 03 Data Representation
Machine Learning and Data Mining: 03 Data RepresentationMachine Learning and Data Mining: 03 Data Representation
Machine Learning and Data Mining: 03 Data Representation
Ā 
A Beginner's Guide to Machine Learning with Scikit-Learn
A Beginner's Guide to Machine Learning with Scikit-LearnA Beginner's Guide to Machine Learning with Scikit-Learn
A Beginner's Guide to Machine Learning with Scikit-Learn
Ā 
Machine learning the next revolution or just another hype
Machine learning   the next revolution or just another hypeMachine learning   the next revolution or just another hype
Machine learning the next revolution or just another hype
Ā 
Š’Š²ŠµŠ“ŠµŠ½ŠøŠµ Š² Š°Ń€Ń…ŠøтŠµŠŗтуры Š½ŠµŠ¹Ń€Š¾Š½Š½Ń‹Ń… сŠµŃ‚ŠµŠ¹ / HighLoad++ 2016
Š’Š²ŠµŠ“ŠµŠ½ŠøŠµ Š² Š°Ń€Ń…ŠøтŠµŠŗтуры Š½ŠµŠ¹Ń€Š¾Š½Š½Ń‹Ń… сŠµŃ‚ŠµŠ¹ / HighLoad++ 2016Š’Š²ŠµŠ“ŠµŠ½ŠøŠµ Š² Š°Ń€Ń…ŠøтŠµŠŗтуры Š½ŠµŠ¹Ń€Š¾Š½Š½Ń‹Ń… сŠµŃ‚ŠµŠ¹ / HighLoad++ 2016
Š’Š²ŠµŠ“ŠµŠ½ŠøŠµ Š² Š°Ń€Ń…ŠøтŠµŠŗтуры Š½ŠµŠ¹Ń€Š¾Š½Š½Ń‹Ń… сŠµŃ‚ŠµŠ¹ / HighLoad++ 2016
Ā 
A Nontechnical Introduction to Machine Learning
A Nontechnical Introduction to Machine LearningA Nontechnical Introduction to Machine Learning
A Nontechnical Introduction to Machine Learning
Ā 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
Ā 
Machine Learning - Challenges, Learnings & Opportunities
Machine Learning - Challenges, Learnings & OpportunitiesMachine Learning - Challenges, Learnings & Opportunities
Machine Learning - Challenges, Learnings & Opportunities
Ā 
Machine Learning in Pathology Diagnostics with Simagis Live
Machine Learning in Pathology Diagnostics with Simagis LiveMachine Learning in Pathology Diagnostics with Simagis Live
Machine Learning in Pathology Diagnostics with Simagis Live
Ā 
A brief history of machine learning
A brief history of  machine learningA brief history of  machine learning
A brief history of machine learning
Ā 
Neural Turing Machines
Neural Turing MachinesNeural Turing Machines
Neural Turing Machines
Ā 
Machine Learning and Search -State of Search 2016
Machine Learning and Search -State of Search 2016 Machine Learning and Search -State of Search 2016
Machine Learning and Search -State of Search 2016
Ā 
MLaaS - Machine Learning as a Service
MLaaS - Machine Learning as a ServiceMLaaS - Machine Learning as a Service
MLaaS - Machine Learning as a Service
Ā 
Focus Junior - 14 Maggio 2016
Focus Junior - 14 Maggio 2016Focus Junior - 14 Maggio 2016
Focus Junior - 14 Maggio 2016
Ā 
Natural Language Processing with Python
Natural Language Processing with PythonNatural Language Processing with Python
Natural Language Processing with Python
Ā 
Introduction Machine Learning by MyLittleAdventure
Introduction Machine Learning by MyLittleAdventureIntroduction Machine Learning by MyLittleAdventure
Introduction Machine Learning by MyLittleAdventure
Ā 

Similar to Introduction to Machine Learning @ Mooncascade ML Camp

Research overview Oct. 2018
Research overview Oct. 2018Research overview Oct. 2018
Research overview Oct. 2018
XavierDevroey
Ā 
Lorentz workshop - 2018
Lorentz workshop - 2018Lorentz workshop - 2018
Lorentz workshop - 2018
XavierDevroey
Ā 
Machine learning for_finance
Machine learning for_financeMachine learning for_finance
Machine learning for_finance
Stefan Duprey
Ā 
Machine Learning : why we should know and how it works
Machine Learning : why we should know and how it worksMachine Learning : why we should know and how it works
Machine Learning : why we should know and how it works
Kevin Lee
Ā 
Introducing Reactive Machine Learning
Introducing Reactive Machine LearningIntroducing Reactive Machine Learning
Introducing Reactive Machine Learning
Jeff Smith
Ā 
Detecting Misleading Headlines in Online News: Hands-on Experiences on Attent...
Detecting Misleading Headlines in Online News: Hands-on Experiences on Attent...Detecting Misleading Headlines in Online News: Hands-on Experiences on Attent...
Detecting Misleading Headlines in Online News: Hands-on Experiences on Attent...
Kunwoo Park
Ā 
Bigger Data v Better Math
Bigger Data v Better MathBigger Data v Better Math
Bigger Data v Better Math
Brent Schneeman
Ā 
Workshop - Introduction to Machine Learning with R
Workshop - Introduction to Machine Learning with RWorkshop - Introduction to Machine Learning with R
Workshop - Introduction to Machine Learning with R
Shirin Elsinghorst
Ā 
Machine Learning in a Flash (Extended Edition 2): An Introduction to Neural N...
Machine Learning in a Flash (Extended Edition 2): An Introduction to Neural N...Machine Learning in a Flash (Extended Edition 2): An Introduction to Neural N...
Machine Learning in a Flash (Extended Edition 2): An Introduction to Neural N...
Kory Becker
Ā 
Ltc completed slides
Ltc completed slidesLtc completed slides
Ltc completed slidesRoseline Antai
Ā 
GANS Project for Image idetification.pdf
GANS Project for Image idetification.pdfGANS Project for Image idetification.pdf
GANS Project for Image idetification.pdf
VivekanandaGN1
Ā 
An introduction to Deep Learning with Apache MXNet (November 2017)
An introduction to Deep Learning with Apache MXNet (November 2017)An introduction to Deep Learning with Apache MXNet (November 2017)
An introduction to Deep Learning with Apache MXNet (November 2017)
Julien SIMON
Ā 
Deep learning
Deep learningDeep learning
Deep learning
Alessio Palma
Ā 
Comparing Machine Learning Algorithms in Text Mining
Comparing Machine Learning Algorithms in Text MiningComparing Machine Learning Algorithms in Text Mining
Comparing Machine Learning Algorithms in Text Mining
Andrea Gigli
Ā 
Numpy Talk at SIAM
Numpy Talk at SIAMNumpy Talk at SIAM
Numpy Talk at SIAM
Enthought, Inc.
Ā 
4. Classification.pdf
4. Classification.pdf4. Classification.pdf
4. Classification.pdf
Jyoti Yadav
Ā 
TAO Fayan_Report on Top 10 data mining algorithms applications with R
TAO Fayan_Report on Top 10 data mining algorithms applications with RTAO Fayan_Report on Top 10 data mining algorithms applications with R
TAO Fayan_Report on Top 10 data mining algorithms applications with RFayan TAO
Ā 
visualisasi data praktik pakai excel, py
visualisasi data praktik pakai excel, pyvisualisasi data praktik pakai excel, py
visualisasi data praktik pakai excel, py
ElmaLyrics
Ā 
Barga Data Science lecture 7
Barga Data Science lecture 7Barga Data Science lecture 7
Barga Data Science lecture 7
Roger Barga
Ā 
20181106 arie van_deursen_testday2018
20181106 arie van_deursen_testday201820181106 arie van_deursen_testday2018
20181106 arie van_deursen_testday2018
STAMP Project
Ā 

Similar to Introduction to Machine Learning @ Mooncascade ML Camp (20)

Research overview Oct. 2018
Research overview Oct. 2018Research overview Oct. 2018
Research overview Oct. 2018
Ā 
Lorentz workshop - 2018
Lorentz workshop - 2018Lorentz workshop - 2018
Lorentz workshop - 2018
Ā 
Machine learning for_finance
Machine learning for_financeMachine learning for_finance
Machine learning for_finance
Ā 
Machine Learning : why we should know and how it works
Machine Learning : why we should know and how it worksMachine Learning : why we should know and how it works
Machine Learning : why we should know and how it works
Ā 
Introducing Reactive Machine Learning
Introducing Reactive Machine LearningIntroducing Reactive Machine Learning
Introducing Reactive Machine Learning
Ā 
Detecting Misleading Headlines in Online News: Hands-on Experiences on Attent...
Detecting Misleading Headlines in Online News: Hands-on Experiences on Attent...Detecting Misleading Headlines in Online News: Hands-on Experiences on Attent...
Detecting Misleading Headlines in Online News: Hands-on Experiences on Attent...
Ā 
Bigger Data v Better Math
Bigger Data v Better MathBigger Data v Better Math
Bigger Data v Better Math
Ā 
Workshop - Introduction to Machine Learning with R
Workshop - Introduction to Machine Learning with RWorkshop - Introduction to Machine Learning with R
Workshop - Introduction to Machine Learning with R
Ā 
Machine Learning in a Flash (Extended Edition 2): An Introduction to Neural N...
Machine Learning in a Flash (Extended Edition 2): An Introduction to Neural N...Machine Learning in a Flash (Extended Edition 2): An Introduction to Neural N...
Machine Learning in a Flash (Extended Edition 2): An Introduction to Neural N...
Ā 
Ltc completed slides
Ltc completed slidesLtc completed slides
Ltc completed slides
Ā 
GANS Project for Image idetification.pdf
GANS Project for Image idetification.pdfGANS Project for Image idetification.pdf
GANS Project for Image idetification.pdf
Ā 
An introduction to Deep Learning with Apache MXNet (November 2017)
An introduction to Deep Learning with Apache MXNet (November 2017)An introduction to Deep Learning with Apache MXNet (November 2017)
An introduction to Deep Learning with Apache MXNet (November 2017)
Ā 
Deep learning
Deep learningDeep learning
Deep learning
Ā 
Comparing Machine Learning Algorithms in Text Mining
Comparing Machine Learning Algorithms in Text MiningComparing Machine Learning Algorithms in Text Mining
Comparing Machine Learning Algorithms in Text Mining
Ā 
Numpy Talk at SIAM
Numpy Talk at SIAMNumpy Talk at SIAM
Numpy Talk at SIAM
Ā 
4. Classification.pdf
4. Classification.pdf4. Classification.pdf
4. Classification.pdf
Ā 
TAO Fayan_Report on Top 10 data mining algorithms applications with R
TAO Fayan_Report on Top 10 data mining algorithms applications with RTAO Fayan_Report on Top 10 data mining algorithms applications with R
TAO Fayan_Report on Top 10 data mining algorithms applications with R
Ā 
visualisasi data praktik pakai excel, py
visualisasi data praktik pakai excel, pyvisualisasi data praktik pakai excel, py
visualisasi data praktik pakai excel, py
Ā 
Barga Data Science lecture 7
Barga Data Science lecture 7Barga Data Science lecture 7
Barga Data Science lecture 7
Ā 
20181106 arie van_deursen_testday2018
20181106 arie van_deursen_testday201820181106 arie van_deursen_testday2018
20181106 arie van_deursen_testday2018
Ā 

More from Ilya Kuzovkin

Understanding Information Processing in Human Brain by Interpreting Machine L...
Understanding Information Processing in Human Brain by Interpreting Machine L...Understanding Information Processing in Human Brain by Interpreting Machine L...
Understanding Information Processing in Human Brain by Interpreting Machine L...
Ilya Kuzovkin
Ā 
The Brain and the Modern AI: Drastic Differences and Curious Similarities
The Brain and the Modern AI: Drastic Differences and Curious SimilaritiesThe Brain and the Modern AI: Drastic Differences and Curious Similarities
The Brain and the Modern AI: Drastic Differences and Curious Similarities
Ilya Kuzovkin
Ā 
The First Day at the Deep learning Zoo
The First Day at the Deep learning ZooThe First Day at the Deep learning Zoo
The First Day at the Deep learning Zoo
Ilya Kuzovkin
Ā 
Intuitive Intro to Gƶdel's Incompleteness Theorem
Intuitive Intro to Gƶdel's Incompleteness TheoremIntuitive Intro to Gƶdel's Incompleteness Theorem
Intuitive Intro to Gƶdel's Incompleteness Theorem
Ilya Kuzovkin
Ā 
Paper overview: "Deep Residual Learning for Image Recognition"
Paper overview: "Deep Residual Learning for Image Recognition"Paper overview: "Deep Residual Learning for Image Recognition"
Paper overview: "Deep Residual Learning for Image Recognition"
Ilya Kuzovkin
Ā 
Deep Learning: Theory, History, State of the Art & Practical Tools
Deep Learning: Theory, History, State of the Art & Practical ToolsDeep Learning: Theory, History, State of the Art & Practical Tools
Deep Learning: Theory, History, State of the Art & Practical Tools
Ilya Kuzovkin
Ā 
Article overview: Unsupervised Learning of Visual Structure Using Predictive ...
Article overview: Unsupervised Learning of Visual Structure Using Predictive ...Article overview: Unsupervised Learning of Visual Structure Using Predictive ...
Article overview: Unsupervised Learning of Visual Structure Using Predictive ...
Ilya Kuzovkin
Ā 
Article overview: Deep Neural Networks Reveal a Gradient in the Complexity of...
Article overview: Deep Neural Networks Reveal a Gradient in the Complexity of...Article overview: Deep Neural Networks Reveal a Gradient in the Complexity of...
Article overview: Deep Neural Networks Reveal a Gradient in the Complexity of...
Ilya Kuzovkin
Ā 
NIPS2014 Article Overview: Do Deep Nets Really Need to be Deep?
NIPS2014 Article Overview: Do Deep Nets Really Need to be Deep?NIPS2014 Article Overview: Do Deep Nets Really Need to be Deep?
NIPS2014 Article Overview: Do Deep Nets Really Need to be Deep?
Ilya Kuzovkin
Ā 
Neuroimaging: Intracortical, fMRI, EEG
Neuroimaging: Intracortical, fMRI, EEGNeuroimaging: Intracortical, fMRI, EEG
Neuroimaging: Intracortical, fMRI, EEG
Ilya Kuzovkin
Ā 
Article Overview "Reach and grasp by people with tetraplegia using a neurally...
Article Overview "Reach and grasp by people with tetraplegia using a neurally...Article Overview "Reach and grasp by people with tetraplegia using a neurally...
Article Overview "Reach and grasp by people with tetraplegia using a neurally...
Ilya Kuzovkin
Ā 
Introduction to Computing on GPU
Introduction to Computing on GPUIntroduction to Computing on GPU
Introduction to Computing on GPU
Ilya Kuzovkin
Ā 
Soft Introduction to Brain-Computer Interfaces and Machine Learning
Soft Introduction to Brain-Computer Interfaces and Machine LearningSoft Introduction to Brain-Computer Interfaces and Machine Learning
Soft Introduction to Brain-Computer Interfaces and Machine Learning
Ilya Kuzovkin
Ā 
Ilya Kuzovkin - Adaptive Interactive Learning for Brain-Computer Interfaces
Ilya Kuzovkin - Adaptive Interactive Learning for Brain-Computer InterfacesIlya Kuzovkin - Adaptive Interactive Learning for Brain-Computer Interfaces
Ilya Kuzovkin - Adaptive Interactive Learning for Brain-Computer Interfaces
Ilya Kuzovkin
Ā 

More from Ilya Kuzovkin (14)

Understanding Information Processing in Human Brain by Interpreting Machine L...
Understanding Information Processing in Human Brain by Interpreting Machine L...Understanding Information Processing in Human Brain by Interpreting Machine L...
Understanding Information Processing in Human Brain by Interpreting Machine L...
Ā 
The Brain and the Modern AI: Drastic Differences and Curious Similarities
The Brain and the Modern AI: Drastic Differences and Curious SimilaritiesThe Brain and the Modern AI: Drastic Differences and Curious Similarities
The Brain and the Modern AI: Drastic Differences and Curious Similarities
Ā 
The First Day at the Deep learning Zoo
The First Day at the Deep learning ZooThe First Day at the Deep learning Zoo
The First Day at the Deep learning Zoo
Ā 
Intuitive Intro to Gƶdel's Incompleteness Theorem
Intuitive Intro to Gƶdel's Incompleteness TheoremIntuitive Intro to Gƶdel's Incompleteness Theorem
Intuitive Intro to Gƶdel's Incompleteness Theorem
Ā 
Paper overview: "Deep Residual Learning for Image Recognition"
Paper overview: "Deep Residual Learning for Image Recognition"Paper overview: "Deep Residual Learning for Image Recognition"
Paper overview: "Deep Residual Learning for Image Recognition"
Ā 
Deep Learning: Theory, History, State of the Art & Practical Tools
Deep Learning: Theory, History, State of the Art & Practical ToolsDeep Learning: Theory, History, State of the Art & Practical Tools
Deep Learning: Theory, History, State of the Art & Practical Tools
Ā 
Article overview: Unsupervised Learning of Visual Structure Using Predictive ...
Article overview: Unsupervised Learning of Visual Structure Using Predictive ...Article overview: Unsupervised Learning of Visual Structure Using Predictive ...
Article overview: Unsupervised Learning of Visual Structure Using Predictive ...
Ā 
Article overview: Deep Neural Networks Reveal a Gradient in the Complexity of...
Article overview: Deep Neural Networks Reveal a Gradient in the Complexity of...Article overview: Deep Neural Networks Reveal a Gradient in the Complexity of...
Article overview: Deep Neural Networks Reveal a Gradient in the Complexity of...
Ā 
NIPS2014 Article Overview: Do Deep Nets Really Need to be Deep?
NIPS2014 Article Overview: Do Deep Nets Really Need to be Deep?NIPS2014 Article Overview: Do Deep Nets Really Need to be Deep?
NIPS2014 Article Overview: Do Deep Nets Really Need to be Deep?
Ā 
Neuroimaging: Intracortical, fMRI, EEG
Neuroimaging: Intracortical, fMRI, EEGNeuroimaging: Intracortical, fMRI, EEG
Neuroimaging: Intracortical, fMRI, EEG
Ā 
Article Overview "Reach and grasp by people with tetraplegia using a neurally...
Article Overview "Reach and grasp by people with tetraplegia using a neurally...Article Overview "Reach and grasp by people with tetraplegia using a neurally...
Article Overview "Reach and grasp by people with tetraplegia using a neurally...
Ā 
Introduction to Computing on GPU
Introduction to Computing on GPUIntroduction to Computing on GPU
Introduction to Computing on GPU
Ā 
Soft Introduction to Brain-Computer Interfaces and Machine Learning
Soft Introduction to Brain-Computer Interfaces and Machine LearningSoft Introduction to Brain-Computer Interfaces and Machine Learning
Soft Introduction to Brain-Computer Interfaces and Machine Learning
Ā 
Ilya Kuzovkin - Adaptive Interactive Learning for Brain-Computer Interfaces
Ilya Kuzovkin - Adaptive Interactive Learning for Brain-Computer InterfacesIlya Kuzovkin - Adaptive Interactive Learning for Brain-Computer Interfaces
Ilya Kuzovkin - Adaptive Interactive Learning for Brain-Computer Interfaces
Ā 

Recently uploaded

UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
Ā 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
Alison B. Lowndes
Ā 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
Product School
Ā 
Dev Dives: Train smarter, not harder ā€“ active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder ā€“ active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder ā€“ active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder ā€“ active learning and UiPath LLMs for do...
UiPathCommunity
Ā 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
Ā 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
Ā 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
Ā 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
Ā 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
Ā 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
Ā 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
Ā 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Tobias Schneck
Ā 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
Ā 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
Ā 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Thierry Lestable
Ā 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
Ā 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
Ā 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
DianaGray10
Ā 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
Ā 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
Ā 

Recently uploaded (20)

UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
Ā 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
Ā 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
Ā 
Dev Dives: Train smarter, not harder ā€“ active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder ā€“ active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder ā€“ active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder ā€“ active learning and UiPath LLMs for do...
Ā 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
Ā 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Ā 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Ā 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Ā 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Ā 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
Ā 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Ā 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Ā 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Ā 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ā 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Ā 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Ā 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Ā 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
Ā 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
Ā 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Ā 

Introduction to Machine Learning @ Mooncascade ML Camp

  • 1. by Ilya Kuzovkin ilya.kuzovkin@gmail.com Mooncascade ML Camp 2016 Machine Learning ESSENTIAL CONCEPTS
  • 3.
  • 4.
  • 5.
  • 6.
  • 7.
  • 8.
  • 9.
  • 10. Can we ask a computer to create those patterns automatically?
  • 11. Can we ask a computer to create those patterns automatically? Yes
  • 12. Can we ask a computer to create those patterns automatically? Yes How?
  • 14. Instance Raw data Class (label) A data sample: ā€œ7ā€
  • 15. Instance Raw data Class (label) A data sample: ā€œ7ā€ How to represent it in a machine-readable form?
  • 16. Instance Raw data Class (label) A data sample: ā€œ7ā€ How to represent it in a machine-readable form? Feature extraction
  • 17. Instance Raw data Class (label) A data sample: ā€œ7ā€ How to represent it in a machine-readable form? Feature extraction 28px 28 px
  • 18. Instance Raw data Class (label) A data sample: ā€œ7ā€ 28px 28 px 784 pixels in total Feature vector (0, 0, 0, ā€¦, 28, 65, 128, 255, 101, 38,ā€¦ 0, 0, 0) How to represent it in a machine-readable form? Feature extraction
  • 19. Instance Raw data Class (label) A data sample: ā€œ7ā€ 28px 28 px 784 pixels in total Feature vector (0, 0, 0, ā€¦, 28, 65, 128, 255, 101, 38,ā€¦ 0, 0, 0) How to represent it in a machine-readable form? Feature extraction (0, 0, 0, ā€¦, 28, 65, 128, 255, 101, 38,ā€¦ 0, 0, 0) (0, 0, 0, ā€¦, 13, 48, 102, 0, 46, 255,ā€¦ 0, 0, 0) (0, 0, 0, ā€¦, 17, 34, 12, 43, 122, 70,ā€¦ 0, 7, 0) (0, 0, 0, ā€¦, 98, 21, 255, 255, 231, 140,ā€¦ 0, 0, 0) ā€œ7ā€ ā€œ2ā€ ā€œ8ā€ ā€œ2ā€
  • 20. Instance Raw data Class (label) A data sample: ā€œ7ā€ 28px 28 px 784 pixels in total Feature vector (0, 0, 0, ā€¦, 28, 65, 128, 255, 101, 38,ā€¦ 0, 0, 0) How to represent it in a machine-readable form? Feature extraction (0, 0, 0, ā€¦, 28, 65, 128, 255, 101, 38,ā€¦ 0, 0, 0) (0, 0, 0, ā€¦, 13, 48, 102, 0, 46, 255,ā€¦ 0, 0, 0) (0, 0, 0, ā€¦, 17, 34, 12, 43, 122, 70,ā€¦ 0, 7, 0) Dataset (0, 0, 0, ā€¦, 98, 21, 255, 255, 231, 140,ā€¦ 0, 0, 0) ā€œ7ā€ ā€œ2ā€ ā€œ8ā€ ā€œ2ā€
  • 21. The data is in the right format ā€” whatā€™s next?
  • 22. The data is in the right format ā€” whatā€™s next? ā€¢ C4.5 ā€¢ Random forests ā€¢ Bayesian networks ā€¢ Hidden Markov models ā€¢ Artificial neural network ā€¢ Data clustering ā€¢ Expectation-maximization algorithm ā€¢ Self-organizing map ā€¢ Radial basis function network ā€¢ Vector Quantization ā€¢ Generative topographic map ā€¢ Information bottleneck method ā€¢ IBSEAD ā€¢ Apriori algorithm ā€¢ Eclat algorithm ā€¢ FP-growth algorithm ā€¢ Single-linkage clustering ā€¢ Conceptual clustering ā€¢ K-means algorithm ā€¢ Fuzzy clustering ā€¢ Temporal difference learning ā€¢ Q-learning ā€¢ Learning Automata ā€¢ AODE ā€¢ Artificial neural network ā€¢ Backpropagation ā€¢ Naive Bayes classifier ā€¢ Bayesian network ā€¢ Bayesian knowledge base ā€¢ Case-based reasoning ā€¢ Decision trees ā€¢ Inductive logic programming ā€¢ Gaussian process regression ā€¢ Gene expression programming ā€¢ Group method of data handling (GMDH) ā€¢ Learning Automata ā€¢ Learning Vector Quantization ā€¢ Logistic Model Tree ā€¢ Decision tree ā€¢ Decision graphs ā€¢ Lazy learning ā€¢ Monte Carlo Method ā€¢ SARSA ā€¢ Instance-based learning ā€¢ Nearest Neighbor Algorithm ā€¢ Analogical modeling ā€¢ Probably approximately correct learning (PACL) ā€¢ Symbolic machine learning algorithms ā€¢ Subsymbolic machine learning algorithms ā€¢ Support vector machines ā€¢ Random Forest ā€¢ Ensembles of classifiers ā€¢ Bootstrap aggregating (bagging) ā€¢ Boosting (meta-algorithm) ā€¢ Ordinal classification ā€¢ Regression analysis ā€¢ Information fuzzy networks (IFN) ā€¢ Linear classifiers ā€¢ Fisher's linear discriminant ā€¢ Logistic regression ā€¢ Naive Bayes classifier ā€¢ Perceptron ā€¢ Support vector machines ā€¢ Quadratic classifiers ā€¢ k-nearest neighbor ā€¢ Boosting Pick an algorithm
  • 23. The data is in the right format ā€” whatā€™s next? ā€¢ C4.5 ā€¢ Random forests ā€¢ Bayesian networks ā€¢ Hidden Markov models ā€¢ Artificial neural network ā€¢ Data clustering ā€¢ Expectation-maximization algorithm ā€¢ Self-organizing map ā€¢ Radial basis function network ā€¢ Vector Quantization ā€¢ Generative topographic map ā€¢ Information bottleneck method ā€¢ IBSEAD ā€¢ Apriori algorithm ā€¢ Eclat algorithm ā€¢ FP-growth algorithm ā€¢ Single-linkage clustering ā€¢ Conceptual clustering ā€¢ K-means algorithm ā€¢ Fuzzy clustering ā€¢ Temporal difference learning ā€¢ Q-learning ā€¢ Learning Automata ā€¢ AODE ā€¢ Artificial neural network ā€¢ Backpropagation ā€¢ Naive Bayes classifier ā€¢ Bayesian network ā€¢ Bayesian knowledge base ā€¢ Case-based reasoning ā€¢ Decision trees ā€¢ Inductive logic programming ā€¢ Gaussian process regression ā€¢ Gene expression programming ā€¢ Group method of data handling (GMDH) ā€¢ Learning Automata ā€¢ Learning Vector Quantization ā€¢ Logistic Model Tree ā€¢ Decision tree ā€¢ Decision graphs ā€¢ Lazy learning ā€¢ Monte Carlo Method ā€¢ SARSA ā€¢ Instance-based learning ā€¢ Nearest Neighbor Algorithm ā€¢ Analogical modeling ā€¢ Probably approximately correct learning (PACL) ā€¢ Symbolic machine learning algorithms ā€¢ Subsymbolic machine learning algorithms ā€¢ Support vector machines ā€¢ Random Forest ā€¢ Ensembles of classifiers ā€¢ Bootstrap aggregating (bagging) ā€¢ Boosting (meta-algorithm) ā€¢ Ordinal classification ā€¢ Regression analysis ā€¢ Information fuzzy networks (IFN) ā€¢ Linear classifiers ā€¢ Fisher's linear discriminant ā€¢ Logistic regression ā€¢ Naive Bayes classifier ā€¢ Perceptron ā€¢ Support vector machines ā€¢ Quadratic classifiers ā€¢ k-nearest neighbor ā€¢ Boosting Pick an algorithm
  • 25. DECISION TREE vs. (0, ā€¦, 28, 65, ā€¦, 207, 101, 0, 0) (0, ā€¦, 19, 34, ā€¦, 254, 54, 0, 0) (0, ā€¦, 87, 59, ā€¦, 240, 52, 4, 0) (0, ā€¦, 87, 52, ā€¦, 240, 19, 3, 0) (0, ā€¦, 28, 64, ā€¦, 102, 101, 0, 0) (0, ā€¦, 19, 23, ā€¦, 105, 54, 0, 0) (0, ā€¦, 87, 74, ā€¦, 121, 51, 7, 0) (0, ā€¦, 87, 112, ā€¦, 239, 52, 4, 0)
  • 26. DECISION TREE vs. (0, ā€¦, 28, 65, ā€¦, 207, 101, 0, 0) (0, ā€¦, 19, 34, ā€¦, 254, 54, 0, 0) (0, ā€¦, 87, 59, ā€¦, 240, 52, 4, 0) (0, ā€¦, 87, 52, ā€¦, 240, 19, 3, 0) (0, ā€¦, 28, 64, ā€¦, 102, 101, 0, 0) (0, ā€¦, 19, 23, ā€¦, 105, 54, 0, 0) (0, ā€¦, 87, 74, ā€¦, 121, 51, 7, 0) (0, ā€¦, 87, 112, ā€¦, 239, 52, 4, 0) PIXEL #417
  • 27. DECISION TREE vs. (0, ā€¦, 28, 65, ā€¦, 207, 101, 0, 0) (0, ā€¦, 19, 34, ā€¦, 254, 54, 0, 0) (0, ā€¦, 87, 59, ā€¦, 240, 52, 4, 0) (0, ā€¦, 87, 52, ā€¦, 240, 19, 3, 0) (0, ā€¦, 28, 64, ā€¦, 102, 101, 0, 0) (0, ā€¦, 19, 23, ā€¦, 105, 54, 0, 0) (0, ā€¦, 87, 74, ā€¦, 121, 51, 7, 0) (0, ā€¦, 87, 112, ā€¦, 239, 52, 4, 0) PIXEL #417 PIXEL #417 >200 <200
  • 28. DECISION TREE vs. (0, ā€¦, 28, 65, ā€¦, 207, 101, 0, 0) (0, ā€¦, 19, 34, ā€¦, 254, 54, 0, 0) (0, ā€¦, 87, 59, ā€¦, 240, 52, 4, 0) (0, ā€¦, 87, 52, ā€¦, 240, 19, 3, 0) (0, ā€¦, 28, 64, ā€¦, 102, 101, 0, 0) (0, ā€¦, 19, 23, ā€¦, 105, 54, 0, 0) (0, ā€¦, 87, 74, ā€¦, 121, 51, 7, 0) (0, ā€¦, 87, 112, ā€¦, 239, 52, 4, 0) PIXEL #417 PIXEL #417 >200 <200
  • 29. DECISION TREE vs. (0, ā€¦, 28, 65, ā€¦, 207, 101, 0, 0) (0, ā€¦, 19, 34, ā€¦, 254, 54, 0, 0) (0, ā€¦, 87, 59, ā€¦, 240, 52, 4, 0) (0, ā€¦, 87, 52, ā€¦, 240, 19, 3, 0) (0, ā€¦, 28, 64, ā€¦, 102, 101, 0, 0) (0, ā€¦, 19, 23, ā€¦, 105, 54, 0, 0) (0, ā€¦, 87, 74, ā€¦, 121, 51, 7, 0) (0, ā€¦, 87, 112, ā€¦, 239, 52, 4, 0) PIXEL #417 >200 <200
  • 30. DECISION TREE vs. (0, ā€¦, 28, 65, ā€¦, 207, 101, 0, 0) (0, ā€¦, 19, 34, ā€¦, 254, 54, 0, 0) (0, ā€¦, 87, 59, ā€¦, 240, 52, 4, 0) (0, ā€¦, 87, 52, ā€¦, 240, 19, 3, 0) (0, ā€¦, 28, 64, ā€¦, 102, 101, 0, 0) (0, ā€¦, 19, 23, ā€¦, 105, 54, 0, 0) (0, ā€¦, 87, 74, ā€¦, 121, 51, 7, 0) (0, ā€¦, 87, 112, ā€¦, 239, 52, 4, 0) PIXEL #417 >200 <200 PIXEL #123
  • 31. DECISION TREE vs. (0, ā€¦, 28, 65, ā€¦, 207, 101, 0, 0) (0, ā€¦, 19, 34, ā€¦, 254, 54, 0, 0) (0, ā€¦, 87, 59, ā€¦, 240, 52, 4, 0) (0, ā€¦, 87, 52, ā€¦, 240, 19, 3, 0) (0, ā€¦, 28, 64, ā€¦, 102, 101, 0, 0) (0, ā€¦, 19, 23, ā€¦, 105, 54, 0, 0) (0, ā€¦, 87, 74, ā€¦, 121, 51, 7, 0) (0, ā€¦, 87, 112, ā€¦, 239, 52, 4, 0) PIXEL #417 >200 <200 PIXEL #123 <100 >100 PIXEL #123
  • 32. DECISION TREE vs. (0, ā€¦, 28, 65, ā€¦, 207, 101, 0, 0) (0, ā€¦, 19, 34, ā€¦, 254, 54, 0, 0) (0, ā€¦, 87, 59, ā€¦, 240, 52, 4, 0) (0, ā€¦, 87, 52, ā€¦, 240, 19, 3, 0) (0, ā€¦, 28, 64, ā€¦, 102, 101, 0, 0) (0, ā€¦, 19, 23, ā€¦, 105, 54, 0, 0) (0, ā€¦, 87, 74, ā€¦, 121, 51, 7, 0) (0, ā€¦, 87, 112, ā€¦, 239, 52, 4, 0) PIXEL #417 >200 <200 <100 >100 PIXEL #123
  • 37. ACCURACY Confusion matrix acc = correctly classiļ¬ed total number of samples Trueclass Predicted class
  • 38. ACCURACY Confusion matrix acc = correctly classiļ¬ed total number of samples Beware of an imbalanced dataset! Trueclass Predicted class
  • 39. ACCURACY Confusion matrix acc = correctly classiļ¬ed total number of samples Beware of an imbalanced dataset! Consider the following model: ā€œAlways predict 2ā€ Trueclass Predicted class
  • 40. ACCURACY Confusion matrix acc = correctly classiļ¬ed total number of samples Beware of an imbalanced dataset! Consider the following model: ā€œAlways predict 2ā€ Accuracy 0.9 Trueclass Predicted class
  • 42. DECISION TREE ā€œYou said 100% accurate?! Every 10th digit your system detects is wrong!ā€ Angry client
  • 43. DECISION TREE ā€œYou said 100% accurate?! Every 10th digit your system detects is wrong!ā€ Angry client Weā€™ve trained our system on the data the client gave us. But our system has never seen the new data the client applied it to. And in the real life ā€” it never willā€¦
  • 44. OVERFITTING Simulate the real-life situation ā€” split the dataset
  • 45. OVERFITTING Simulate the real-life situation ā€” split the dataset
  • 46. OVERFITTING Simulate the real-life situation ā€” split the dataset
  • 47. OVERFITTING Simulate the real-life situation ā€” split the dataset
  • 49. Underļ¬tting! ā€œToo stupidā€ OK Overļ¬tting! ā€œToo smartā€ OVERFITTING Our current decision tree has too much capacity, it just has memorized all of the data. Letā€™s make it less complex.
  • 50.
  • 51.
  • 52.
  • 53. You probably did not notice, but we are overļ¬tting again :(
  • 54. TEST SET 20% TRAINING SET 60% THE WHOLE DATASET VALIDATION SET 20%
  • 55. TEST SET 20% TRAINING SET 60% THE WHOLE DATASET VALIDATION SET 20% Fit various models and parameter combinations on this subset
  • 56. TEST SET 20% TRAINING SET 60% THE WHOLE DATASET VALIDATION SET 20% Fit various models and parameter combinations on this subset ā€¢ Evaluate the models created with different parameters
  • 57. TEST SET 20% TRAINING SET 60% THE WHOLE DATASET VALIDATION SET 20% Fit various models and parameter combinations on this subset ā€¢ Evaluate the models created with different parameters ! ā€¢ Estimate overļ¬tting TRA VALI
  • 58. TEST SET 20% TRAINING SET 60% THE WHOLE DATASET VALIDATION SET 20% Fit various models and parameter combinations on this subset ā€¢ Evaluate the models created with different parameters ! ā€¢ Estimate overļ¬tting TRA VALI TRA VALI
  • 59. TEST SET 20% TRAINING SET 60% THE WHOLE DATASET VALIDATION SET 20% Fit various models and parameter combinations on this subset ā€¢ Evaluate the models created with different parameters ! ā€¢ Estimate overļ¬tting TRA VALI TRA VALI TRA VALI
  • 60. TEST SET 20% TRAINING SET 60% THE WHOLE DATASET VALIDATION SET 20% Fit various models and parameter combinations on this subset ā€¢ Evaluate the models created with different parameters ! ā€¢ Estimate overļ¬tting TRA VALI TRA VALI TRA VALI TRA VALI
  • 61. TEST SET 20% TRAINING SET 60% THE WHOLE DATASET VALIDATION SET 20% Fit various models and parameter combinations on this subset ā€¢ Evaluate the models created with different parameters ! ā€¢ Estimate overļ¬tting TRA VALI TRA VALI TRA VALI TRA VALI TRA VALI
  • 62. TEST SET 20% TRAINING SET 60% THE WHOLE DATASET VALIDATION SET 20% Fit various models and parameter combinations on this subset ā€¢ Evaluate the models created with different parameters ! ā€¢ Estimate overļ¬tting Use only once to get the ļ¬nal performance estimate TRA VALI TRA VALI TRA VALI TRA VALI TRA VALI
  • 65. CROSS-VALIDATION TRAINING SET 60% THE WHOLE DATASET VALIDATION SET 20%
  • 66. CROSS-VALIDATION TRAINING SET 60% THE WHOLE DATASET VALIDATION SET 20% What if we got too optimistic validation set?
  • 67. CROSS-VALIDATION TRAINING SET 60% THE WHOLE DATASET VALIDATION SET 20% What if we got too optimistic validation set? TRAINING SET 80%
  • 68. CROSS-VALIDATION TRAINING SET 60% THE WHOLE DATASET VALIDATION SET 20% What if we got too optimistic validation set? TRAINING SET 80% Fix the parameter value you ned to evaluate, say msl=15
  • 69. CROSS-VALIDATION TRAINING SET 60% THE WHOLE DATASET VALIDATION SET 20% What if we got too optimistic validation set? TRAINING SET 80% Fix the parameter value you ned to evaluate, say msl=15 TRAINING VAL TRAINING VAL TRAININGVAL Repeat 10 times
  • 70. CROSS-VALIDATION TRAINING SET 60% THE WHOLE DATASET VALIDATION SET 20% What if we got too optimistic validation set? TRAINING SET 80% Fix the parameter value you ned to evaluate, say msl=15 TRAINING VAL TRAINING VAL TRAININGVAL Repeat 10 times } Take average validation score over 10 runs ā€” it is a more stable estimate.
  • 71.
  • 72.
  • 73.
  • 74. MACHINE LEARNING PIPELINE Take raw data Extract features Split into TRAINING and TEST Pick an algorithm and parameters Train on the TRAINING data Evaluate on the TRAINING data with CV Train on the whole TRAINING Fix the best parameters Evaluate on TEST Report ļ¬nal performance to the client Try our different algorithms and parameters
  • 75. MACHINE LEARNING PIPELINE Take raw data Extract features Split into TRAINING and TEST Pick an algorithm and parameters Train on the TRAINING data Evaluate on the TRAINING data with CV Train on the whole TRAINING Fix the best parameters Evaluate on TEST Report ļ¬nal performance to the client Try our different algorithms and parameters ā€œSo it is ~87%ā€¦ermā€¦ Could you do better?ā€
  • 76. MACHINE LEARNING PIPELINE Take raw data Extract features Split into TRAINING and TEST Pick an algorithm and parameters Train on the TRAINING data Evaluate on the TRAINING data with CV Train on the whole TRAINING Fix the best parameters Evaluate on TEST Report ļ¬nal performance to the client Try our different algorithms and parameters ā€œSo it is ~87%ā€¦ermā€¦ Could you do better?ā€ Yes
  • 77. ā€¢ C4.5 ā€¢ Random forests ā€¢ Bayesian networks ā€¢ Hidden Markov models ā€¢ Artificial neural network ā€¢ Data clustering ā€¢ Expectation-maximization algorithm ā€¢ Self-organizing map ā€¢ Radial basis function network ā€¢ Vector Quantization ā€¢ Generative topographic map ā€¢ Information bottleneck method ā€¢ IBSEAD ā€¢ Apriori algorithm ā€¢ Eclat algorithm ā€¢ FP-growth algorithm ā€¢ Single-linkage clustering ā€¢ Conceptual clustering ā€¢ K-means algorithm ā€¢ Fuzzy clustering ā€¢ Temporal difference learning ā€¢ Q-learning ā€¢ Learning Automata ā€¢ AODE ā€¢ Artificial neural network ā€¢ Backpropagation ā€¢ Naive Bayes classifier ā€¢ Bayesian network ā€¢ Bayesian knowledge base ā€¢ Case-based reasoning ā€¢ Decision trees ā€¢ Inductive logic programming ā€¢ Gaussian process regression ā€¢ Gene expression programming ā€¢ Group method of data handling (GMDH) ā€¢ Learning Automata ā€¢ Learning Vector Quantization ā€¢ Logistic Model Tree ā€¢ Decision tree ā€¢ Decision graphs ā€¢ Lazy learning ā€¢ Monte Carlo Method ā€¢ SARSA ā€¢ Instance-based learning ā€¢ Nearest Neighbor Algorithm ā€¢ Analogical modeling ā€¢ Probably approximately correct learning (PACL) ā€¢ Symbolic machine learning algorithms ā€¢ Subsymbolic machine learning algorithms ā€¢ Support vector machines ā€¢ Random Forest ā€¢ Ensembles of classifiers ā€¢ Bootstrap aggregating (bagging) ā€¢ Boosting (meta-algorithm) ā€¢ Ordinal classification ā€¢ Regression analysis ā€¢ Information fuzzy networks (IFN) ā€¢ Linear classifiers ā€¢ Fisher's linear discriminant ā€¢ Logistic regression ā€¢ Naive Bayes classifier ā€¢ Perceptron ā€¢ Support vector machines ā€¢ Quadratic classifiers ā€¢ k-nearest neighbor ā€¢ Boosting Pick another algorithm
  • 78. ā€¢ C4.5 ā€¢ Random forests ā€¢ Bayesian networks ā€¢ Hidden Markov models ā€¢ Artificial neural network ā€¢ Data clustering ā€¢ Expectation-maximization algorithm ā€¢ Self-organizing map ā€¢ Radial basis function network ā€¢ Vector Quantization ā€¢ Generative topographic map ā€¢ Information bottleneck method ā€¢ IBSEAD ā€¢ Apriori algorithm ā€¢ Eclat algorithm ā€¢ FP-growth algorithm ā€¢ Single-linkage clustering ā€¢ Conceptual clustering ā€¢ K-means algorithm ā€¢ Fuzzy clustering ā€¢ Temporal difference learning ā€¢ Q-learning ā€¢ Learning Automata ā€¢ AODE ā€¢ Artificial neural network ā€¢ Backpropagation ā€¢ Naive Bayes classifier ā€¢ Bayesian network ā€¢ Bayesian knowledge base ā€¢ Case-based reasoning ā€¢ Decision trees ā€¢ Inductive logic programming ā€¢ Gaussian process regression ā€¢ Gene expression programming ā€¢ Group method of data handling (GMDH) ā€¢ Learning Automata ā€¢ Learning Vector Quantization ā€¢ Logistic Model Tree ā€¢ Decision tree ā€¢ Decision graphs ā€¢ Lazy learning ā€¢ Monte Carlo Method ā€¢ SARSA ā€¢ Instance-based learning ā€¢ Nearest Neighbor Algorithm ā€¢ Analogical modeling ā€¢ Probably approximately correct learning (PACL) ā€¢ Symbolic machine learning algorithms ā€¢ Subsymbolic machine learning algorithms ā€¢ Support vector machines ā€¢ Random Forest ā€¢ Ensembles of classifiers ā€¢ Bootstrap aggregating (bagging) ā€¢ Boosting (meta-algorithm) ā€¢ Ordinal classification ā€¢ Regression analysis ā€¢ Information fuzzy networks (IFN) ā€¢ Linear classifiers ā€¢ Fisher's linear discriminant ā€¢ Logistic regression ā€¢ Naive Bayes classifier ā€¢ Perceptron ā€¢ Support vector machines ā€¢ Quadratic classifiers ā€¢ k-nearest neighbor ā€¢ Boosting Pick another algorithm
  • 80. RANDOM FOREST Decision tree: pick best out of all features
  • 81. RANDOM FOREST Decision tree: pick best out of all features Random forest: pick best out of random subset of features
  • 83. RANDOM FOREST pick best out of another random subset of features
  • 84. RANDOM FOREST pick best out of another random subset of features pick best out of yet another random subset of features
  • 91.
  • 92.
  • 94. ALL OTHER USE CASES
  • 95. Sound Frequency components Genre Bag of words Topic Text Pixel values Image Cat or dog Video Frame pixels Walking or running Database records Biometric data Census data Average salary ā€¦ Dead or alive
  • 96.
  • 97.
  • 98.
  • 99.
  • 100.