SlideShare a Scribd company logo
1 of 92
Download to read offline
Machine Learning-1
Presented by Skillslash
What is MC Learning
www.skillslash.com
The subfield of computer science that “gives computers the ability to learn
without being explicitly programmed”.
(Arthur Samuel, 1959)
A computer program is said to learn from experience E with respect to some class of
tasks T and performance measure P if its performance at tasks in T, as measured by P,
improves with experience E.”
(Tom Mitchell, 1997)
Using data for answering
questions
High Bias and Low Variance
(Low Flexibility)
Low Bias and High Variance
(Too Flexibility)
Low Bias and High Variance
(Balanced Flexibility)
Bias Error:
The bias is known as the difference between the prediction of the values by the ML model and the correct
value. Being high in biasing gives a large error in training as well as testing data.
Variance Error:
Variance is the amount that the estimate of the target function will change if different training data was
used.
www.skillslash.com
Types of Machine Learning
Supervised Learning vs Unsupervised Learning
Un-Supervised Learning
Clustering
Regression vs Classification
Semi Supervised Learning
Types of Supervised ML
Supervise
d
Unsupervise
d
Reinforceme
nt
Output is a discrete variable
(e.g.,
Defaulter and Non Defaulter
Spam and non spam
Purchaser Non Purchaser)
Classificatio
n
Regressio
n
Output is continuous (e.g.,
price of house,
temperature)
www.skillslash.com
www.skillslash.com
Supervise
d
Unsupervise
d
Reinforceme
nt
www.skillslash.com
www.skillslash.com
Types of Machine Learning
Problems
Supervise
d
Unsupervise
d
Reinforceme
nt
Supervise
d
Is this a cat or a dog?
Are these emails spam or not?
Unsupervised
Predict the market value of houses, given the
square meters, number of rooms,
neighborhood, etc.
Reinforcement
Learn through examples of which we know
the desired output (what we want to
predict).
Types of Machine Learning
Problems
Unsupervise
d
Supervised
There is no desired output. Learn something
about the data. Latent relationships.
I want to find anomalies in the credit card
usage patterns of my customers.
Reinforcement
I have photos and want to put them in
20 groups.
www.skillslash.com
Types of Machine Learning
Problems
Unsupervise
d
Supervise
d
Reinforceme
nt
Useful for learning structure in the data
(clustering), hidden correlations, reduce
dimensionality, etc.
www.skillslash.com
Environment gives feedback via a
positive or negative reward signal.
Unsupervised
Reinforceme
nt
Supervise
d
An agent interacts with an environment and
watches the result of the interaction.
Types of Machine Learning
Problems
www.skillslash.com
Data Gathering
60
Might depend on human work
• Manual labeling for supervised learning.
• Domain knowledge. Maybe even experts.
May come for free, or “sort of”
• E.g., Machine Translation.
The more the better: Some algorithms need large amounts of data
to be useful (e.g., neural networks).
The quantity and quality of data dictate the model accuracy
www.skillslash.com
Data Preprocessing
61
Is there anything wrong with the data?
• Missing values
• Outliers
• Bad encoding (for text)
• Wrongly-labeled examples
• Biased data
• Do I have many more samples of one
class than the rest?
Need to fix/remove data?
www.skillslash.com
Feature Engineering
62
What is a feature?
A feature is an individual measurable
property of a phenomenon being
observed
Our inputs are represented by a set of
features.
To classify spam email, features could be:
• Number of words that have been
ch4ng3d
like this.
• Language of the email
Buy ch34p drugs
from the
ph4rm4cy now :)
:) :)
(2, 0, 3)
Feature
engineerin
g
www.skillslash.com
Feature Engineering
63
Extract more information from existing data, not adding “new” data
per-se
• Making it more useful
• With good features, most algorithms can learn
faster It can be an art
• Requires thought and knowledge of the
data Two steps:
• Variable transformation (e.g., dates into weekdays,
normalizing)
www.skillslash.com
Algorithm Selection & Training
64
Supervise
d
• Linear classifier
• Naive Bayes
• Support Vector Machines
(SVM)
• Decision Tree
• Random Forests
• k-Nearest Neighbors
• Neural Networks (Deep
learning)
Unsupervise
d
• PCA
• t-SNE
• k-mean
s
• DBSCAN
Reinforcemen
t
• SARSA–λ
• Q-Learnin
g
www.skillslash.com
65
THE MACHINE LEARNING FRAMEWORK
y = f(x)
● Training: given a training set of labeled examples {(x1
,y1
), …,
(xN
,yN
)}, estimate the prediction function f by minimizing the
prediction error on the training set
● Testing: apply f to a never before seen test example x and
output the predicted value y = f(x)
output prediction
function
Image
feature
www.skillslash.com
Goal of training: making the correct prediction as often as
possible
• Incremental improvement:
• Use of metrics for evaluating performance and comparing
solutions
• Hyperparameter tuning: more an art than a science
Algorithm Selection & Training
66
Predic
t
Adjus
t
www.skillslash.com
Summary
67
• Machine Learning is intelligent use of data to answer questions
• Enabled by an exponential increase in computing power and
data availability
• Three big types of problems: supervised, unsupervised,
reinforcement
• 5 steps to every machine learning solution:
1. Data Gathering
2. Data Preprocessing
3. Feature Engineering
4. Algorithm Selection & Training
5. Making Predictions www.skillslash.com
Generalization
● How well does a learned model generalize from the data it
was trained on to a new test set?
Training set (labels known) Test set (labels
unknown)
Generalization
● Components of generalization error
○ Bias: how much the average model over all training sets differ from the true
model?
■ Error due to inaccurate assumptions/simplifications made by the model
■ Using very less features
○ Variance: how much models estimated from different training sets differ from
each other
● Underfitting: model is too “simple” to represent all the relevant class
characteristics
○ High bias and low variance
○ High training error and high test error
● Overfitting: model is too “complex” and fits irrelevant characteristics
(noise) in the data
○ Low bias and high variance
○ Low training error and high test error
Bias-Variance Trade-off
• Models with too few parameters are
inaccurate because of a large bias (not
enough flexibility).
• Bias can also come due to wrong
assumption.
• Lead to Train error
• Models with too many parameters are
inaccurate because of a large variance
(too much sensitivity to the sample).
• Lead to Test Error
www.skillslash.com
THANK YOU
www.skillslash.com
Machine Learning Fundamentals in 40 Characters
Machine Learning Fundamentals in 40 Characters
Machine Learning Fundamentals in 40 Characters
Machine Learning Fundamentals in 40 Characters
Machine Learning Fundamentals in 40 Characters
Machine Learning Fundamentals in 40 Characters
Machine Learning Fundamentals in 40 Characters
Machine Learning Fundamentals in 40 Characters
Machine Learning Fundamentals in 40 Characters
Machine Learning Fundamentals in 40 Characters
Machine Learning Fundamentals in 40 Characters
Machine Learning Fundamentals in 40 Characters
Machine Learning Fundamentals in 40 Characters
Machine Learning Fundamentals in 40 Characters
Machine Learning Fundamentals in 40 Characters
Machine Learning Fundamentals in 40 Characters
Machine Learning Fundamentals in 40 Characters
Machine Learning Fundamentals in 40 Characters

More Related Content

Similar to Machine Learning Fundamentals in 40 Characters

Introduction to Machine Learning.pptx
Introduction to Machine Learning.pptxIntroduction to Machine Learning.pptx
Introduction to Machine Learning.pptxDr. Amanpreet Kaur
 
Introduction to ML (Machine Learning)
Introduction to ML (Machine Learning)Introduction to ML (Machine Learning)
Introduction to ML (Machine Learning)SwatiTripathi44
 
introduction to machine learning
introduction to machine learningintroduction to machine learning
introduction to machine learningJohnson Ubah
 
Supervised learning techniques and applications
Supervised learning techniques and applicationsSupervised learning techniques and applications
Supervised learning techniques and applicationsBenjaminlapid1
 
Optimal Model Complexity (1).pptx
Optimal Model Complexity (1).pptxOptimal Model Complexity (1).pptx
Optimal Model Complexity (1).pptxMurindanyiSudi1
 
Machine learning with ADA Boost
Machine learning with ADA BoostMachine learning with ADA Boost
Machine learning with ADA BoostAman Patel
 
Machine Learning course in Chandigarh Join
Machine Learning course in Chandigarh JoinMachine Learning course in Chandigarh Join
Machine Learning course in Chandigarh Joinasmeerana605
 
Tech meetup Data Driven - Codemotion
Tech meetup Data Driven - Codemotion Tech meetup Data Driven - Codemotion
Tech meetup Data Driven - Codemotion antimo musone
 
It's Machine Learning Basics -- For You!
It's Machine Learning Basics -- For You!It's Machine Learning Basics -- For You!
It's Machine Learning Basics -- For You!To Sum It Up
 
Introduction to machine learning-2023-IT-AI and DS.pdf
Introduction to machine learning-2023-IT-AI and DS.pdfIntroduction to machine learning-2023-IT-AI and DS.pdf
Introduction to machine learning-2023-IT-AI and DS.pdfSisayNegash4
 
Machine Learning Chapter one introduction
Machine Learning Chapter one introductionMachine Learning Chapter one introduction
Machine Learning Chapter one introductionARVIND SARDAR
 
Intro to machine learning
Intro to machine learningIntro to machine learning
Intro to machine learningAkshay Kanchan
 
Machine Learning 2 deep Learning: An Intro
Machine Learning 2 deep Learning: An IntroMachine Learning 2 deep Learning: An Intro
Machine Learning 2 deep Learning: An IntroSi Krishan
 
Rahul_Kirtoniya_11800121032_CSE_Machine_Learning.pptx
Rahul_Kirtoniya_11800121032_CSE_Machine_Learning.pptxRahul_Kirtoniya_11800121032_CSE_Machine_Learning.pptx
Rahul_Kirtoniya_11800121032_CSE_Machine_Learning.pptxRahulKirtoniya
 
Machine Learning Interview Questions
Machine Learning Interview QuestionsMachine Learning Interview Questions
Machine Learning Interview QuestionsRock Interview
 

Similar to Machine Learning Fundamentals in 40 Characters (20)

Introduction to Machine Learning.pptx
Introduction to Machine Learning.pptxIntroduction to Machine Learning.pptx
Introduction to Machine Learning.pptx
 
Introduction to ML (Machine Learning)
Introduction to ML (Machine Learning)Introduction to ML (Machine Learning)
Introduction to ML (Machine Learning)
 
introduction to machine learning
introduction to machine learningintroduction to machine learning
introduction to machine learning
 
Supervised learning techniques and applications
Supervised learning techniques and applicationsSupervised learning techniques and applications
Supervised learning techniques and applications
 
Machine Learning by Rj
Machine Learning by RjMachine Learning by Rj
Machine Learning by Rj
 
ML basics.pptx
ML basics.pptxML basics.pptx
ML basics.pptx
 
MachineLearning_AishwaryaCR
MachineLearning_AishwaryaCRMachineLearning_AishwaryaCR
MachineLearning_AishwaryaCR
 
Optimal Model Complexity (1).pptx
Optimal Model Complexity (1).pptxOptimal Model Complexity (1).pptx
Optimal Model Complexity (1).pptx
 
Machine learning with ADA Boost
Machine learning with ADA BoostMachine learning with ADA Boost
Machine learning with ADA Boost
 
Machine Learning course in Chandigarh Join
Machine Learning course in Chandigarh JoinMachine Learning course in Chandigarh Join
Machine Learning course in Chandigarh Join
 
Tech meetup Data Driven - Codemotion
Tech meetup Data Driven - Codemotion Tech meetup Data Driven - Codemotion
Tech meetup Data Driven - Codemotion
 
It's Machine Learning Basics -- For You!
It's Machine Learning Basics -- For You!It's Machine Learning Basics -- For You!
It's Machine Learning Basics -- For You!
 
Introduction to machine learning-2023-IT-AI and DS.pdf
Introduction to machine learning-2023-IT-AI and DS.pdfIntroduction to machine learning-2023-IT-AI and DS.pdf
Introduction to machine learning-2023-IT-AI and DS.pdf
 
Machine Learning Chapter one introduction
Machine Learning Chapter one introductionMachine Learning Chapter one introduction
Machine Learning Chapter one introduction
 
Intro to machine learning
Intro to machine learningIntro to machine learning
Intro to machine learning
 
Machine Learning - Deep Learning
Machine Learning - Deep LearningMachine Learning - Deep Learning
Machine Learning - Deep Learning
 
Machine Learning 2 deep Learning: An Intro
Machine Learning 2 deep Learning: An IntroMachine Learning 2 deep Learning: An Intro
Machine Learning 2 deep Learning: An Intro
 
Rahul_Kirtoniya_11800121032_CSE_Machine_Learning.pptx
Rahul_Kirtoniya_11800121032_CSE_Machine_Learning.pptxRahul_Kirtoniya_11800121032_CSE_Machine_Learning.pptx
Rahul_Kirtoniya_11800121032_CSE_Machine_Learning.pptx
 
Machine Learning Interview Questions
Machine Learning Interview QuestionsMachine Learning Interview Questions
Machine Learning Interview Questions
 
Unit-1.ppt
Unit-1.pptUnit-1.ppt
Unit-1.ppt
 

Recently uploaded

办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...ThinkInnovation
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一F La
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 

Recently uploaded (20)

办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 

Machine Learning Fundamentals in 40 Characters

  • 2. What is MC Learning www.skillslash.com The subfield of computer science that “gives computers the ability to learn without being explicitly programmed”. (Arthur Samuel, 1959) A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P if its performance at tasks in T, as measured by P, improves with experience E.” (Tom Mitchell, 1997) Using data for answering questions
  • 3.
  • 4.
  • 5.
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22.
  • 23.
  • 24.
  • 25.
  • 26.
  • 27.
  • 28.
  • 29.
  • 30.
  • 31.
  • 32.
  • 33.
  • 34.
  • 35.
  • 36.
  • 37.
  • 38.
  • 39.
  • 40. High Bias and Low Variance (Low Flexibility) Low Bias and High Variance (Too Flexibility) Low Bias and High Variance (Balanced Flexibility)
  • 41. Bias Error: The bias is known as the difference between the prediction of the values by the ML model and the correct value. Being high in biasing gives a large error in training as well as testing data. Variance Error: Variance is the amount that the estimate of the target function will change if different training data was used.
  • 43. Supervised Learning vs Unsupervised Learning
  • 44.
  • 45.
  • 50. Types of Supervised ML Supervise d Unsupervise d Reinforceme nt Output is a discrete variable (e.g., Defaulter and Non Defaulter Spam and non spam Purchaser Non Purchaser) Classificatio n Regressio n Output is continuous (e.g., price of house, temperature) www.skillslash.com
  • 52.
  • 55. Types of Machine Learning Problems Supervise d Unsupervise d Reinforceme nt Supervise d Is this a cat or a dog? Are these emails spam or not? Unsupervised Predict the market value of houses, given the square meters, number of rooms, neighborhood, etc. Reinforcement Learn through examples of which we know the desired output (what we want to predict).
  • 56. Types of Machine Learning Problems Unsupervise d Supervised There is no desired output. Learn something about the data. Latent relationships. I want to find anomalies in the credit card usage patterns of my customers. Reinforcement I have photos and want to put them in 20 groups. www.skillslash.com
  • 57. Types of Machine Learning Problems Unsupervise d Supervise d Reinforceme nt Useful for learning structure in the data (clustering), hidden correlations, reduce dimensionality, etc. www.skillslash.com
  • 58. Environment gives feedback via a positive or negative reward signal. Unsupervised Reinforceme nt Supervise d An agent interacts with an environment and watches the result of the interaction. Types of Machine Learning Problems www.skillslash.com
  • 59.
  • 60. Data Gathering 60 Might depend on human work • Manual labeling for supervised learning. • Domain knowledge. Maybe even experts. May come for free, or “sort of” • E.g., Machine Translation. The more the better: Some algorithms need large amounts of data to be useful (e.g., neural networks). The quantity and quality of data dictate the model accuracy www.skillslash.com
  • 61. Data Preprocessing 61 Is there anything wrong with the data? • Missing values • Outliers • Bad encoding (for text) • Wrongly-labeled examples • Biased data • Do I have many more samples of one class than the rest? Need to fix/remove data? www.skillslash.com
  • 62. Feature Engineering 62 What is a feature? A feature is an individual measurable property of a phenomenon being observed Our inputs are represented by a set of features. To classify spam email, features could be: • Number of words that have been ch4ng3d like this. • Language of the email Buy ch34p drugs from the ph4rm4cy now :) :) :) (2, 0, 3) Feature engineerin g www.skillslash.com
  • 63. Feature Engineering 63 Extract more information from existing data, not adding “new” data per-se • Making it more useful • With good features, most algorithms can learn faster It can be an art • Requires thought and knowledge of the data Two steps: • Variable transformation (e.g., dates into weekdays, normalizing) www.skillslash.com
  • 64. Algorithm Selection & Training 64 Supervise d • Linear classifier • Naive Bayes • Support Vector Machines (SVM) • Decision Tree • Random Forests • k-Nearest Neighbors • Neural Networks (Deep learning) Unsupervise d • PCA • t-SNE • k-mean s • DBSCAN Reinforcemen t • SARSA–λ • Q-Learnin g www.skillslash.com
  • 65. 65 THE MACHINE LEARNING FRAMEWORK y = f(x) ● Training: given a training set of labeled examples {(x1 ,y1 ), …, (xN ,yN )}, estimate the prediction function f by minimizing the prediction error on the training set ● Testing: apply f to a never before seen test example x and output the predicted value y = f(x) output prediction function Image feature www.skillslash.com
  • 66. Goal of training: making the correct prediction as often as possible • Incremental improvement: • Use of metrics for evaluating performance and comparing solutions • Hyperparameter tuning: more an art than a science Algorithm Selection & Training 66 Predic t Adjus t www.skillslash.com
  • 67. Summary 67 • Machine Learning is intelligent use of data to answer questions • Enabled by an exponential increase in computing power and data availability • Three big types of problems: supervised, unsupervised, reinforcement • 5 steps to every machine learning solution: 1. Data Gathering 2. Data Preprocessing 3. Feature Engineering 4. Algorithm Selection & Training 5. Making Predictions www.skillslash.com
  • 68. Generalization ● How well does a learned model generalize from the data it was trained on to a new test set? Training set (labels known) Test set (labels unknown)
  • 69. Generalization ● Components of generalization error ○ Bias: how much the average model over all training sets differ from the true model? ■ Error due to inaccurate assumptions/simplifications made by the model ■ Using very less features ○ Variance: how much models estimated from different training sets differ from each other ● Underfitting: model is too “simple” to represent all the relevant class characteristics ○ High bias and low variance ○ High training error and high test error ● Overfitting: model is too “complex” and fits irrelevant characteristics (noise) in the data ○ Low bias and high variance ○ Low training error and high test error
  • 70.
  • 71.
  • 72. Bias-Variance Trade-off • Models with too few parameters are inaccurate because of a large bias (not enough flexibility). • Bias can also come due to wrong assumption. • Lead to Train error • Models with too many parameters are inaccurate because of a large variance (too much sensitivity to the sample). • Lead to Test Error