SlideShare a Scribd company logo
1 of 19
Ajay Ramaseshan
MachinePulse.
Machine Learning & Real-world Applications
 Why Machine Learning?
 Supervised Vs Unsupervised learning.
 Machine learning methods.
 Real world applications.
Agenda
Why Machine Learning?
 Volume of data collected growing day by day.
 Data production will be 44 times greater in 2020 than in 2009.
 Every day, 2.5 quintillion bytes of data are created, with 90
percent of the world's data created in the past two years.
 Very little data will ever be looked at by a human.
 Data is cheap and abundant (data warehouses, data marts);
knowledge is expensive and scarce.
 Knowledge Discovery is NEEDED to make sense and use of
data.
 Machine Learning is a technique in which computers learn
from data to obtain insight and help in knowledge discovery.
Why Machine Learning?
 Supervised learning – class labels/ target variable known
Overview of Machine Learning Methods
 Unsupervised learning – no class labels provided, need to
detect clusters of similar items in the data.
Overview (Contd)
 Parametric Models:
Assumes prob. distribution for data, and
learn parameters from data
E.g. Naïve Bayes classifier, linear regression etc.
 Non-parametric Models:
No fixed number of parameters.
E.g. K-NN, histograms etc.
Parametric Vs Nonparametric
 Generative model – learns model for generating data, given
some hidden parameters.
 Learns the joint probability distribution p(x,y).
e.g. HMM, GMM, Naïve Bayes etc.
 Discriminative model – learns dependence of unobserved
variable y on observed variable x.
 Tries to model the separation between classes.
 Learns the conditional probability distribution p(y|x).
e.g. Logistic Regression, SVM, Neural networks etc.
Generative Vs Discriminative
 Classification – Supervised learning.
 Commonly used Methods for Classification –
Naïve Bayes
Decision tree
K nearest neighbors
Neural Networks
Support Vector Machines.
Classification
 Regression – Predicting an output variable given input
variables.
 Algorithms used –
Ordinary least squares
 Partial least squares
 Logistic Regression
 Stepwise Regression
 Support Vector Regression
Neural Networks
Regression
 Clustering:
Group data into clusters using similarity measures.
 Algorithms:
K-means clustering
 Density based EM algorithm
 Hierarchical clustering.
 Spectral Clustering
Clustering
 Naïve Bayes classifier
Assumes conditional independence among features.
 P(xi,xj,xk|C) = P(xi|C)P(xj|C)P(xk|C)
Naïve Bayes
 K-nearest neighbors:
Classifies the data point with the class of the k
nearest neighbors.
 Value of k decided using cross validation
K-Nearest Neighbors
 Leaves indicate classes.
 Non-terminal nodes – decisions on attribute values
 Algorithms used for decision tree learning
 C4.5
 ID3
 CART.
Decision trees
 Artificial Neural Networks
Modeled after the
human brain
 Consists of an input layer,
many hidden layers, and an
output layer.
 Multi-Layer Perceptrons,
Radial Basis Functions,
Kohonen Networks etc.
Neural Networks
Validation methods
 Cross validation techniques
K-fold cross validation
Leave one out Cross Validation
 ROC curve (for binary classifier)
 Confusion Matrix
Evaluation of Machine Learning Methods
Some real life applications of machine learning:
 Recommender systems – suggesting similar people on
Facebook/LinkedIn, similar movies/ books etc. on Amazon,
 Business applications – Customer segmentation, Customer
retention, Targeted Marketing etc.
 Medical applications – Disease diagnosis,
 Banking – Credit card issue, fraud detection etc.
 Language translation, text to speech or vice versa.
Real life applications
Wisconsin Breast Cancer dataset:
 Instances : 569
 Features : 32
 Class variable : malignant, or benign.
 Steps to classify using k-NN
 Load data into R
 Data normalization
 Split into training and test datasets
 Train model
 Evaluate model performance
Breast Cancer Dataset and k-NN
Thank you!

More Related Content

What's hot

Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Simplilearn
 
An introduction to Machine Learning
An introduction to Machine LearningAn introduction to Machine Learning
An introduction to Machine Learning
butest
 
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Simplilearn
 
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
Simplilearn
 

What's hot (20)

Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Machine Learning-Linear regression
Machine Learning-Linear regressionMachine Learning-Linear regression
Machine Learning-Linear regression
 
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
 
supervised learning
supervised learningsupervised learning
supervised learning
 
Machine Can Think
Machine Can ThinkMachine Can Think
Machine Can Think
 
Pattern recognition and Machine Learning.
Pattern recognition and Machine Learning.Pattern recognition and Machine Learning.
Pattern recognition and Machine Learning.
 
Presentation on supervised learning
Presentation on supervised learningPresentation on supervised learning
Presentation on supervised learning
 
Back propagation
Back propagationBack propagation
Back propagation
 
Unsupervised learning
Unsupervised learningUnsupervised learning
Unsupervised learning
 
An introduction to Machine Learning
An introduction to Machine LearningAn introduction to Machine Learning
An introduction to Machine Learning
 
Machine learning
Machine learningMachine learning
Machine learning
 
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Tutori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Tutori...Machine Learning Algorithms | Machine Learning Tutorial | Data Science Tutori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Tutori...
 
Introduction to Deep Learning
Introduction to Deep LearningIntroduction to Deep Learning
Introduction to Deep Learning
 
Classification and Regression
Classification and RegressionClassification and Regression
Classification and Regression
 
Machine learning
Machine learningMachine learning
Machine learning
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
 
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
 
Intro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning PresentationIntro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning Presentation
 
Machine learning
Machine learning Machine learning
Machine learning
 

Similar to Machine Learning and Real-World Applications

Presentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data MiningPresentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data Mining
butest
 
Brief Tour of Machine Learning
Brief Tour of Machine LearningBrief Tour of Machine Learning
Brief Tour of Machine Learning
butest
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273A
butest
 
Internship - Python - AI ML.pptx
Internship - Python - AI ML.pptxInternship - Python - AI ML.pptx
Internship - Python - AI ML.pptx
Hchethankumar
 
Internship - Python - AI ML.pptx
Internship - Python - AI ML.pptxInternship - Python - AI ML.pptx
Internship - Python - AI ML.pptx
Hchethankumar
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401
butest
 
32_Nov07_MachineLear..
32_Nov07_MachineLear..32_Nov07_MachineLear..
32_Nov07_MachineLear..
butest
 

Similar to Machine Learning and Real-World Applications (20)

Introduction to Machine Learning with SciKit-Learn
Introduction to Machine Learning with SciKit-LearnIntroduction to Machine Learning with SciKit-Learn
Introduction to Machine Learning with SciKit-Learn
 
Data mining
Data mining Data mining
Data mining
 
background.pptx
background.pptxbackground.pptx
background.pptx
 
5. Machine Learning.pptx
5.  Machine Learning.pptx5.  Machine Learning.pptx
5. Machine Learning.pptx
 
Presentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data MiningPresentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data Mining
 
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHESIMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
 
Brief Tour of Machine Learning
Brief Tour of Machine LearningBrief Tour of Machine Learning
Brief Tour of Machine Learning
 
Introduction to Deep Learning and neon at Galvanize
Introduction to Deep Learning and neon at GalvanizeIntroduction to Deep Learning and neon at Galvanize
Introduction to Deep Learning and neon at Galvanize
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273A
 
Internship - Python - AI ML.pptx
Internship - Python - AI ML.pptxInternship - Python - AI ML.pptx
Internship - Python - AI ML.pptx
 
Internship - Python - AI ML.pptx
Internship - Python - AI ML.pptxInternship - Python - AI ML.pptx
Internship - Python - AI ML.pptx
 
IRJET- Study and Evaluation of Classification Algorithms in Data Mining
IRJET- Study and Evaluation of Classification Algorithms in Data MiningIRJET- Study and Evaluation of Classification Algorithms in Data Mining
IRJET- Study and Evaluation of Classification Algorithms in Data Mining
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401
 
Supervised and unsupervised learning
Supervised and unsupervised learningSupervised and unsupervised learning
Supervised and unsupervised learning
 
Few shot learning/ one shot learning/ machine learning
Few shot learning/ one shot learning/ machine learningFew shot learning/ one shot learning/ machine learning
Few shot learning/ one shot learning/ machine learning
 
AI and Deep Learning
AI and Deep Learning AI and Deep Learning
AI and Deep Learning
 
32_Nov07_MachineLear..
32_Nov07_MachineLear..32_Nov07_MachineLear..
32_Nov07_MachineLear..
 
PythonML.pptx
PythonML.pptxPythonML.pptx
PythonML.pptx
 
Deep learning from a novice perspective
Deep learning from a novice perspectiveDeep learning from a novice perspective
Deep learning from a novice perspective
 
Hypothesis on Different Data Mining Algorithms
Hypothesis on Different Data Mining AlgorithmsHypothesis on Different Data Mining Algorithms
Hypothesis on Different Data Mining Algorithms
 

More from MachinePulse

IoT Cloud architecture
IoT Cloud architectureIoT Cloud architecture
IoT Cloud architecture
MachinePulse
 

More from MachinePulse (15)

Tracking PV module degradation using SolarPulse
Tracking PV module degradation using SolarPulseTracking PV module degradation using SolarPulse
Tracking PV module degradation using SolarPulse
 
SolarPulse solves the problem of managing distributed solar sites
SolarPulse solves the problem of managing distributed solar sitesSolarPulse solves the problem of managing distributed solar sites
SolarPulse solves the problem of managing distributed solar sites
 
SolarPulse helps PV plants reduce field expenses
SolarPulse helps PV plants reduce field expensesSolarPulse helps PV plants reduce field expenses
SolarPulse helps PV plants reduce field expenses
 
Effective response to adverse weather conditions using SolarPulse
Effective response to adverse weather conditions using SolarPulseEffective response to adverse weather conditions using SolarPulse
Effective response to adverse weather conditions using SolarPulse
 
Assessing the impact of SolarPulse on performance of utility scale PV plants
Assessing the impact of SolarPulse on performance of utility scale PV plantsAssessing the impact of SolarPulse on performance of utility scale PV plants
Assessing the impact of SolarPulse on performance of utility scale PV plants
 
Value of solar remote monitoring and analytics for operational intelligence
Value of solar remote monitoring and analytics for operational  intelligenceValue of solar remote monitoring and analytics for operational  intelligence
Value of solar remote monitoring and analytics for operational intelligence
 
MachinePulse company presentation
MachinePulse company presentationMachinePulse company presentation
MachinePulse company presentation
 
MachinePulse Products
MachinePulse ProductsMachinePulse Products
MachinePulse Products
 
Industrial Internet of Things in Cleantech
Industrial Internet of Things in CleantechIndustrial Internet of Things in Cleantech
Industrial Internet of Things in Cleantech
 
MachinePulse at the Global Solar EPC Summit 2015 - 4th June 2015
MachinePulse at the Global Solar EPC Summit 2015 - 4th June 2015MachinePulse at the Global Solar EPC Summit 2015 - 4th June 2015
MachinePulse at the Global Solar EPC Summit 2015 - 4th June 2015
 
Managing your Assets with Big Data Tools
Managing your Assets with Big Data ToolsManaging your Assets with Big Data Tools
Managing your Assets with Big Data Tools
 
Challenges & Applications in the Industrial Internet of Things (IoT)
Challenges & Applications in the Industrial Internet of Things (IoT)Challenges & Applications in the Industrial Internet of Things (IoT)
Challenges & Applications in the Industrial Internet of Things (IoT)
 
Predictive Analytics - An Overview
Predictive Analytics - An OverviewPredictive Analytics - An Overview
Predictive Analytics - An Overview
 
IoT Cloud architecture
IoT Cloud architectureIoT Cloud architecture
IoT Cloud architecture
 
MachinePulse at the November Open Hardware Meetup, Mumbai 2014
MachinePulse at the November Open Hardware Meetup, Mumbai 2014MachinePulse at the November Open Hardware Meetup, Mumbai 2014
MachinePulse at the November Open Hardware Meetup, Mumbai 2014
 

Recently uploaded

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 

Machine Learning and Real-World Applications

  • 2.  Why Machine Learning?  Supervised Vs Unsupervised learning.  Machine learning methods.  Real world applications. Agenda
  • 4.  Volume of data collected growing day by day.  Data production will be 44 times greater in 2020 than in 2009.  Every day, 2.5 quintillion bytes of data are created, with 90 percent of the world's data created in the past two years.  Very little data will ever be looked at by a human.  Data is cheap and abundant (data warehouses, data marts); knowledge is expensive and scarce.  Knowledge Discovery is NEEDED to make sense and use of data.  Machine Learning is a technique in which computers learn from data to obtain insight and help in knowledge discovery. Why Machine Learning?
  • 5.  Supervised learning – class labels/ target variable known Overview of Machine Learning Methods
  • 6.  Unsupervised learning – no class labels provided, need to detect clusters of similar items in the data. Overview (Contd)
  • 7.  Parametric Models: Assumes prob. distribution for data, and learn parameters from data E.g. Naïve Bayes classifier, linear regression etc.  Non-parametric Models: No fixed number of parameters. E.g. K-NN, histograms etc. Parametric Vs Nonparametric
  • 8.  Generative model – learns model for generating data, given some hidden parameters.  Learns the joint probability distribution p(x,y). e.g. HMM, GMM, Naïve Bayes etc.  Discriminative model – learns dependence of unobserved variable y on observed variable x.  Tries to model the separation between classes.  Learns the conditional probability distribution p(y|x). e.g. Logistic Regression, SVM, Neural networks etc. Generative Vs Discriminative
  • 9.  Classification – Supervised learning.  Commonly used Methods for Classification – Naïve Bayes Decision tree K nearest neighbors Neural Networks Support Vector Machines. Classification
  • 10.  Regression – Predicting an output variable given input variables.  Algorithms used – Ordinary least squares  Partial least squares  Logistic Regression  Stepwise Regression  Support Vector Regression Neural Networks Regression
  • 11.  Clustering: Group data into clusters using similarity measures.  Algorithms: K-means clustering  Density based EM algorithm  Hierarchical clustering.  Spectral Clustering Clustering
  • 12.  Naïve Bayes classifier Assumes conditional independence among features.  P(xi,xj,xk|C) = P(xi|C)P(xj|C)P(xk|C) Naïve Bayes
  • 13.  K-nearest neighbors: Classifies the data point with the class of the k nearest neighbors.  Value of k decided using cross validation K-Nearest Neighbors
  • 14.  Leaves indicate classes.  Non-terminal nodes – decisions on attribute values  Algorithms used for decision tree learning  C4.5  ID3  CART. Decision trees
  • 15.  Artificial Neural Networks Modeled after the human brain  Consists of an input layer, many hidden layers, and an output layer.  Multi-Layer Perceptrons, Radial Basis Functions, Kohonen Networks etc. Neural Networks
  • 16. Validation methods  Cross validation techniques K-fold cross validation Leave one out Cross Validation  ROC curve (for binary classifier)  Confusion Matrix Evaluation of Machine Learning Methods
  • 17. Some real life applications of machine learning:  Recommender systems – suggesting similar people on Facebook/LinkedIn, similar movies/ books etc. on Amazon,  Business applications – Customer segmentation, Customer retention, Targeted Marketing etc.  Medical applications – Disease diagnosis,  Banking – Credit card issue, fraud detection etc.  Language translation, text to speech or vice versa. Real life applications
  • 18. Wisconsin Breast Cancer dataset:  Instances : 569  Features : 32  Class variable : malignant, or benign.  Steps to classify using k-NN  Load data into R  Data normalization  Split into training and test datasets  Train model  Evaluate model performance Breast Cancer Dataset and k-NN