SlideShare a Scribd company logo
1 of 14
S TAT I S T I C A L
L E A R N I N G
V S .
M A C H I N E
L E A R N I N G
Venn diagram on the coverage of machine learning and statistical modeling in the universe
of data science
DEFINITION
• Machine Learning is …
an algorithm that can learn from data without relying on rules-
based programming.
• Statistical Modelling is …
formalization of relationships between variables in the form of
mathematical equations.
A BUSINESS CASE
Let us now see an interesting example published by McKinsey differentiating the two
algorithms :
– Case : Understand the risk level of customers churn over a period of time for a
Telecom company
– Data Available : Two Drivers – A & B
– Next is a graph to understand the difference between a statistical model and a
Machine Learning algorithm.
OBSERVATION
• Statistical model is all about getting a simple formulation of a frontier in a classification
model problem. Here we see a non linear boundary which to some extent separates
risky people from non-risky people. But when we see the contours generated by
Machine Learning algorithm, we witness that statistical modeling is no way comparable
for the problem in hand to the Machine Learning algorithm. The contours of machine
learning seems to capture all patterns beyond any boundaries of linearity or even
continuity of the boundaries. This is what Machine Learning can do for you.
DIFFERENCES BETWEEN MACHINE
LEARNING AND STATISTICAL MODELING
• Given the flavor of difference in output of these two approaches, let us understand the
difference in the two paradigms, even though both do almost similar job :
– Schools they come from
– When did they come into existence?
– Assumptions they work on
– Type of data they deal with
– Nomenclatures of operations and objects
– Techniques used
– Predictive power and human efforts involved to implement
• All the differences mentioned above do separate the two to some extent, but there is
no hard boundary between Machine Learning and statistical modeling.
THEY BELONG TO DIFFERENT SCHOOLS
• Machine Learning is …
– a subfield of computer science and artificial intelligence which deals with building
systems that can learn from data, instead of explicitly programmed instructions.
• Statistical Modelling is …
– a subfield of mathematics which deals with finding relationship between variables
to predict an outcome
THEY CAME UP IN DIFFERENT ERAS
Statistical modeling has been there for centuries now. However, Machine
learning is a very recent development. It came into existence in the 1990s
as steady advances in digitization and cheap computing power enabled
data scientists to stop building finished models and instead train
computers to do so. The unmanageable volume and complexity of the big
data that the world is now swimming in have increased the potential of
machine learning—and the need for it.
EXTENT OF ASSUMPTIONS INVOLVED
• Statistical modeling work on a number of assumption. For instance a linear
regression assumes :
– Linear relation between independent and dependent variable
– Homoscedasticity
– Mean of error at zero for every dependent value
– Independence of observations
– Error should be normally distributed for each value of dependent variable
• Similarly Logistic regressions comes with its own set of assumptions. Even a non
linear model has to comply to a continuous segregation boundary. Machine
Learning algorithms do assume a few of these things but in general are spared
from most of these assumptions. The biggest advantage of using a Machine
Learning algorithm is that there might not be any continuity of boundary as
shown in the case study above. Also, we need not specify the distribution of
dependent or independent variable in a machine learning algorithm.
TYPES OF DATA THEY DEAL WITH
Machine Learning algorithms are wide range tools. Online Learning tools predict data on
the fly. These tools are capable of learning from trillions of observations one by one.
They make prediction and learn simultaneously. Other algorithms like Random Forest
and Gradient Boosting are also exceptionally fast with big data. Machine learning does
really well with wide (high number of attributes) and deep (high number of
observations). However statistical modeling are generally applied for smaller data with
less attributes or they end up over fitting.
Formulation
Even when the end goal for both machine learning and statistical modeling is same, the formulation of two
are significantly different.
In a statistical model, we basically try to estimate the function “f” in
Dependent Variable ( Y ) = f(Independent Variable) + error function
Machine Learning takes away the deterministic function “f” out of the equation. It simply becomes
Output(Y) ----- > Input (X)
It will try to find pockets of X in n dimensions (where n is the number of attributes), where occurrence of Y
is significantly different.
PREDICTIVE POWER AND HUMAN
EFFORT
• Nature does not assume anything before forcing an event to occur.
• So the lesser assumptions in a predictive model, higher will be the
predictive power. Machine Learning as the name suggest needs minimal
human effort. Machine learning works on iterations where computer
tries to find out patterns hidden in data. Because machine does this
work on comprehensive data and is independent of all the assumption,
predictive power is generally very strong for these models. Statistical
model are mathematics intensive and based on coefficient estimation. It
requires the modeler to understand the relation between variable before
putting it in.
END NOTES
It may seem that machine learning and statistical modeling are two
different branches of predictive modeling, they are almost the same. The
difference between these two have gone down significantly over past
decade. Both the branches have learned from each other a lot and will
further come closer in future. I hope we motivated you enough to acquire
skills in each of these two domains and then compare how do they
compliment each other.

More Related Content

What's hot

Machine Learning in Healthcare Diagnostics
Machine Learning in Healthcare DiagnosticsMachine Learning in Healthcare Diagnostics
Machine Learning in Healthcare DiagnosticsLarry Smarr
 
Classification and Regression
Classification and RegressionClassification and Regression
Classification and RegressionMegha Sharma
 
"An Introduction to Machine Learning and How to Teach Machines to See," a Pre...
"An Introduction to Machine Learning and How to Teach Machines to See," a Pre..."An Introduction to Machine Learning and How to Teach Machines to See," a Pre...
"An Introduction to Machine Learning and How to Teach Machines to See," a Pre...Edge AI and Vision Alliance
 
IRJET- Road Accident Prediction using Machine Learning Algorithm
IRJET- Road Accident Prediction using Machine Learning AlgorithmIRJET- Road Accident Prediction using Machine Learning Algorithm
IRJET- Road Accident Prediction using Machine Learning AlgorithmIRJET Journal
 
Linear regression
Linear regressionLinear regression
Linear regressionMartinHogg9
 
Machine learning ppt.
Machine learning ppt.Machine learning ppt.
Machine learning ppt.ASHOK KUMAR
 
Ways to evaluate a machine learning model’s performance
Ways to evaluate a machine learning model’s performanceWays to evaluate a machine learning model’s performance
Ways to evaluate a machine learning model’s performanceMala Deep Upadhaya
 
Classification Based Machine Learning Algorithms
Classification Based Machine Learning AlgorithmsClassification Based Machine Learning Algorithms
Classification Based Machine Learning AlgorithmsMd. Main Uddin Rony
 
Machine learning session4(linear regression)
Machine learning   session4(linear regression)Machine learning   session4(linear regression)
Machine learning session4(linear regression)Abhimanyu Dwivedi
 
Supervised and Unsupervised Machine Learning
Supervised and Unsupervised Machine LearningSupervised and Unsupervised Machine Learning
Supervised and Unsupervised Machine LearningSpotle.ai
 
Machine Learning With Logistic Regression
Machine Learning  With Logistic RegressionMachine Learning  With Logistic Regression
Machine Learning With Logistic RegressionKnoldus Inc.
 
Logistic regression
Logistic regressionLogistic regression
Logistic regressionVARUN KUMAR
 
Machine Learning and Real-World Applications
Machine Learning and Real-World ApplicationsMachine Learning and Real-World Applications
Machine Learning and Real-World ApplicationsMachinePulse
 
Supervised Machine Learning Techniques
Supervised Machine Learning TechniquesSupervised Machine Learning Techniques
Supervised Machine Learning TechniquesTara ram Goyal
 
Linear Regression vs Logistic Regression | Edureka
Linear Regression vs Logistic Regression | EdurekaLinear Regression vs Logistic Regression | Edureka
Linear Regression vs Logistic Regression | EdurekaEdureka!
 
Machine Learning presentation.
Machine Learning presentation.Machine Learning presentation.
Machine Learning presentation.butest
 
Machine Learning
Machine LearningMachine Learning
Machine LearningRahul Kumar
 

What's hot (20)

Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
Machine Learning in Healthcare Diagnostics
Machine Learning in Healthcare DiagnosticsMachine Learning in Healthcare Diagnostics
Machine Learning in Healthcare Diagnostics
 
Classification and Regression
Classification and RegressionClassification and Regression
Classification and Regression
 
"An Introduction to Machine Learning and How to Teach Machines to See," a Pre...
"An Introduction to Machine Learning and How to Teach Machines to See," a Pre..."An Introduction to Machine Learning and How to Teach Machines to See," a Pre...
"An Introduction to Machine Learning and How to Teach Machines to See," a Pre...
 
IRJET- Road Accident Prediction using Machine Learning Algorithm
IRJET- Road Accident Prediction using Machine Learning AlgorithmIRJET- Road Accident Prediction using Machine Learning Algorithm
IRJET- Road Accident Prediction using Machine Learning Algorithm
 
Linear regression
Linear regressionLinear regression
Linear regression
 
Machine learning ppt.
Machine learning ppt.Machine learning ppt.
Machine learning ppt.
 
Ways to evaluate a machine learning model’s performance
Ways to evaluate a machine learning model’s performanceWays to evaluate a machine learning model’s performance
Ways to evaluate a machine learning model’s performance
 
supervised learning
supervised learningsupervised learning
supervised learning
 
Classification Based Machine Learning Algorithms
Classification Based Machine Learning AlgorithmsClassification Based Machine Learning Algorithms
Classification Based Machine Learning Algorithms
 
Machine learning session4(linear regression)
Machine learning   session4(linear regression)Machine learning   session4(linear regression)
Machine learning session4(linear regression)
 
Supervised and Unsupervised Machine Learning
Supervised and Unsupervised Machine LearningSupervised and Unsupervised Machine Learning
Supervised and Unsupervised Machine Learning
 
Machine Learning With Logistic Regression
Machine Learning  With Logistic RegressionMachine Learning  With Logistic Regression
Machine Learning With Logistic Regression
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
Machine Learning and Real-World Applications
Machine Learning and Real-World ApplicationsMachine Learning and Real-World Applications
Machine Learning and Real-World Applications
 
Supervised Machine Learning Techniques
Supervised Machine Learning TechniquesSupervised Machine Learning Techniques
Supervised Machine Learning Techniques
 
SEM
SEMSEM
SEM
 
Linear Regression vs Logistic Regression | Edureka
Linear Regression vs Logistic Regression | EdurekaLinear Regression vs Logistic Regression | Edureka
Linear Regression vs Logistic Regression | Edureka
 
Machine Learning presentation.
Machine Learning presentation.Machine Learning presentation.
Machine Learning presentation.
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 

Similar to Statistical learning vs. Machine Learning

A tour of the top 10 algorithms for machine learning newbies
A tour of the top 10 algorithms for machine learning newbiesA tour of the top 10 algorithms for machine learning newbies
A tour of the top 10 algorithms for machine learning newbiesVimal Gupta
 
Data analytics with python introductory
Data analytics with python introductoryData analytics with python introductory
Data analytics with python introductoryAbhimanyu Dwivedi
 
Machine Learning basics
Machine Learning basicsMachine Learning basics
Machine Learning basicsNeeleEilers
 
MACHINE LEARNING AND ITS APPLICATIONS (2).pptx
MACHINE LEARNING AND ITS APPLICATIONS (2).pptxMACHINE LEARNING AND ITS APPLICATIONS (2).pptx
MACHINE LEARNING AND ITS APPLICATIONS (2).pptxssuser442651
 
The Incredible Disappearing Data Scientist
The Incredible Disappearing Data ScientistThe Incredible Disappearing Data Scientist
The Incredible Disappearing Data ScientistRebecca Bilbro
 
A Comparison of Traditional Simulation and MSAL (6-3-2015)
A Comparison of Traditional Simulation and MSAL (6-3-2015)A Comparison of Traditional Simulation and MSAL (6-3-2015)
A Comparison of Traditional Simulation and MSAL (6-3-2015)Bob Garrett
 
Materi 10 - Penelitian Pemodelan Komputer.pdf
Materi 10 - Penelitian Pemodelan Komputer.pdfMateri 10 - Penelitian Pemodelan Komputer.pdf
Materi 10 - Penelitian Pemodelan Komputer.pdfMahesaRioAditya
 
The Price is Wrong - Quantative Finance
The Price is Wrong - Quantative Finance The Price is Wrong - Quantative Finance
The Price is Wrong - Quantative Finance TerminusDB
 
Mathematical models and algorithms challenges
Mathematical models and algorithms challengesMathematical models and algorithms challenges
Mathematical models and algorithms challengesijctcm
 
Machine learning - session 1
Machine learning - session 1Machine learning - session 1
Machine learning - session 1Luis Borbon
 
Machine learning
Machine learningMachine learning
Machine learningeonx_32
 
notes as .ppt
notes as .pptnotes as .ppt
notes as .pptbutest
 
Statistical Modeling: The Two Cultures
Statistical Modeling: The Two CulturesStatistical Modeling: The Two Cultures
Statistical Modeling: The Two CulturesChristoph Molnar
 
Machine learning interview questions and answers
Machine learning interview questions and answersMachine learning interview questions and answers
Machine learning interview questions and answerskavinilavuG
 

Similar to Statistical learning vs. Machine Learning (20)

A tour of the top 10 algorithms for machine learning newbies
A tour of the top 10 algorithms for machine learning newbiesA tour of the top 10 algorithms for machine learning newbies
A tour of the top 10 algorithms for machine learning newbies
 
Top Machine Learning Algorithms Used By AI Professionals ARTiBA.pdf
Top Machine Learning Algorithms Used By AI Professionals ARTiBA.pdfTop Machine Learning Algorithms Used By AI Professionals ARTiBA.pdf
Top Machine Learning Algorithms Used By AI Professionals ARTiBA.pdf
 
Qt unit i
Qt unit   iQt unit   i
Qt unit i
 
Data analytics with python introductory
Data analytics with python introductoryData analytics with python introductory
Data analytics with python introductory
 
Machine Learning basics
Machine Learning basicsMachine Learning basics
Machine Learning basics
 
Machine Learning.pptx
Machine Learning.pptxMachine Learning.pptx
Machine Learning.pptx
 
MACHINE LEARNING AND ITS APPLICATIONS (2).pptx
MACHINE LEARNING AND ITS APPLICATIONS (2).pptxMACHINE LEARNING AND ITS APPLICATIONS (2).pptx
MACHINE LEARNING AND ITS APPLICATIONS (2).pptx
 
The Incredible Disappearing Data Scientist
The Incredible Disappearing Data ScientistThe Incredible Disappearing Data Scientist
The Incredible Disappearing Data Scientist
 
A Comparison of Traditional Simulation and MSAL (6-3-2015)
A Comparison of Traditional Simulation and MSAL (6-3-2015)A Comparison of Traditional Simulation and MSAL (6-3-2015)
A Comparison of Traditional Simulation and MSAL (6-3-2015)
 
Materi 10 - Penelitian Pemodelan Komputer.pdf
Materi 10 - Penelitian Pemodelan Komputer.pdfMateri 10 - Penelitian Pemodelan Komputer.pdf
Materi 10 - Penelitian Pemodelan Komputer.pdf
 
The Price is Wrong - Quantative Finance
The Price is Wrong - Quantative Finance The Price is Wrong - Quantative Finance
The Price is Wrong - Quantative Finance
 
Mathematical models and algorithms challenges
Mathematical models and algorithms challengesMathematical models and algorithms challenges
Mathematical models and algorithms challenges
 
Machine learning - session 1
Machine learning - session 1Machine learning - session 1
Machine learning - session 1
 
Machine learning
Machine learningMachine learning
Machine learning
 
ML.pdf
ML.pdfML.pdf
ML.pdf
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
notes as .ppt
notes as .pptnotes as .ppt
notes as .ppt
 
Eckovation Machine Learning
Eckovation Machine LearningEckovation Machine Learning
Eckovation Machine Learning
 
Statistical Modeling: The Two Cultures
Statistical Modeling: The Two CulturesStatistical Modeling: The Two Cultures
Statistical Modeling: The Two Cultures
 
Machine learning interview questions and answers
Machine learning interview questions and answersMachine learning interview questions and answers
Machine learning interview questions and answers
 

Recently uploaded

VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一ffjhghh
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 

Recently uploaded (20)

VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 

Statistical learning vs. Machine Learning

  • 1. S TAT I S T I C A L L E A R N I N G V S . M A C H I N E L E A R N I N G
  • 2. Venn diagram on the coverage of machine learning and statistical modeling in the universe of data science
  • 3. DEFINITION • Machine Learning is … an algorithm that can learn from data without relying on rules- based programming. • Statistical Modelling is … formalization of relationships between variables in the form of mathematical equations.
  • 4. A BUSINESS CASE Let us now see an interesting example published by McKinsey differentiating the two algorithms : – Case : Understand the risk level of customers churn over a period of time for a Telecom company – Data Available : Two Drivers – A & B – Next is a graph to understand the difference between a statistical model and a Machine Learning algorithm.
  • 5.
  • 6. OBSERVATION • Statistical model is all about getting a simple formulation of a frontier in a classification model problem. Here we see a non linear boundary which to some extent separates risky people from non-risky people. But when we see the contours generated by Machine Learning algorithm, we witness that statistical modeling is no way comparable for the problem in hand to the Machine Learning algorithm. The contours of machine learning seems to capture all patterns beyond any boundaries of linearity or even continuity of the boundaries. This is what Machine Learning can do for you.
  • 7. DIFFERENCES BETWEEN MACHINE LEARNING AND STATISTICAL MODELING • Given the flavor of difference in output of these two approaches, let us understand the difference in the two paradigms, even though both do almost similar job : – Schools they come from – When did they come into existence? – Assumptions they work on – Type of data they deal with – Nomenclatures of operations and objects – Techniques used – Predictive power and human efforts involved to implement • All the differences mentioned above do separate the two to some extent, but there is no hard boundary between Machine Learning and statistical modeling.
  • 8. THEY BELONG TO DIFFERENT SCHOOLS • Machine Learning is … – a subfield of computer science and artificial intelligence which deals with building systems that can learn from data, instead of explicitly programmed instructions. • Statistical Modelling is … – a subfield of mathematics which deals with finding relationship between variables to predict an outcome
  • 9. THEY CAME UP IN DIFFERENT ERAS Statistical modeling has been there for centuries now. However, Machine learning is a very recent development. It came into existence in the 1990s as steady advances in digitization and cheap computing power enabled data scientists to stop building finished models and instead train computers to do so. The unmanageable volume and complexity of the big data that the world is now swimming in have increased the potential of machine learning—and the need for it.
  • 10. EXTENT OF ASSUMPTIONS INVOLVED • Statistical modeling work on a number of assumption. For instance a linear regression assumes : – Linear relation between independent and dependent variable – Homoscedasticity – Mean of error at zero for every dependent value – Independence of observations – Error should be normally distributed for each value of dependent variable • Similarly Logistic regressions comes with its own set of assumptions. Even a non linear model has to comply to a continuous segregation boundary. Machine Learning algorithms do assume a few of these things but in general are spared from most of these assumptions. The biggest advantage of using a Machine Learning algorithm is that there might not be any continuity of boundary as shown in the case study above. Also, we need not specify the distribution of dependent or independent variable in a machine learning algorithm.
  • 11. TYPES OF DATA THEY DEAL WITH Machine Learning algorithms are wide range tools. Online Learning tools predict data on the fly. These tools are capable of learning from trillions of observations one by one. They make prediction and learn simultaneously. Other algorithms like Random Forest and Gradient Boosting are also exceptionally fast with big data. Machine learning does really well with wide (high number of attributes) and deep (high number of observations). However statistical modeling are generally applied for smaller data with less attributes or they end up over fitting.
  • 12. Formulation Even when the end goal for both machine learning and statistical modeling is same, the formulation of two are significantly different. In a statistical model, we basically try to estimate the function “f” in Dependent Variable ( Y ) = f(Independent Variable) + error function Machine Learning takes away the deterministic function “f” out of the equation. It simply becomes Output(Y) ----- > Input (X) It will try to find pockets of X in n dimensions (where n is the number of attributes), where occurrence of Y is significantly different.
  • 13. PREDICTIVE POWER AND HUMAN EFFORT • Nature does not assume anything before forcing an event to occur. • So the lesser assumptions in a predictive model, higher will be the predictive power. Machine Learning as the name suggest needs minimal human effort. Machine learning works on iterations where computer tries to find out patterns hidden in data. Because machine does this work on comprehensive data and is independent of all the assumption, predictive power is generally very strong for these models. Statistical model are mathematics intensive and based on coefficient estimation. It requires the modeler to understand the relation between variable before putting it in.
  • 14. END NOTES It may seem that machine learning and statistical modeling are two different branches of predictive modeling, they are almost the same. The difference between these two have gone down significantly over past decade. Both the branches have learned from each other a lot and will further come closer in future. I hope we motivated you enough to acquire skills in each of these two domains and then compare how do they compliment each other.