SlideShare a Scribd company logo
Master the Art of Analytics
A Simplistic Explainer Series For Citizen Data Scientists
J o u r n e y To w a r d s A u g m e n t e d A n a l y t i c s
Multinomial Logistic
Regression
Terminologies
Introduction & Example
Standard input/tuning parameters & Sample UI
Sample output UI
Interpretation of Output
Limitations
Business use cases
What Are
All Covered
Terminologies
Terminologies
 Target variable usually denoted by Y , is the variable being predicted and is also
called dependent variable, output variable, response variable or outcome
variable (Ex : One highlighted in red box in table below)
 Predictor, sometimes called an independent variable, is a variable that is being
used to predict the target variable ( Ex : variables highlighted in green box in
table below )
Age Marital Status Gender
Satisfaction
level
58 married Female High
44 single Female Low
33 married Male Medium
47 married Female High
33 single Female Medium
35 married Male High
28 single Male Low
Introduction & Example
Introduction
• OBJECTIVE :
• Logistic regression measures the relationship
between the categorical target variable and one or
more independent variables
• It deals with situations in which the outcome for a
target variable can have two or more possible types
• Thus , logistic regression makes use of one or more
predictor variables that may be either continuous or
categorical to predict the target variable classes
• BENEFIT:
• Logistic regression model output helps identify
important factors ( Xi ) impacting the target variable
(Y) and also the nature of relationship between each
of these factors and dependent variable
Example : Multinomial Logistic Regression :
Input Let’s conduct the Multinomial Logistic Regression analysis on following variables :
Job satisfaction
level
Age Marital Status Gender Income
Low 58 married Male 46,399
Medium 44 single Male 47,971
Low 33 married Female 52,618
High 47 married Male 28,717
Medium 33 single Female 41,216
Medium 35 married Female 34,372
Low 28 single Male 64,811
Medium 42 divorced Female 53,000
High 58 married Female 41,375
Low 43 single Male 53,778
Low 41 divorced Male 44,440
Medium 29 single Female 51,026
Independent variables (Xi)Target Variable (Y)
Example : Multinomial Logistic Regression :
Output 1 Coefficient P value
High
Age 1.54 0.05
Income -0.34 0.03
Male 0.67 0.02
Low
Age -2.34 0.05
Income 0.56 0.01
Male -1.23 0.04
Coefficients
• High satisfaction with reference to medium satisfaction:
Age - Multinomial logit (Natural log of the proportion of High to that of Medium here)
estimate for 1 year increase in age for high job satisfaction relative to medium job satisfaction
when other independent variables are held constant = 1.54
Male - Multinomial logit estimate for comparing male to females for high job
satisfaction relative to medium job satisfaction when other variables are held constant = 0.67
Interpretation
Example : Multinomial Logistic Regression :
Output 2
Classification Accuracy : (50+ 10 + 70) / (50+ 10 + 70+ 4+4+5+4+6+7) = 81%
• The prediction accuracy is useful criterion for assessing the model performance
• Model with prediction accuracy >= 70% is useful
Classification Error = 100- Accuracy = 19%
There is 19% chance of error in classification
Low Medium High
Low 50 4 4
Medium 4 70 5
High 6 7 10
Actual versus predicted
Predicted
Actual
Standard input/tuning
parameters & Sample
UI
STANDARD
INPUT
PARAMETERS
& SAMPLE UI
SAMPLE OUTPUT UI
Sample output 1 : Model Summary
Actual versus predicted
Predicted
Actual
Coefficient matrix :
Low Medium High
Low 50 4 4
Medium 4 70 5
High 6 7 10
Coefficient P value
High
Age 1.54 0.05
Income -0.34 0.03
Male 0.67 0.02
Low
Age -2.34 0.05
Income 0.56 0.01
Male -1.23 0.04
Age Marital Status Gender Income
Job satisfaction
level
Predicted
class
Probability
58 married Female 46,399 Low Low 0.7
44 single Female 47,971 High High 0.9
33 married Male 52,618 Low Low 0.8
47 married Female 28,717 Low High 0.7
33 single Male 41,216 High Low 0.6
35 married Male 34,372 High High 0.5
28 single Female 64,811 Low Low 0.4
42 divorced Male 53,000 Low Low 0.3
58 married Female 41,375 High Low 0.2
43 single Male 53,778 High High 0.1
Sample output 2 : Predicted class &
probability
Sample Output 3 : Classification Plot
• Lesser the overlap among three classes in the plot above , better the classification done
by model
• Thus, output will contain predicted class column, confusion matrix and classification plot
Interpretation of
Output
Interpretation of Important Model Summary
Statistics
Accuracy:
 If Accuracy >= 70% : Model is well fit on
provided data and predicted classes are
reasonably accurate
 If Accuracy < 70% : Model is not well fit on
provided data and predicted classes are
likely to contain high chances of error
Coefficients and p value :
 If value of coefficient is positive and p value
<0.05 , variable is positively correlated with target
variable
 If value of coefficient is negative and p value
<0.05 , variable is negatively correlated with
target variable
 If p value > 0.05, variable is unimportant in terms
of predicting target variable classes
Limitations
Limitations
 It is applicable only when target variable is categorical
 Sample size must be at least 1000 in order to get reliable predictions
 Level 1 of the target variable should represent the desired outcome.
 i.e. if desired class is yes in response/non response target variable
then Yes has to be recoded into 1 and No into 0
Business Use Cases
Use case 1
Business benefit:
• By having a knowledge of probable
election outcome, proper strategy
can be put in place in case of
discrepancies between
expectations and predictions and
the segments with high likelihood
of voting oppositions can be
targeted in better and effective
manner in order to get their votes
in favor of a client party
Business problem :
• A research agency wants to predict
the likelihood of each election
candidate being voted by each
voter and in turn devise a strategy
to take proactive steps
• Here the target variable would be
‘preferred party name’ and
predictors would be customer
demographics such as age, income,
qualification, occupation, gender,
religion and past voting status etc.
Use case 1 : Sample Input Dataset
Responder
ID
Qualification income Age Gender Occupation
Done voting in
past
Preferred party
1039153 Bcom 105000 18 M Accountant yes ABC
1069697 12th
Pass 192000 20 F
Office
Supervisor
No XYZ
1068120 BSC 310000 30 F Pathologist yes PQL
563175 10th
Pass 100000 45 M Labour yes XYZ
562842 ME 357228 25 M
Software
Developer
No PQL
562681 MSC 413000 28 F Statistician yes XYZ
562404 BSC Nill 34 F Home maker No PQL
Use case 1 : Output : Predicted Class
Output : Each record will have a predicted class along with probability assigned as
shown below :
Respond
er ID
Qualificatio
n
Income Age Gender Occupation
Done
voting in
past
Predicted
party
Probability
1039153 Bcom 105000 18 M Accountant yes ABC 0.7
1069697 12th
Pass 192000 20 F
Office
Supervisor
No XYZ 0.9
1068120 BSC 310000 30 F Pathologist yes PQL 0.8
563175 10th
Pass 100000 45 M Labour yes XYZ 0.7
562842 ME 357228 25 M
Software
Developer
No PQL 0.6
562681 MSC 413000 28 F Statistician yes XYZ 0.5
562404 BSC Nill 34 F
Home
maker
No PQL 0.4
Use case 1 : Output : Sample Class profile
Predicted
Party
Average
Annual
income
Average
Age
ABC 86,467 30
XYZ 60,935 25
PQL 1,05,400 35
• As can be seen in the table above, there is distinctive characteristics of population associated with each preferred
party :
• For instance, females are inclined towards XYZ whereas males tend to prefer ABC
• Responders with high income and age prefer to vote for PQL whereas XYZ party is preferred by lowest income and
age group
• Fresh voters are likely to vote for party XYZ whereas those who have done voting in past are inclined towards ABC
party
Gender
Predicted
Party
Male Female
ABC 60 4
XYZ 10 78
PQL 14 15
Past voting status
Predicted
Party
Yes No
ABC 58 6
XYZ 15 73
PQL 11 19
Use case 2
Business benefit:
•Given the body profile of a patient
and predicted level of disease , right
cure/medications can be suggested to
a patient
Business problem :
•A doctor/ pharmacist wants to predict
the likelihood of a new patient’s
disease being at
initial/intermediate/severe stage
based on various body attributes of a
patient such as blood pressure ,
hemoglobin level, sugar level , red
blood counts, TSH etc.
•Here the target variable would be level
of disease and would contain values
‘Initial, Intermediate and Severe’
Want to Learn
More?
Get in touch with us @
support@Smarten.com
And Do Checkout the Learning section
on
Smarten.com
June 2018

More Related Content

What's hot

Multinomial Logistic Regression
Multinomial Logistic RegressionMultinomial Logistic Regression
Multinomial Logistic Regression
Dr Athar Khan
 
Multinomial logisticregression basicrelationships
Multinomial logisticregression basicrelationshipsMultinomial logisticregression basicrelationships
Multinomial logisticregression basicrelationships
Anirudha si
 
Variable Selection Methods
Variable Selection MethodsVariable Selection Methods
Variable Selection Methodsjoycemi_la
 
Chi square Test Using SPSS
Chi square Test Using SPSSChi square Test Using SPSS
Chi square Test Using SPSS
Dr Athar Khan
 
ders 7.1 VAR.pptx
ders 7.1 VAR.pptxders 7.1 VAR.pptx
ders 7.1 VAR.pptx
Ergin Akalpler
 
Chi square tests using SPSS
Chi square tests using SPSSChi square tests using SPSS
Chi square tests using SPSS
Parag Shah
 
Presentation On Regression
Presentation On RegressionPresentation On Regression
Presentation On Regression
alok tiwari
 
Correlation & Regression Analysis using SPSS
Correlation & Regression Analysis  using SPSSCorrelation & Regression Analysis  using SPSS
Correlation & Regression Analysis using SPSS
Parag Shah
 
Statistics Case Study - Stepwise Multiple Regression
Statistics Case Study - Stepwise Multiple RegressionStatistics Case Study - Stepwise Multiple Regression
Statistics Case Study - Stepwise Multiple RegressionSharad Srivastava
 
03.data presentation(2015) 2
03.data presentation(2015) 203.data presentation(2015) 2
03.data presentation(2015) 2
Mmedsc Hahm
 
Levels of measurement
Levels of measurementLevels of measurement
Levels of measurement
punjab agricultural university
 
4.5. logistic regression
4.5. logistic regression4.5. logistic regression
4.5. logistic regression
A M
 
Statistical analysis using spss
Statistical analysis using spssStatistical analysis using spss
Statistical analysis using spssjpcagphil
 
Exploratory Data Analysis
Exploratory Data AnalysisExploratory Data Analysis
Exploratory Data Analysis
Umair Shafique
 
Data Science - Part IV - Regression Analysis & ANOVA
Data Science - Part IV - Regression Analysis & ANOVAData Science - Part IV - Regression Analysis & ANOVA
Data Science - Part IV - Regression Analysis & ANOVA
Derek Kane
 
Introduction to correlation and regression analysis
Introduction to correlation and regression analysisIntroduction to correlation and regression analysis
Introduction to correlation and regression analysis
Farzad Javidanrad
 
Quantitative analysis using SPSS
Quantitative analysis using SPSSQuantitative analysis using SPSS
Quantitative analysis using SPSSAlaa Sadik
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
zcreichenbach
 

What's hot (20)

Multinomial Logistic Regression
Multinomial Logistic RegressionMultinomial Logistic Regression
Multinomial Logistic Regression
 
Multinomial logisticregression basicrelationships
Multinomial logisticregression basicrelationshipsMultinomial logisticregression basicrelationships
Multinomial logisticregression basicrelationships
 
Variable Selection Methods
Variable Selection MethodsVariable Selection Methods
Variable Selection Methods
 
Chi square Test Using SPSS
Chi square Test Using SPSSChi square Test Using SPSS
Chi square Test Using SPSS
 
ders 7.1 VAR.pptx
ders 7.1 VAR.pptxders 7.1 VAR.pptx
ders 7.1 VAR.pptx
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
Chi square tests using SPSS
Chi square tests using SPSSChi square tests using SPSS
Chi square tests using SPSS
 
Presentation On Regression
Presentation On RegressionPresentation On Regression
Presentation On Regression
 
Correlation & Regression Analysis using SPSS
Correlation & Regression Analysis  using SPSSCorrelation & Regression Analysis  using SPSS
Correlation & Regression Analysis using SPSS
 
Statistics Case Study - Stepwise Multiple Regression
Statistics Case Study - Stepwise Multiple RegressionStatistics Case Study - Stepwise Multiple Regression
Statistics Case Study - Stepwise Multiple Regression
 
03.data presentation(2015) 2
03.data presentation(2015) 203.data presentation(2015) 2
03.data presentation(2015) 2
 
Levels of measurement
Levels of measurementLevels of measurement
Levels of measurement
 
4.5. logistic regression
4.5. logistic regression4.5. logistic regression
4.5. logistic regression
 
Statistical analysis using spss
Statistical analysis using spssStatistical analysis using spss
Statistical analysis using spss
 
Correlation analysis
Correlation analysis Correlation analysis
Correlation analysis
 
Exploratory Data Analysis
Exploratory Data AnalysisExploratory Data Analysis
Exploratory Data Analysis
 
Data Science - Part IV - Regression Analysis & ANOVA
Data Science - Part IV - Regression Analysis & ANOVAData Science - Part IV - Regression Analysis & ANOVA
Data Science - Part IV - Regression Analysis & ANOVA
 
Introduction to correlation and regression analysis
Introduction to correlation and regression analysisIntroduction to correlation and regression analysis
Introduction to correlation and regression analysis
 
Quantitative analysis using SPSS
Quantitative analysis using SPSSQuantitative analysis using SPSS
Quantitative analysis using SPSS
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 

Similar to What is the Multinomial-Logistic Regression Classification Algorithm and How Does One Use it for Analysis?

What is Binary Logistic Regression Classification and How is it Used in Analy...
What is Binary Logistic Regression Classification and How is it Used in Analy...What is Binary Logistic Regression Classification and How is it Used in Analy...
What is Binary Logistic Regression Classification and How is it Used in Analy...
Smarten Augmented Analytics
 
What is SVM Classification Analysis and How Can It Benefit Business Analytics?
What is SVM Classification Analysis and How Can It Benefit Business Analytics?What is SVM Classification Analysis and How Can It Benefit Business Analytics?
What is SVM Classification Analysis and How Can It Benefit Business Analytics?
Smarten Augmented Analytics
 
Hypothesis testng
Hypothesis testngHypothesis testng
Hypothesis testng
Omar (TUBBS 128) Ventura VII
 
Final Exam Due Friday, Week EightInstructions  Each response is.docx
Final Exam Due Friday, Week EightInstructions  Each response is.docxFinal Exam Due Friday, Week EightInstructions  Each response is.docx
Final Exam Due Friday, Week EightInstructions  Each response is.docx
mydrynan
 
wk_1ScoreWeek 1.Measurement and Description - chapters 1 and 21 .docx
wk_1ScoreWeek 1.Measurement and Description - chapters 1 and 21 .docxwk_1ScoreWeek 1.Measurement and Description - chapters 1 and 21 .docx
wk_1ScoreWeek 1.Measurement and Description - chapters 1 and 21 .docx
ericbrooks84875
 
What is the Chi Square Test of Association and How Can it be Used for Analysis?
What is the Chi Square Test of Association and How Can it be Used for Analysis?What is the Chi Square Test of Association and How Can it be Used for Analysis?
What is the Chi Square Test of Association and How Can it be Used for Analysis?
Smarten Augmented Analytics
 
Tanmay Gupta _ 202109121 _ SIP PPT (1).pptx
Tanmay Gupta _ 202109121 _ SIP PPT (1).pptxTanmay Gupta _ 202109121 _ SIP PPT (1).pptx
Tanmay Gupta _ 202109121 _ SIP PPT (1).pptx
tanmaygupta80898
 
Statistics Mathematics B.sc -week-2.pptx
Statistics  Mathematics B.sc -week-2.pptxStatistics  Mathematics B.sc -week-2.pptx
Statistics Mathematics B.sc -week-2.pptx
ZiaOul
 
Statistics Mathematics B.sc -week-2.pptx
Statistics  Mathematics B.sc -week-2.pptxStatistics  Mathematics B.sc -week-2.pptx
Statistics Mathematics B.sc -week-2.pptx
ZiaOul
 
What is the Independent Samples T Test Method of Analysis and How Can it Bene...
What is the Independent Samples T Test Method of Analysis and How Can it Bene...What is the Independent Samples T Test Method of Analysis and How Can it Bene...
What is the Independent Samples T Test Method of Analysis and How Can it Bene...
Smarten Augmented Analytics
 
Six Sigma Black Belt Training Material Notes 5
Six Sigma Black Belt Training Material Notes 5Six Sigma Black Belt Training Material Notes 5
Six Sigma Black Belt Training Material Notes 5
Skillogic Solutions
 
Final report mkt
Final report mktFinal report mkt
Final report mkt
Charan Singh
 
Part 1.1. Education-selection Application formEducation que.docx
Part 1.1. Education-selection Application formEducation que.docxPart 1.1. Education-selection Application formEducation que.docx
Part 1.1. Education-selection Application formEducation que.docx
danhaley45372
 
Organizational culture project presentation using SPSS analysis
Organizational culture project presentation using SPSS analysisOrganizational culture project presentation using SPSS analysis
Organizational culture project presentation using SPSS analysis
Jim George Kurian
 
Inferential Statistics
Inferential StatisticsInferential Statistics
Inferential StatisticsKate Organ
 
Some study materials
Some study materialsSome study materials
Some study materials
SatishH5
 
A Study of Factors Influencing Selection of Management Colleges in India - A ...
A Study of Factors Influencing Selection of Management Colleges in India - A ...A Study of Factors Influencing Selection of Management Colleges in India - A ...
A Study of Factors Influencing Selection of Management Colleges in India - A ...Ravi Shankar
 
Research Methods in Marketing
Research Methods in MarketingResearch Methods in Marketing
Research Methods in Marketing
Vartika Kundu
 
Question 1 of 201.0 PointsA sample of 20 observations has a st.docx
Question 1 of 201.0 PointsA sample of 20 observations has a st.docxQuestion 1 of 201.0 PointsA sample of 20 observations has a st.docx
Question 1 of 201.0 PointsA sample of 20 observations has a st.docx
hildredzr1di
 
The analysis of the data has been done using excel statistical sof.docx
The analysis of the data has been done using excel statistical sof.docxThe analysis of the data has been done using excel statistical sof.docx
The analysis of the data has been done using excel statistical sof.docx
mattinsonjanel
 

Similar to What is the Multinomial-Logistic Regression Classification Algorithm and How Does One Use it for Analysis? (20)

What is Binary Logistic Regression Classification and How is it Used in Analy...
What is Binary Logistic Regression Classification and How is it Used in Analy...What is Binary Logistic Regression Classification and How is it Used in Analy...
What is Binary Logistic Regression Classification and How is it Used in Analy...
 
What is SVM Classification Analysis and How Can It Benefit Business Analytics?
What is SVM Classification Analysis and How Can It Benefit Business Analytics?What is SVM Classification Analysis and How Can It Benefit Business Analytics?
What is SVM Classification Analysis and How Can It Benefit Business Analytics?
 
Hypothesis testng
Hypothesis testngHypothesis testng
Hypothesis testng
 
Final Exam Due Friday, Week EightInstructions  Each response is.docx
Final Exam Due Friday, Week EightInstructions  Each response is.docxFinal Exam Due Friday, Week EightInstructions  Each response is.docx
Final Exam Due Friday, Week EightInstructions  Each response is.docx
 
wk_1ScoreWeek 1.Measurement and Description - chapters 1 and 21 .docx
wk_1ScoreWeek 1.Measurement and Description - chapters 1 and 21 .docxwk_1ScoreWeek 1.Measurement and Description - chapters 1 and 21 .docx
wk_1ScoreWeek 1.Measurement and Description - chapters 1 and 21 .docx
 
What is the Chi Square Test of Association and How Can it be Used for Analysis?
What is the Chi Square Test of Association and How Can it be Used for Analysis?What is the Chi Square Test of Association and How Can it be Used for Analysis?
What is the Chi Square Test of Association and How Can it be Used for Analysis?
 
Tanmay Gupta _ 202109121 _ SIP PPT (1).pptx
Tanmay Gupta _ 202109121 _ SIP PPT (1).pptxTanmay Gupta _ 202109121 _ SIP PPT (1).pptx
Tanmay Gupta _ 202109121 _ SIP PPT (1).pptx
 
Statistics Mathematics B.sc -week-2.pptx
Statistics  Mathematics B.sc -week-2.pptxStatistics  Mathematics B.sc -week-2.pptx
Statistics Mathematics B.sc -week-2.pptx
 
Statistics Mathematics B.sc -week-2.pptx
Statistics  Mathematics B.sc -week-2.pptxStatistics  Mathematics B.sc -week-2.pptx
Statistics Mathematics B.sc -week-2.pptx
 
What is the Independent Samples T Test Method of Analysis and How Can it Bene...
What is the Independent Samples T Test Method of Analysis and How Can it Bene...What is the Independent Samples T Test Method of Analysis and How Can it Bene...
What is the Independent Samples T Test Method of Analysis and How Can it Bene...
 
Six Sigma Black Belt Training Material Notes 5
Six Sigma Black Belt Training Material Notes 5Six Sigma Black Belt Training Material Notes 5
Six Sigma Black Belt Training Material Notes 5
 
Final report mkt
Final report mktFinal report mkt
Final report mkt
 
Part 1.1. Education-selection Application formEducation que.docx
Part 1.1. Education-selection Application formEducation que.docxPart 1.1. Education-selection Application formEducation que.docx
Part 1.1. Education-selection Application formEducation que.docx
 
Organizational culture project presentation using SPSS analysis
Organizational culture project presentation using SPSS analysisOrganizational culture project presentation using SPSS analysis
Organizational culture project presentation using SPSS analysis
 
Inferential Statistics
Inferential StatisticsInferential Statistics
Inferential Statistics
 
Some study materials
Some study materialsSome study materials
Some study materials
 
A Study of Factors Influencing Selection of Management Colleges in India - A ...
A Study of Factors Influencing Selection of Management Colleges in India - A ...A Study of Factors Influencing Selection of Management Colleges in India - A ...
A Study of Factors Influencing Selection of Management Colleges in India - A ...
 
Research Methods in Marketing
Research Methods in MarketingResearch Methods in Marketing
Research Methods in Marketing
 
Question 1 of 201.0 PointsA sample of 20 observations has a st.docx
Question 1 of 201.0 PointsA sample of 20 observations has a st.docxQuestion 1 of 201.0 PointsA sample of 20 observations has a st.docx
Question 1 of 201.0 PointsA sample of 20 observations has a st.docx
 
The analysis of the data has been done using excel statistical sof.docx
The analysis of the data has been done using excel statistical sof.docxThe analysis of the data has been done using excel statistical sof.docx
The analysis of the data has been done using excel statistical sof.docx
 

More from Smarten Augmented Analytics

Crime Type Prediction - Augmented Analytics Use Case – Smarten
Crime Type Prediction - Augmented Analytics Use Case – SmartenCrime Type Prediction - Augmented Analytics Use Case – Smarten
Crime Type Prediction - Augmented Analytics Use Case – Smarten
Smarten Augmented Analytics
 
What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...
What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...
What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...
Smarten Augmented Analytics
 
What Is Generalized Linear Regression with Gaussian Distribution And How Can ...
What Is Generalized Linear Regression with Gaussian Distribution And How Can ...What Is Generalized Linear Regression with Gaussian Distribution And How Can ...
What Is Generalized Linear Regression with Gaussian Distribution And How Can ...
Smarten Augmented Analytics
 
What Is Random Forest Classification And How Can It Help Your Business?
What Is Random Forest Classification And How Can It Help Your Business?What Is Random Forest Classification And How Can It Help Your Business?
What Is Random Forest Classification And How Can It Help Your Business?
Smarten Augmented Analytics
 
What is Isotonic Regression and How Can a Business Utilize it to Analyze Data?
What is Isotonic Regression and How Can a Business Utilize it to Analyze Data?What is Isotonic Regression and How Can a Business Utilize it to Analyze Data?
What is Isotonic Regression and How Can a Business Utilize it to Analyze Data?
Smarten Augmented Analytics
 
Students' Academic Performance Predictive Analytics Use Case – Smarten
Students' Academic Performance Predictive Analytics Use Case – SmartenStudents' Academic Performance Predictive Analytics Use Case – Smarten
Students' Academic Performance Predictive Analytics Use Case – Smarten
Smarten Augmented Analytics
 
Random Forest Regression Analysis Reveals Impact of Variables on Target Values
Random Forest Regression Analysis Reveals Impact of Variables on Target Values  Random Forest Regression Analysis Reveals Impact of Variables on Target Values
Random Forest Regression Analysis Reveals Impact of Variables on Target Values
Smarten Augmented Analytics
 
Gradient Boosting Regression Analysis Reveals Dependent Variables and Interre...
Gradient Boosting Regression Analysis Reveals Dependent Variables and Interre...Gradient Boosting Regression Analysis Reveals Dependent Variables and Interre...
Gradient Boosting Regression Analysis Reveals Dependent Variables and Interre...
Smarten Augmented Analytics
 
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
Smarten Augmented Analytics
 
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
Smarten Augmented Analytics
 
Fraud Mitigation Predictive Analytics Use Case – Smarten
Fraud Mitigation Predictive Analytics Use Case – SmartenFraud Mitigation Predictive Analytics Use Case – Smarten
Fraud Mitigation Predictive Analytics Use Case – Smarten
Smarten Augmented Analytics
 
Quality Control Predictive Analytics Use Case - Smarten
Quality Control Predictive Analytics Use Case - SmartenQuality Control Predictive Analytics Use Case - Smarten
Quality Control Predictive Analytics Use Case - Smarten
Smarten Augmented Analytics
 
Machine Maintenance Management Predictive Analytics Use Case - Smarten
Machine Maintenance Management Predictive Analytics Use Case - SmartenMachine Maintenance Management Predictive Analytics Use Case - Smarten
Machine Maintenance Management Predictive Analytics Use Case - Smarten
Smarten Augmented Analytics
 
Predictive Analytics Using External Data Augmented Analytics Use Case - Smarten
Predictive Analytics Using External Data Augmented Analytics Use Case - SmartenPredictive Analytics Using External Data Augmented Analytics Use Case - Smarten
Predictive Analytics Using External Data Augmented Analytics Use Case - Smarten
Smarten Augmented Analytics
 
Marketing Optimization Augmented Analytics Use Cases - Smarten
Marketing Optimization Augmented Analytics Use Cases - SmartenMarketing Optimization Augmented Analytics Use Cases - Smarten
Marketing Optimization Augmented Analytics Use Cases - Smarten
Smarten Augmented Analytics
 
Human Resource Attrition Augmented Analytics Use Case - Smarten
Human Resource Attrition Augmented Analytics Use Case - SmartenHuman Resource Attrition Augmented Analytics Use Case - Smarten
Human Resource Attrition Augmented Analytics Use Case - Smarten
Smarten Augmented Analytics
 
Customer Targeting Augmented Analytics Use Case - Smarten
Customer Targeting Augmented Analytics Use Case - SmartenCustomer Targeting Augmented Analytics Use Case - Smarten
Customer Targeting Augmented Analytics Use Case - Smarten
Smarten Augmented Analytics
 
What is Naïve Bayes Classification and How is it Used for Enterprise Analysis?
What is Naïve Bayes Classification and How is it Used for Enterprise Analysis?What is Naïve Bayes Classification and How is it Used for Enterprise Analysis?
What is Naïve Bayes Classification and How is it Used for Enterprise Analysis?
Smarten Augmented Analytics
 
What is KNN Classification and How Can This Analysis Help an Enterprise?
What is KNN Classification and How Can This Analysis Help an Enterprise?What is KNN Classification and How Can This Analysis Help an Enterprise?
What is KNN Classification and How Can This Analysis Help an Enterprise?
Smarten Augmented Analytics
 
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
Smarten Augmented Analytics
 

More from Smarten Augmented Analytics (20)

Crime Type Prediction - Augmented Analytics Use Case – Smarten
Crime Type Prediction - Augmented Analytics Use Case – SmartenCrime Type Prediction - Augmented Analytics Use Case – Smarten
Crime Type Prediction - Augmented Analytics Use Case – Smarten
 
What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...
What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...
What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...
 
What Is Generalized Linear Regression with Gaussian Distribution And How Can ...
What Is Generalized Linear Regression with Gaussian Distribution And How Can ...What Is Generalized Linear Regression with Gaussian Distribution And How Can ...
What Is Generalized Linear Regression with Gaussian Distribution And How Can ...
 
What Is Random Forest Classification And How Can It Help Your Business?
What Is Random Forest Classification And How Can It Help Your Business?What Is Random Forest Classification And How Can It Help Your Business?
What Is Random Forest Classification And How Can It Help Your Business?
 
What is Isotonic Regression and How Can a Business Utilize it to Analyze Data?
What is Isotonic Regression and How Can a Business Utilize it to Analyze Data?What is Isotonic Regression and How Can a Business Utilize it to Analyze Data?
What is Isotonic Regression and How Can a Business Utilize it to Analyze Data?
 
Students' Academic Performance Predictive Analytics Use Case – Smarten
Students' Academic Performance Predictive Analytics Use Case – SmartenStudents' Academic Performance Predictive Analytics Use Case – Smarten
Students' Academic Performance Predictive Analytics Use Case – Smarten
 
Random Forest Regression Analysis Reveals Impact of Variables on Target Values
Random Forest Regression Analysis Reveals Impact of Variables on Target Values  Random Forest Regression Analysis Reveals Impact of Variables on Target Values
Random Forest Regression Analysis Reveals Impact of Variables on Target Values
 
Gradient Boosting Regression Analysis Reveals Dependent Variables and Interre...
Gradient Boosting Regression Analysis Reveals Dependent Variables and Interre...Gradient Boosting Regression Analysis Reveals Dependent Variables and Interre...
Gradient Boosting Regression Analysis Reveals Dependent Variables and Interre...
 
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
 
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
 
Fraud Mitigation Predictive Analytics Use Case – Smarten
Fraud Mitigation Predictive Analytics Use Case – SmartenFraud Mitigation Predictive Analytics Use Case – Smarten
Fraud Mitigation Predictive Analytics Use Case – Smarten
 
Quality Control Predictive Analytics Use Case - Smarten
Quality Control Predictive Analytics Use Case - SmartenQuality Control Predictive Analytics Use Case - Smarten
Quality Control Predictive Analytics Use Case - Smarten
 
Machine Maintenance Management Predictive Analytics Use Case - Smarten
Machine Maintenance Management Predictive Analytics Use Case - SmartenMachine Maintenance Management Predictive Analytics Use Case - Smarten
Machine Maintenance Management Predictive Analytics Use Case - Smarten
 
Predictive Analytics Using External Data Augmented Analytics Use Case - Smarten
Predictive Analytics Using External Data Augmented Analytics Use Case - SmartenPredictive Analytics Using External Data Augmented Analytics Use Case - Smarten
Predictive Analytics Using External Data Augmented Analytics Use Case - Smarten
 
Marketing Optimization Augmented Analytics Use Cases - Smarten
Marketing Optimization Augmented Analytics Use Cases - SmartenMarketing Optimization Augmented Analytics Use Cases - Smarten
Marketing Optimization Augmented Analytics Use Cases - Smarten
 
Human Resource Attrition Augmented Analytics Use Case - Smarten
Human Resource Attrition Augmented Analytics Use Case - SmartenHuman Resource Attrition Augmented Analytics Use Case - Smarten
Human Resource Attrition Augmented Analytics Use Case - Smarten
 
Customer Targeting Augmented Analytics Use Case - Smarten
Customer Targeting Augmented Analytics Use Case - SmartenCustomer Targeting Augmented Analytics Use Case - Smarten
Customer Targeting Augmented Analytics Use Case - Smarten
 
What is Naïve Bayes Classification and How is it Used for Enterprise Analysis?
What is Naïve Bayes Classification and How is it Used for Enterprise Analysis?What is Naïve Bayes Classification and How is it Used for Enterprise Analysis?
What is Naïve Bayes Classification and How is it Used for Enterprise Analysis?
 
What is KNN Classification and How Can This Analysis Help an Enterprise?
What is KNN Classification and How Can This Analysis Help an Enterprise?What is KNN Classification and How Can This Analysis Help an Enterprise?
What is KNN Classification and How Can This Analysis Help an Enterprise?
 
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
 

Recently uploaded

AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI AppAI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
Google
 
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing SuiteAI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
Google
 
Artificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension FunctionsArtificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension Functions
Octavian Nadolu
 
Vitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdfVitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke
 
Launch Your Streaming Platforms in Minutes
Launch Your Streaming Platforms in MinutesLaunch Your Streaming Platforms in Minutes
Launch Your Streaming Platforms in Minutes
Roshan Dwivedi
 
Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"
Donna Lenk
 
Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Need for Speed: Removing speed bumps from your Symfony projects ⚡️Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Łukasz Chruściel
 
Using Xen Hypervisor for Functional Safety
Using Xen Hypervisor for Functional SafetyUsing Xen Hypervisor for Functional Safety
Using Xen Hypervisor for Functional Safety
Ayan Halder
 
Mobile App Development Company In Noida | Drona Infotech
Mobile App Development Company In Noida | Drona InfotechMobile App Development Company In Noida | Drona Infotech
Mobile App Development Company In Noida | Drona Infotech
Drona Infotech
 
Atelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissancesAtelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissances
Neo4j
 
Fundamentals of Programming and Language Processors
Fundamentals of Programming and Language ProcessorsFundamentals of Programming and Language Processors
Fundamentals of Programming and Language Processors
Rakesh Kumar R
 
Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024
Paco van Beckhoven
 
Quarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden ExtensionsQuarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden Extensions
Max Andersen
 
Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604
Fermin Galan
 
A Sighting of filterA in Typelevel Rite of Passage
A Sighting of filterA in Typelevel Rite of PassageA Sighting of filterA in Typelevel Rite of Passage
A Sighting of filterA in Typelevel Rite of Passage
Philip Schwarz
 
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdf
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdfAutomated software refactoring with OpenRewrite and Generative AI.pptx.pdf
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdf
timtebeek1
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)
Boni García
 
Transform Your Communication with Cloud-Based IVR Solutions
Transform Your Communication with Cloud-Based IVR SolutionsTransform Your Communication with Cloud-Based IVR Solutions
Transform Your Communication with Cloud-Based IVR Solutions
TheSMSPoint
 
A Study of Variable-Role-based Feature Enrichment in Neural Models of Code
A Study of Variable-Role-based Feature Enrichment in Neural Models of CodeA Study of Variable-Role-based Feature Enrichment in Neural Models of Code
A Study of Variable-Role-based Feature Enrichment in Neural Models of Code
Aftab Hussain
 

Recently uploaded (20)

AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI AppAI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
 
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing SuiteAI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
 
Artificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension FunctionsArtificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension Functions
 
Vitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdfVitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdf
 
Launch Your Streaming Platforms in Minutes
Launch Your Streaming Platforms in MinutesLaunch Your Streaming Platforms in Minutes
Launch Your Streaming Platforms in Minutes
 
Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"
 
Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Need for Speed: Removing speed bumps from your Symfony projects ⚡️Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Need for Speed: Removing speed bumps from your Symfony projects ⚡️
 
Using Xen Hypervisor for Functional Safety
Using Xen Hypervisor for Functional SafetyUsing Xen Hypervisor for Functional Safety
Using Xen Hypervisor for Functional Safety
 
Mobile App Development Company In Noida | Drona Infotech
Mobile App Development Company In Noida | Drona InfotechMobile App Development Company In Noida | Drona Infotech
Mobile App Development Company In Noida | Drona Infotech
 
Atelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissancesAtelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissances
 
Fundamentals of Programming and Language Processors
Fundamentals of Programming and Language ProcessorsFundamentals of Programming and Language Processors
Fundamentals of Programming and Language Processors
 
Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024
 
Quarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden ExtensionsQuarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden Extensions
 
Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604
 
A Sighting of filterA in Typelevel Rite of Passage
A Sighting of filterA in Typelevel Rite of PassageA Sighting of filterA in Typelevel Rite of Passage
A Sighting of filterA in Typelevel Rite of Passage
 
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdf
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdfAutomated software refactoring with OpenRewrite and Generative AI.pptx.pdf
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdf
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)
 
Transform Your Communication with Cloud-Based IVR Solutions
Transform Your Communication with Cloud-Based IVR SolutionsTransform Your Communication with Cloud-Based IVR Solutions
Transform Your Communication with Cloud-Based IVR Solutions
 
A Study of Variable-Role-based Feature Enrichment in Neural Models of Code
A Study of Variable-Role-based Feature Enrichment in Neural Models of CodeA Study of Variable-Role-based Feature Enrichment in Neural Models of Code
A Study of Variable-Role-based Feature Enrichment in Neural Models of Code
 

What is the Multinomial-Logistic Regression Classification Algorithm and How Does One Use it for Analysis?

  • 1. Master the Art of Analytics A Simplistic Explainer Series For Citizen Data Scientists J o u r n e y To w a r d s A u g m e n t e d A n a l y t i c s
  • 3. Terminologies Introduction & Example Standard input/tuning parameters & Sample UI Sample output UI Interpretation of Output Limitations Business use cases What Are All Covered
  • 5. Terminologies  Target variable usually denoted by Y , is the variable being predicted and is also called dependent variable, output variable, response variable or outcome variable (Ex : One highlighted in red box in table below)  Predictor, sometimes called an independent variable, is a variable that is being used to predict the target variable ( Ex : variables highlighted in green box in table below ) Age Marital Status Gender Satisfaction level 58 married Female High 44 single Female Low 33 married Male Medium 47 married Female High 33 single Female Medium 35 married Male High 28 single Male Low
  • 7. Introduction • OBJECTIVE : • Logistic regression measures the relationship between the categorical target variable and one or more independent variables • It deals with situations in which the outcome for a target variable can have two or more possible types • Thus , logistic regression makes use of one or more predictor variables that may be either continuous or categorical to predict the target variable classes • BENEFIT: • Logistic regression model output helps identify important factors ( Xi ) impacting the target variable (Y) and also the nature of relationship between each of these factors and dependent variable
  • 8. Example : Multinomial Logistic Regression : Input Let’s conduct the Multinomial Logistic Regression analysis on following variables : Job satisfaction level Age Marital Status Gender Income Low 58 married Male 46,399 Medium 44 single Male 47,971 Low 33 married Female 52,618 High 47 married Male 28,717 Medium 33 single Female 41,216 Medium 35 married Female 34,372 Low 28 single Male 64,811 Medium 42 divorced Female 53,000 High 58 married Female 41,375 Low 43 single Male 53,778 Low 41 divorced Male 44,440 Medium 29 single Female 51,026 Independent variables (Xi)Target Variable (Y)
  • 9. Example : Multinomial Logistic Regression : Output 1 Coefficient P value High Age 1.54 0.05 Income -0.34 0.03 Male 0.67 0.02 Low Age -2.34 0.05 Income 0.56 0.01 Male -1.23 0.04 Coefficients • High satisfaction with reference to medium satisfaction: Age - Multinomial logit (Natural log of the proportion of High to that of Medium here) estimate for 1 year increase in age for high job satisfaction relative to medium job satisfaction when other independent variables are held constant = 1.54 Male - Multinomial logit estimate for comparing male to females for high job satisfaction relative to medium job satisfaction when other variables are held constant = 0.67 Interpretation
  • 10. Example : Multinomial Logistic Regression : Output 2 Classification Accuracy : (50+ 10 + 70) / (50+ 10 + 70+ 4+4+5+4+6+7) = 81% • The prediction accuracy is useful criterion for assessing the model performance • Model with prediction accuracy >= 70% is useful Classification Error = 100- Accuracy = 19% There is 19% chance of error in classification Low Medium High Low 50 4 4 Medium 4 70 5 High 6 7 10 Actual versus predicted Predicted Actual
  • 14. Sample output 1 : Model Summary Actual versus predicted Predicted Actual Coefficient matrix : Low Medium High Low 50 4 4 Medium 4 70 5 High 6 7 10 Coefficient P value High Age 1.54 0.05 Income -0.34 0.03 Male 0.67 0.02 Low Age -2.34 0.05 Income 0.56 0.01 Male -1.23 0.04
  • 15. Age Marital Status Gender Income Job satisfaction level Predicted class Probability 58 married Female 46,399 Low Low 0.7 44 single Female 47,971 High High 0.9 33 married Male 52,618 Low Low 0.8 47 married Female 28,717 Low High 0.7 33 single Male 41,216 High Low 0.6 35 married Male 34,372 High High 0.5 28 single Female 64,811 Low Low 0.4 42 divorced Male 53,000 Low Low 0.3 58 married Female 41,375 High Low 0.2 43 single Male 53,778 High High 0.1 Sample output 2 : Predicted class & probability
  • 16. Sample Output 3 : Classification Plot • Lesser the overlap among three classes in the plot above , better the classification done by model • Thus, output will contain predicted class column, confusion matrix and classification plot
  • 18. Interpretation of Important Model Summary Statistics Accuracy:  If Accuracy >= 70% : Model is well fit on provided data and predicted classes are reasonably accurate  If Accuracy < 70% : Model is not well fit on provided data and predicted classes are likely to contain high chances of error Coefficients and p value :  If value of coefficient is positive and p value <0.05 , variable is positively correlated with target variable  If value of coefficient is negative and p value <0.05 , variable is negatively correlated with target variable  If p value > 0.05, variable is unimportant in terms of predicting target variable classes
  • 20. Limitations  It is applicable only when target variable is categorical  Sample size must be at least 1000 in order to get reliable predictions  Level 1 of the target variable should represent the desired outcome.  i.e. if desired class is yes in response/non response target variable then Yes has to be recoded into 1 and No into 0
  • 22. Use case 1 Business benefit: • By having a knowledge of probable election outcome, proper strategy can be put in place in case of discrepancies between expectations and predictions and the segments with high likelihood of voting oppositions can be targeted in better and effective manner in order to get their votes in favor of a client party Business problem : • A research agency wants to predict the likelihood of each election candidate being voted by each voter and in turn devise a strategy to take proactive steps • Here the target variable would be ‘preferred party name’ and predictors would be customer demographics such as age, income, qualification, occupation, gender, religion and past voting status etc.
  • 23. Use case 1 : Sample Input Dataset Responder ID Qualification income Age Gender Occupation Done voting in past Preferred party 1039153 Bcom 105000 18 M Accountant yes ABC 1069697 12th Pass 192000 20 F Office Supervisor No XYZ 1068120 BSC 310000 30 F Pathologist yes PQL 563175 10th Pass 100000 45 M Labour yes XYZ 562842 ME 357228 25 M Software Developer No PQL 562681 MSC 413000 28 F Statistician yes XYZ 562404 BSC Nill 34 F Home maker No PQL
  • 24. Use case 1 : Output : Predicted Class Output : Each record will have a predicted class along with probability assigned as shown below : Respond er ID Qualificatio n Income Age Gender Occupation Done voting in past Predicted party Probability 1039153 Bcom 105000 18 M Accountant yes ABC 0.7 1069697 12th Pass 192000 20 F Office Supervisor No XYZ 0.9 1068120 BSC 310000 30 F Pathologist yes PQL 0.8 563175 10th Pass 100000 45 M Labour yes XYZ 0.7 562842 ME 357228 25 M Software Developer No PQL 0.6 562681 MSC 413000 28 F Statistician yes XYZ 0.5 562404 BSC Nill 34 F Home maker No PQL 0.4
  • 25. Use case 1 : Output : Sample Class profile Predicted Party Average Annual income Average Age ABC 86,467 30 XYZ 60,935 25 PQL 1,05,400 35 • As can be seen in the table above, there is distinctive characteristics of population associated with each preferred party : • For instance, females are inclined towards XYZ whereas males tend to prefer ABC • Responders with high income and age prefer to vote for PQL whereas XYZ party is preferred by lowest income and age group • Fresh voters are likely to vote for party XYZ whereas those who have done voting in past are inclined towards ABC party Gender Predicted Party Male Female ABC 60 4 XYZ 10 78 PQL 14 15 Past voting status Predicted Party Yes No ABC 58 6 XYZ 15 73 PQL 11 19
  • 26. Use case 2 Business benefit: •Given the body profile of a patient and predicted level of disease , right cure/medications can be suggested to a patient Business problem : •A doctor/ pharmacist wants to predict the likelihood of a new patient’s disease being at initial/intermediate/severe stage based on various body attributes of a patient such as blood pressure , hemoglobin level, sugar level , red blood counts, TSH etc. •Here the target variable would be level of disease and would contain values ‘Initial, Intermediate and Severe’
  • 27. Want to Learn More? Get in touch with us @ support@Smarten.com And Do Checkout the Learning section on Smarten.com June 2018