Multinomial Logistic Regression Models




January 1, 2013     ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
 Logistic regression CAN handle dependent variables
   with more than two categories
  It is important to note whether the response variable
   is ordinal (ordered categories such as young,
   middle-aged, old) or nominal (unordered categories
   such as red, blue, black)
  Some multinomial logistic models are appropriate only
   for ordered responses
  It is not mathematically necessary to consider the
   natural ordering when modeling an ordinal response, but
  Considering the natural ordering
       Leads to a more parsimonious model
       Increases power to detect relationships with other variables

 Logistic regression that takes the natural order into
   account is fitted with a modeling technique called the
   “Proportional Odds Model”
  Say the dependent variable Y has 4 states measuring
   the impact of radiation on the human body: fine,
   sick, serious, dead
  Let p1 = prob. of fine, p2 = prob. of sick, p3 = prob. of
   serious, p4 = prob. of dead
  Let us define a baseline category: fine, since this is
   the normal state (we shall see why we need this
   later)
    What if we break up the modeling of the 4-level
      ordered dependent variable into 3 binary logistic
      situations: (1) fine vs. sick, (2) fine vs. serious,
      (3) fine vs. dead?
     Then we would have 3 logit equations:
       log(p2/p1) = B11 + B12 X1 + B13 X2
       log(p3/p1) = B21 + B22 X1 + B23 X2
       log(p4/p1) = B31 + B32 X1 + B33 X2
      X is the degree of radiation, a categorical variable with
        3 levels, so it enters as 2 binary dummies X1 and X2
     So, 9 parameters are to be estimated
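
As a rough illustration of the three baseline-category logit equations above, the sketch below turns a set of coefficients into the four category probabilities. The coefficient values and the helper name `baseline_logit_probs` are made up for illustration; they are not estimates from the radiation data.

```python
import math

def baseline_logit_probs(coefs, x1, x2):
    """Return [p1, p2, p3, p4] from the logits log(pj/p1) = Bj1 + Bj2*X1 + Bj3*X2.

    coefs holds one (intercept, slope_x1, slope_x2) triple per non-baseline
    category (sick, serious, dead); 'fine' is the baseline.
    """
    # exp(logit_j) equals pj / p1 for each non-baseline category
    ratios = [math.exp(b0 + b1 * x1 + b2 * x2) for b0, b1, b2 in coefs]
    p1 = 1.0 / (1.0 + sum(ratios))          # probabilities must sum to 1
    return [p1] + [r * p1 for r in ratios]

# Hypothetical coefficients: 3 equations x 3 parameters = 9 parameters
coefs = [(-1.0, 0.8, 1.5), (-2.0, 1.0, 2.0), (-3.0, 1.2, 2.5)]
n_params = sum(len(c) for c in coefs)
probs = baseline_logit_probs(coefs, x1=1, x2=0)
```

Each non-baseline probability is tied to p1 through its own equation, which is why a baseline category has to be chosen first.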

    Now consider an alternative model for the same
      situation
     Cumulative logit model:
       L1 = log(p1/(p2+p3+p4))
       L2 = log((p1+p2)/(p3+p4))
       L3 = log((p1+p2+p3)/p4)
     The obvious way to introduce covariates is
       L1 = B11 + B12 X1 + B13 X2
       L2 = B21 + B22 X1 + B23 X2
       L3 = B31 + B32 X1 + B33 X2
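
The cumulative logits can be computed directly from a set of category probabilities. A minimal sketch, assuming hypothetical probabilities for the four states (the numbers and the function name `cumulative_logits` are illustrative only):

```python
import math

def cumulative_logits(probs):
    """Return [L1, ..., L_{J-1}] with Lj = log(P(Y <= j) / P(Y > j))."""
    logits = []
    cum = 0.0
    for p in probs[:-1]:            # the last category has no logit
        cum += p
        logits.append(math.log(cum / (1.0 - cum)))
    return logits

# Hypothetical probabilities for fine, sick, serious, dead
probs = [0.4, 0.3, 0.2, 0.1]
L1, L2, L3 = cumulative_logits(probs)
```

Because the cumulative probability can only grow with j, the logits are always increasing: L1 < L2 < L3.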

    Let us simplify the model by specifying that
      the slope parameters are identical across the
      logit equations. Then,
       L1 = A1 + B1 X1 + B2 X2
       L2 = A2 + B1 X1 + B2 X2
       L3 = A3 + B1 X1 + B2 X2
     This is the proportional odds cumulative logit
      model
     So, only 5 parameters are to be estimated instead of 9
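
To see how the shared-slope model produces category probabilities, the following sketch inverts the three cumulative logits and differences the results. The intercepts, slopes and function names are assumptions chosen for illustration.

```python
import math

def logistic(t):
    """Inverse of the logit: maps a log-odds value to a probability."""
    return 1.0 / (1.0 + math.exp(-t))

def proportional_odds_probs(intercepts, slopes, x):
    """Category probabilities under Lj = Aj + B1*X1 + B2*X2 (shared slopes).

    intercepts must be increasing so that the cumulative probabilities are.
    """
    eta = sum(b * xi for b, xi in zip(slopes, x))
    cum = [logistic(a + eta) for a in intercepts] + [1.0]   # P(Y <= j)
    return [cum[0]] + [cum[j] - cum[j - 1] for j in range(1, len(cum))]

# Hypothetical values: 3 intercepts + 2 shared slopes
A = [-0.5, 0.7, 2.0]
B = [-0.6, -1.4]
probs = proportional_odds_probs(A, B, x=(1, 0))
```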

 Suppose that the categorical outcome is actually a
   categorized version of an unobservable (latent)
   continuous variable Z which has a logistic distribution
  The continuous scale is divided into five regions by
   four cut-points c1, c2, c3, c4 which are determined by
   nature (the 4-state radiation example would need three
   cut-points)
  If Z ≤ c1 we observe Y = 1; if c1 < Z ≤ c2 we observe Y =
   2; and so on
  Suppose that Z is related to the X’s through a linear
   regression
  Then the coarsened categorical variable Y will be
   related to the X’s by a proportional-odds
   cumulative logit model
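
The step from the latent regression to the cumulative logit model can be written out explicitly. This is a sketch under the assumption that Z = β'x + ε, where ε follows a standard logistic distribution with CDF F(t) = 1/(1 + e^{-t}); the symbols β, ε and F are introduced here for illustration.

```latex
\begin{aligned}
P(Y \le j \mid x) &= P(Z \le c_j \mid x)
                   = P(\varepsilon \le c_j - \beta^{\top} x)
                   = F(c_j - \beta^{\top} x), \\
\log \frac{P(Y \le j \mid x)}{P(Y > j \mid x)} &= c_j - \beta^{\top} x .
\end{aligned}
```

The cut-points play the role of the intercepts Aj, and the slope vector is the same for every j (equal to -β in this parameterisation), which is the proportional-odds structure.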
    Let us go back to the model
       L1 = A1 + B1 X1 + B2 X2
       L2 = A2 + B1 X1 + B2 X2
       L3 = A3 + B1 X1 + B2 X2
     Note that Lj is the log-odds of falling into or below category j
      versus falling above it
     Aj is the log-odds of falling into or below category j when X1 =
      X2 = 0
     Bk is the increase in log-odds of falling into or below any
      category associated with a one-unit increase in Xk, holding all
      the other X-variables constant
     Therefore, a positive slope indicates a tendency for the
      response level to decrease as the variable increases
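
The name "proportional odds" comes from the fact that the odds ratio for a one-unit change in a covariate is exp(Bk) at every cut j. A small numerical check, with hypothetical intercepts and slopes:

```python
import math

def cumulative_odds(a_j, slopes, x):
    """Odds of falling into or below category j: exp(Aj + B1*X1 + B2*X2)."""
    return math.exp(a_j + sum(b * xi for b, xi in zip(slopes, x)))

A = [-0.5, 0.7, 2.0]     # hypothetical intercepts A1..A3
B = [0.9, 0.4]           # hypothetical slopes B1, B2

# Odds ratio for a one-unit increase in X1 (X2 held at 0), one per cut j
odds_ratios = [
    cumulative_odds(a, B, (1, 0)) / cumulative_odds(a, B, (0, 0)) for a in A
]
```

All three ratios equal exp(0.9). Since B1 is positive here, raising X1 raises the odds of being in or below each category, so the response tends toward lower levels.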
    Our example of 4 levels of impact of radiation
      corresponding to 3 levels of radiation

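 proc logistic data=radiation_impact;
   freq count;   /* count holds the frequency of each row (grouped data) */
   class radiation / order=data param=ref ref=first;   /* dummy-code radiation */
   /* cumulative logit model, response levels taken in reverse data order */
   model sickness (order=data descending) = radiation /
         link=logit aggregate=(radiation) scale=none;
 run;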
    freq count
       This is important for specifying grouped data
       count is the variable that contains the frequency of
        occurrence of each observation
       In its absence, each row would be treated as a
        single observation
     class radiation
       Specifies that radiation is a classification variable to
        be used in the analysis
       With the param=ref option, SAS automatically generates
        n-1 binary dummies for the n categories of radiation
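
The effect of the frequency variable and of reference coding can be mimicked outside SAS. The sketch below is only an analogy in Python; the grouped rows, the level names ("low", "medium", "high") and both helper functions are hypothetical.

```python
# Hypothetical grouped rows: (radiation level, sickness level, count)
grouped = [
    ("low", "fine", 3),
    ("low", "sick", 1),
    ("high", "dead", 2),
]

def expand_by_freq(rows):
    """Repeat each (radiation, sickness) row 'count' times, as a frequency variable implies."""
    return [(rad, sick) for rad, sick, count in rows for _ in range(count)]

def reference_dummies(value, levels, ref):
    """n-1 indicator columns for n levels; the reference level is all zeros."""
    return [1 if value == lev else 0 for lev in levels if lev != ref]

levels = ["low", "medium", "high"]          # assumed level names, in data order
cases = expand_by_freq(grouped)
dummies_high = reference_dummies("high", levels, ref=levels[0])
dummies_low = reference_dummies("low", levels, ref=levels[0])
```

Three grouped rows stand for six individual cases, and a 3-level variable becomes 2 dummy columns with the first level coded as all zeros.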
    Order=data
       Tells SAS to arrange the levels in the order in which
          they occur in the input data (1, 2, 3, 4 for the response)
     Param=ref
       Requests reference (dummy) coding for the
          classification variable ‘radiation’ listed in class
     Ref=first
       Designates the first ordered level of the classification
          variable as the reference level

    Order=data descending
         This tells SAS to reverse the order of the response levels in the logits
         So, instead of the cumulative logit model being
         L1 = log(p1/(p2+p3+p4))
         L2 = log((p1+p2)/(p3+p4))
         L3 = log((p1+p2+p3)/p4), it becomes
         L1 = log(p4/(p1+p2+p3))
         L2 = log((p4+p3)/(p1+p2))
         L3 = log((p4+p3+p2)/p1)
     Now, a positive B1 indicates that a higher value of X1
      leads to a greater chance of more severe radiation sickness
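
Reversing the response order simply flips the sign of each cumulative logit and reverses their order. A quick check with hypothetical probabilities (the function name is illustrative):

```python
import math

def cumulative_logits(probs):
    """Lj = log(P(Y <= j) / P(Y > j)) for j = 1..J-1, in the given category order."""
    logits, cum = [], 0.0
    for p in probs[:-1]:
        cum += p
        logits.append(math.log(cum / (1.0 - cum)))
    return logits

probs = [0.4, 0.3, 0.2, 0.1]                            # hypothetical p1..p4
ascending = cumulative_logits(probs)                    # log(p1/(p2+p3+p4)), ...
descending = cumulative_logits(list(reversed(probs)))   # log(p4/(p1+p2+p3)), ...
```

Here `descending[0]` equals `-ascending[2]`, and so on, which is why the sign of every slope flips when the descending option is used.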
    Link=logit
       Fits the cumulative logit model when there are more
          than two response categories
     Aggregate=(radiation)
       Indicates that the goodness-of-fit statistics are to be
          calculated on the subpopulations defined by the variable
          radiation
     Scale=none
       No correction is applied to the dispersion parameter
       To understand this, read up on overdispersion: a correction
          is needed when the goodness-of-fit statistic exceeds its
          degrees of freedom by a wide margin
    When we fit this model, the first output we
      see:
 Score Test for the Proportional Odds Assumption
 Chi-Square    DF    Pr > ChiSq
 17.2866       21    0.6936

  Null hypothesis is that the current proportional-odds
   cumulative logit model is true
  With p = 0.6936 we fail to reject the null, so we can proceed
   to the rest of the output under the current assumption
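
The reported p-value is the upper-tail probability of a chi-square distribution at the observed statistic. The sketch below recomputes it with the standard library only, using a series expansion of the regularized lower incomplete gamma function; the function name `chi2_sf` is an assumption, and a statistics library would normally be used instead.

```python
import math

def chi2_sf(x, df):
    """Upper-tail probability P(X > x) for a chi-square variable with df degrees of freedom."""
    a, t = df / 2.0, x / 2.0
    if t <= 0:
        return 1.0
    # Series for the lower incomplete gamma: sum over n of t^n / (a (a+1) ... (a+n))
    term = 1.0 / a
    total = term
    n = 0
    while abs(term) > 1e-15 * abs(total):
        n += 1
        term *= t / (a + n)
        total += term
    lower = total * math.exp(-t + a * math.log(t) - math.lgamma(a))
    return 1.0 - lower

# Values taken from the score test output above
p_value = chi2_sf(17.2866, 21)
```

The result agrees with the 0.6936 shown in the output to about three decimal places.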


    Ultimately we are interested in the predicted
      probabilities
          OUTPUT <OUT=SAS-data-set> <options>
     PREDICTED=
       For a cumulative model, it is the predicted cumulative
          probability (that is, the probability that the response
          variable is less than or equal to the value of _LEVEL_)
     PREDPROBS=I or C
       INDIVIDUAL | I requests the predicted probability of each
        response level
       CUMULATIVE | C requests the cumulative predicted
        probability of each response level
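
The two kinds of predicted probability carry the same information: individual probabilities are successive differences of the cumulative ones. A minimal sketch with hypothetical cumulative predictions (the values and the function name are illustrative):

```python
def individual_from_cumulative(cum_probs):
    """Difference cumulative probabilities P(Y <= level) into per-level probabilities."""
    individual, previous = [], 0.0
    for c in cum_probs:
        individual.append(c - previous)
        previous = c
    return individual

# Hypothetical cumulative predictions for the four ordered response levels
cumulative = [0.25, 0.60, 0.90, 1.00]
individual = individual_from_cumulative(cumulative)
```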

More Related Content

What's hot

ANALYTICAL FORMULATIONS FOR THE LEVEL BASED WEIGHTED AVERAGE VALUE OF DISCRET...
ANALYTICAL FORMULATIONS FOR THE LEVEL BASED WEIGHTED AVERAGE VALUE OF DISCRET...ANALYTICAL FORMULATIONS FOR THE LEVEL BASED WEIGHTED AVERAGE VALUE OF DISCRET...
ANALYTICAL FORMULATIONS FOR THE LEVEL BASED WEIGHTED AVERAGE VALUE OF DISCRET...
ijsc
 
Mm2521542158
Mm2521542158Mm2521542158
Mm2521542158
IJERA Editor
 
A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.
A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.
A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.
inventionjournals
 
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...Application of Adomian Decomposition Method in Solving Second Order Nonlinear...
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...
inventionjournals
 
A review of automatic differentiationand its efficient implementation
A review of automatic differentiationand its efficient implementationA review of automatic differentiationand its efficient implementation
A review of automatic differentiationand its efficient implementation
ssuserfa7e73
 
A NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMS
A NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMSA NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMS
A NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMS
orajjournal
 
Linear programming
Linear programmingLinear programming
Linear programming
VARUN KUMAR
 
Cg32519523
Cg32519523Cg32519523
Cg32519523
IJERA Editor
 
Chapter two
Chapter twoChapter two
Chapter two
Mohamed Daahir
 
Fractional calculus and applications
Fractional calculus and applicationsFractional calculus and applications
Fractional calculus and applications
PlusOrMinusZero
 
An Approach to Mathematically Establish the Practical Use of Assignment Probl...
An Approach to Mathematically Establish the Practical Use of Assignment Probl...An Approach to Mathematically Establish the Practical Use of Assignment Probl...
An Approach to Mathematically Establish the Practical Use of Assignment Probl...
ijtsrd
 
Linear Programming 1
Linear Programming 1Linear Programming 1
Linear Programming 1
irsa javed
 
Heptagonal Fuzzy Numbers by Max Min Method
Heptagonal Fuzzy Numbers by Max Min MethodHeptagonal Fuzzy Numbers by Max Min Method
Heptagonal Fuzzy Numbers by Max Min Method
YogeshIJTSRD
 
Linear programming class 12 investigatory project
Linear programming class 12 investigatory projectLinear programming class 12 investigatory project
Linear programming class 12 investigatory project
Divyans890
 
A new approach for ranking of intuitionistic fuzzy numbers
A new approach for ranking of intuitionistic fuzzy numbersA new approach for ranking of intuitionistic fuzzy numbers
A new approach for ranking of intuitionistic fuzzy numbers
Journal of Fuzzy Extension and Applications
 
linear programming
linear programming linear programming
linear programming
DagnaygebawGoshme
 
Certified global minima
Certified global minimaCertified global minima
Certified global minima
ssuserfa7e73
 
Dr azimifar pattern recognition lect2
Dr azimifar pattern recognition lect2Dr azimifar pattern recognition lect2
Dr azimifar pattern recognition lect2
Zahra Amini
 
A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...
A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...
A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...
Navodaya Institute of Technology
 

What's hot (19)

ANALYTICAL FORMULATIONS FOR THE LEVEL BASED WEIGHTED AVERAGE VALUE OF DISCRET...
ANALYTICAL FORMULATIONS FOR THE LEVEL BASED WEIGHTED AVERAGE VALUE OF DISCRET...ANALYTICAL FORMULATIONS FOR THE LEVEL BASED WEIGHTED AVERAGE VALUE OF DISCRET...
ANALYTICAL FORMULATIONS FOR THE LEVEL BASED WEIGHTED AVERAGE VALUE OF DISCRET...
 
Mm2521542158
Mm2521542158Mm2521542158
Mm2521542158
 
A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.
A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.
A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.
 
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...Application of Adomian Decomposition Method in Solving Second Order Nonlinear...
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...
 
A review of automatic differentiationand its efficient implementation
A review of automatic differentiationand its efficient implementationA review of automatic differentiationand its efficient implementation
A review of automatic differentiationand its efficient implementation
 
A NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMS
A NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMSA NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMS
A NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMS
 
Linear programming
Linear programmingLinear programming
Linear programming
 
Cg32519523
Cg32519523Cg32519523
Cg32519523
 
Chapter two
Chapter twoChapter two
Chapter two
 
Fractional calculus and applications
Fractional calculus and applicationsFractional calculus and applications
Fractional calculus and applications
 
An Approach to Mathematically Establish the Practical Use of Assignment Probl...
An Approach to Mathematically Establish the Practical Use of Assignment Probl...An Approach to Mathematically Establish the Practical Use of Assignment Probl...
An Approach to Mathematically Establish the Practical Use of Assignment Probl...
 
Linear Programming 1
Linear Programming 1Linear Programming 1
Linear Programming 1
 
Heptagonal Fuzzy Numbers by Max Min Method
Heptagonal Fuzzy Numbers by Max Min MethodHeptagonal Fuzzy Numbers by Max Min Method
Heptagonal Fuzzy Numbers by Max Min Method
 
Linear programming class 12 investigatory project
Linear programming class 12 investigatory projectLinear programming class 12 investigatory project
Linear programming class 12 investigatory project
 
A new approach for ranking of intuitionistic fuzzy numbers
A new approach for ranking of intuitionistic fuzzy numbersA new approach for ranking of intuitionistic fuzzy numbers
A new approach for ranking of intuitionistic fuzzy numbers
 
linear programming
linear programming linear programming
linear programming
 
Certified global minima
Certified global minimaCertified global minima
Certified global minima
 
Dr azimifar pattern recognition lect2
Dr azimifar pattern recognition lect2Dr azimifar pattern recognition lect2
Dr azimifar pattern recognition lect2
 
A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...
A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...
A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...
 

Viewers also liked

Boosted Tree-based Multinomial Logit Model for Aggregated Market Data
Boosted Tree-based Multinomial Logit Model for Aggregated Market DataBoosted Tree-based Multinomial Logit Model for Aggregated Market Data
Boosted Tree-based Multinomial Logit Model for Aggregated Market Data
Jay (Jianqiang) Wang
 
The power of RapidMiner, showing the direct marketing demo
The power of RapidMiner, showing the direct marketing demoThe power of RapidMiner, showing the direct marketing demo
The power of RapidMiner, showing the direct marketing demo
Wessel Luijben
 
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARNHadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Josh Patterson
 
Statisticsfor businessproject solution
Statisticsfor businessproject solutionStatisticsfor businessproject solution
Statisticsfor businessproject solution
huynguyenbac
 
Financialmodeling
FinancialmodelingFinancialmodeling
Financialmodeling
Talal Tahir
 
Chapter 16
Chapter 16Chapter 16
Chapter 16
Matthew L Levy
 
Midterm
MidtermMidterm
Midterm
Wilkes U
 
Ordinal Logistic Regression
Ordinal Logistic RegressionOrdinal Logistic Regression
Ordinal Logistic Regression
Al-Ahmadgaid Asaad
 
Logistic Regression/Markov Chain presentation
Logistic Regression/Markov Chain presentationLogistic Regression/Markov Chain presentation
Logistic Regression/Markov Chain presentation
Michael Hankin
 
Qam formulas
Qam formulasQam formulas
Qam formulas
Ashu Jain
 
Transparency7
Transparency7Transparency7
Transparency7
A M
 
1.5.1 measures basic concepts
1.5.1 measures basic concepts1.5.1 measures basic concepts
1.5.1 measures basic concepts
A M
 
Regression
Regression Regression
Regression
Ali Raza
 
(마더세이프 라운드) Logistic regression
(마더세이프 라운드) Logistic regression(마더세이프 라운드) Logistic regression
(마더세이프 라운드) Logistic regressionmothersafe
 
Regression: A skin-deep dive
Regression: A skin-deep diveRegression: A skin-deep dive
Regression: A skin-deep dive
abulyomon
 
ForecastIT 2. Linear Regression & Model Statistics
ForecastIT 2. Linear Regression & Model StatisticsForecastIT 2. Linear Regression & Model Statistics
ForecastIT 2. Linear Regression & Model Statistics
DeepThought, Inc.
 
Generalized Logistic Regression - by example (Anthony Kilili)
Generalized Logistic Regression - by example (Anthony Kilili)Generalized Logistic Regression - by example (Anthony Kilili)
Generalized Logistic Regression - by example (Anthony Kilili)
Anthony Kilili
 
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
mohamedchaouche
 
C2.1 intro
C2.1 introC2.1 intro
C2.1 intro
Daniel LIAO
 
Mode Choice analysis for work trips using Multinomial Logit model for Windsor...
Mode Choice analysis for work trips using Multinomial Logit model for Windsor...Mode Choice analysis for work trips using Multinomial Logit model for Windsor...
Mode Choice analysis for work trips using Multinomial Logit model for Windsor...
Aakash Bagchi
 

Viewers also liked (20)

Boosted Tree-based Multinomial Logit Model for Aggregated Market Data
Boosted Tree-based Multinomial Logit Model for Aggregated Market DataBoosted Tree-based Multinomial Logit Model for Aggregated Market Data
Boosted Tree-based Multinomial Logit Model for Aggregated Market Data
 
The power of RapidMiner, showing the direct marketing demo
The power of RapidMiner, showing the direct marketing demoThe power of RapidMiner, showing the direct marketing demo
The power of RapidMiner, showing the direct marketing demo
 
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARNHadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
 
Statisticsfor businessproject solution
Statisticsfor businessproject solutionStatisticsfor businessproject solution
Statisticsfor businessproject solution
 
Financialmodeling
FinancialmodelingFinancialmodeling
Financialmodeling
 
Chapter 16
Chapter 16Chapter 16
Chapter 16
 
Midterm
MidtermMidterm
Midterm
 
Ordinal Logistic Regression
Ordinal Logistic RegressionOrdinal Logistic Regression
Ordinal Logistic Regression
 
Logistic Regression/Markov Chain presentation
Logistic Regression/Markov Chain presentationLogistic Regression/Markov Chain presentation
Logistic Regression/Markov Chain presentation
 
Qam formulas
Qam formulasQam formulas
Qam formulas
 
Transparency7
Transparency7Transparency7
Transparency7
 
1.5.1 measures basic concepts
1.5.1 measures basic concepts1.5.1 measures basic concepts
1.5.1 measures basic concepts
 
Regression
Regression Regression
Regression
 
(마더세이프 라운드) Logistic regression
(마더세이프 라운드) Logistic regression(마더세이프 라운드) Logistic regression
(마더세이프 라운드) Logistic regression
 
Regression: A skin-deep dive
Regression: A skin-deep diveRegression: A skin-deep dive
Regression: A skin-deep dive
 
ForecastIT 2. Linear Regression & Model Statistics
ForecastIT 2. Linear Regression & Model StatisticsForecastIT 2. Linear Regression & Model Statistics
ForecastIT 2. Linear Regression & Model Statistics
 
Generalized Logistic Regression - by example (Anthony Kilili)
Generalized Logistic Regression - by example (Anthony Kilili)Generalized Logistic Regression - by example (Anthony Kilili)
Generalized Logistic Regression - by example (Anthony Kilili)
 
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
 
C2.1 intro
C2.1 introC2.1 intro
C2.1 intro
 
Mode Choice analysis for work trips using Multinomial Logit model for Windsor...
Mode Choice analysis for work trips using Multinomial Logit model for Windsor...Mode Choice analysis for work trips using Multinomial Logit model for Windsor...
Mode Choice analysis for work trips using Multinomial Logit model for Windsor...
 

Similar to Logistic regression for ordered dependant variable with more than 2 levels

How principal components analysis is different from factor
How principal components analysis is different from factorHow principal components analysis is different from factor
How principal components analysis is different from factor
Arup Guha
 
Supervised Learning.pdf
Supervised Learning.pdfSupervised Learning.pdf
Supervised Learning.pdf
gadissaassefa
 
ML MODULE 4.pdf
ML MODULE 4.pdfML MODULE 4.pdf
ML MODULE 4.pdf
Shiwani Gupta
 
A Theory of the Learnable; PAC Learning
A Theory of the Learnable; PAC LearningA Theory of the Learnable; PAC Learning
A Theory of the Learnable; PAC Learning
dhruvgairola
 
GRADIENT OMISSIVE DESCENT IS A MINIMIZATION ALGORITHM
GRADIENT OMISSIVE DESCENT IS A MINIMIZATION ALGORITHMGRADIENT OMISSIVE DESCENT IS A MINIMIZATION ALGORITHM
GRADIENT OMISSIVE DESCENT IS A MINIMIZATION ALGORITHM
ijscai
 
IRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms ComparisonIRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms Comparison
IRJET Journal
 
IRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms ComparisonIRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms Comparison
IRJET Journal
 
autocorrelation.pptx
autocorrelation.pptxautocorrelation.pptx
autocorrelation.pptx
PriyadharshanBobby
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
ijceronline
 
Study the different
Study the differentStudy the different
Study the different
prj_publication
 
Yahya Almalki SOED 2016
Yahya Almalki SOED 2016Yahya Almalki SOED 2016
Yahya Almalki SOED 2016
Colleen Ganley
 
Design and analysis of ra sort
Design and analysis of ra sortDesign and analysis of ra sort
Design and analysis of ra sort
ijfcstjournal
 
working with python
working with pythonworking with python
working with python
bhavesh lande
 
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
The Statistical and Applied Mathematical Sciences Institute
 
2018 p 2019-ee-a2
2018 p 2019-ee-a22018 p 2019-ee-a2
2018 p 2019-ee-a2
uetian12
 
Ica group 3[1]
Ica group 3[1]Ica group 3[1]
Ica group 3[1]
Apoorva Srinivasan
 
Big Data Analysis
Big Data AnalysisBig Data Analysis
Big Data Analysis
NBER
 
ppt0320defenseday
ppt0320defensedayppt0320defenseday
ppt0320defenseday
Xi (Shay) Zhang, PhD
 
Prediction model of algal blooms using logistic regression and confusion matrix
Prediction model of algal blooms using logistic regression and confusion matrix Prediction model of algal blooms using logistic regression and confusion matrix
Prediction model of algal blooms using logistic regression and confusion matrix
IJECEIAES
 
A new generalized lindley distribution
A new generalized lindley distributionA new generalized lindley distribution
A new generalized lindley distribution
Alexander Decker
 

Similar to Logistic regression for ordered dependant variable with more than 2 levels (20)

How principal components analysis is different from factor
How principal components analysis is different from factorHow principal components analysis is different from factor
How principal components analysis is different from factor
 
Supervised Learning.pdf
Supervised Learning.pdfSupervised Learning.pdf
Supervised Learning.pdf
 
ML MODULE 4.pdf
ML MODULE 4.pdfML MODULE 4.pdf
ML MODULE 4.pdf
 
A Theory of the Learnable; PAC Learning
A Theory of the Learnable; PAC LearningA Theory of the Learnable; PAC Learning
A Theory of the Learnable; PAC Learning
 
GRADIENT OMISSIVE DESCENT IS A MINIMIZATION ALGORITHM
GRADIENT OMISSIVE DESCENT IS A MINIMIZATION ALGORITHMGRADIENT OMISSIVE DESCENT IS A MINIMIZATION ALGORITHM
GRADIENT OMISSIVE DESCENT IS A MINIMIZATION ALGORITHM
 
IRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms ComparisonIRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms Comparison
 
IRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms ComparisonIRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms Comparison
 
autocorrelation.pptx
autocorrelation.pptxautocorrelation.pptx
autocorrelation.pptx
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
Study the different
Study the differentStudy the different
Study the different
 
Yahya Almalki SOED 2016
Yahya Almalki SOED 2016Yahya Almalki SOED 2016
Yahya Almalki SOED 2016
 
Design and analysis of ra sort
Design and analysis of ra sortDesign and analysis of ra sort
Design and analysis of ra sort
 
working with python
working with pythonworking with python
working with python
 
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
 
2018 p 2019-ee-a2
2018 p 2019-ee-a22018 p 2019-ee-a2
2018 p 2019-ee-a2
 
Ica group 3[1]
Ica group 3[1]Ica group 3[1]
Ica group 3[1]
 
Big Data Analysis
Big Data AnalysisBig Data Analysis
Big Data Analysis
 
ppt0320defenseday
ppt0320defensedayppt0320defenseday
ppt0320defenseday
 
Prediction model of algal blooms using logistic regression and confusion matrix
Prediction model of algal blooms using logistic regression and confusion matrix Prediction model of algal blooms using logistic regression and confusion matrix
Prediction model of algal blooms using logistic regression and confusion matrix
 
A new generalized lindley distribution
A new generalized lindley distributionA new generalized lindley distribution
A new generalized lindley distribution
 

More from Arup Guha

Structural breaks, unit root tests and long time series
Structural breaks, unit root tests and long time seriesStructural breaks, unit root tests and long time series
Structural breaks, unit root tests and long time series
Arup Guha
 
Partnership with a Premier Business School
Partnership with a Premier Business SchoolPartnership with a Premier Business School
Partnership with a Premier Business School
Arup Guha
 
Beer Market Analytics Solutions
Beer Market Analytics SolutionsBeer Market Analytics Solutions
Beer Market Analytics Solutions
Arup Guha
 
Database marketing
Database marketingDatabase marketing
Database marketing
Arup Guha
 
Using survival analysis results
Using survival analysis results Using survival analysis results
Using survival analysis results
Arup Guha
 
Measuring Actual Effect Of Tv Ads On Sales Lk
Measuring Actual Effect Of Tv Ads On Sales   LkMeasuring Actual Effect Of Tv Ads On Sales   Lk
Measuring Actual Effect Of Tv Ads On Sales Lk
Arup Guha
 

More from Arup Guha (6)

Structural breaks, unit root tests and long time series
Structural breaks, unit root tests and long time seriesStructural breaks, unit root tests and long time series
Structural breaks, unit root tests and long time series
 
Partnership with a Premier Business School
Partnership with a Premier Business SchoolPartnership with a Premier Business School
Partnership with a Premier Business School
 
Beer Market Analytics Solutions
Beer Market Analytics SolutionsBeer Market Analytics Solutions
Beer Market Analytics Solutions
 
Database marketing
Database marketingDatabase marketing
Database marketing
 
Using survival analysis results
Using survival analysis results Using survival analysis results
Using survival analysis results
 
Measuring Actual Effect Of Tv Ads On Sales Lk
Measuring Actual Effect Of Tv Ads On Sales   LkMeasuring Actual Effect Of Tv Ads On Sales   Lk
Measuring Actual Effect Of Tv Ads On Sales Lk
 

Logistic regression for ordered dependent variable with more than 2 levels

  • 1. Multinomial Logistic Regression Models January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 2. Logistic regression CAN handle dependent variables with more than two categories.
      - It is important to note whether the response variable is ordinal (ordered categories such as young, middle-aged, old) or nominal (unordered categories such as red, blue, black).
      - Some multinomial logistic models are appropriate only for ordered responses.
      - It is not mathematically necessary to consider the natural ordering when modeling an ordinal response, but considering it:
          - leads to a more parsimonious model
          - increases the power to detect relationships with other variables
  • 3. Applying logistic regression while respecting the natural order is done with a modeling technique called the "Proportional Odds Model".
      - Say the dependent variable Y has 4 states measuring the impact of radiation on the human body: fine, sick, serious, dead.
      - Let p1 = prob. of fine, p2 = prob. of sick, p3 = prob. of serious, p4 = prob. of dead.
      - Let us define a baseline category: fine, since this is the normal state (we shall see later why we need this).
  • 4. What if we break up the modeling of the 4-level ordered dependent variable into 3 binary logistic situations: (1) fine vs. sick, (2) fine vs. serious, (3) fine vs. dead?
      - Then we would have 3 logit equations:
          log(p2/p1) = B11 + B12*X1 + B13*X2
          log(p3/p1) = B21 + B22*X1 + B23*X2
          log(p4/p1) = B31 + B32*X1 + B33*X2
      - X is the degree of radiation, a categorical variable with 3 levels, so it is broken into 2 binary dummies X1 and X2.
      - So, 9 parameters are to be estimated.
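The three baseline-category logit equations above determine all four probabilities. The following Python sketch (not the SAS used in the deck) shows one way to recover p1..p4 from them; the coefficient values are made up purely for illustration and are not estimates from any data.

```python
import math

def baseline_category_probs(coefs, x):
    """Return [p1, p2, p3, p4] given log(pj/p1) = b0 + b1*X1 + b2*X2 for j = 2, 3, 4.

    coefs: one (b0, b1, b2) tuple per non-baseline category.
    x:     (X1, X2), the two radiation dummies.
    """
    # Linear predictor for each non-baseline category
    etas = [b0 + b1 * x[0] + b2 * x[1] for (b0, b1, b2) in coefs]
    # p1 = 1 / (1 + sum exp(eta_j)), pj = exp(eta_j) * p1
    denom = 1.0 + sum(math.exp(e) for e in etas)
    return [1.0 / denom] + [math.exp(e) / denom for e in etas]

# Hypothetical coefficients (B11..B33), chosen only to make the example run
coefs = [(-1.0, 0.8, 1.5), (-2.0, 1.0, 2.2), (-3.0, 1.2, 3.0)]

probs = baseline_category_probs(coefs, (1, 0))   # X1 = 1, X2 = 0
n_params = sum(len(c) for c in coefs)            # 3 equations x 3 coefficients

print(probs, n_params)
```

The parameter count of 9 is what the proportional odds model later reduces.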
  • 5. Now consider an alternative model for the same situation.
      - Cumulative logit model:
          L1 = log( p1 / (p2 + p3 + p4) )
          L2 = log( (p1 + p2) / (p3 + p4) )
          L3 = log( (p1 + p2 + p3) / p4 )
      - The obvious way to introduce covariates is:
          L1 = B11 + B12*X1 + B13*X2
          L2 = B21 + B22*X1 + B23*X2
          L3 = B31 + B32*X1 + B33*X2
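As a small numerical check of the cumulative logit definitions, the Python sketch below computes L1, L2, L3 from a set of category probabilities. The probabilities are hypothetical values chosen for illustration.

```python
import math

def cumulative_logits(p):
    """Return [L1, ..., L_{J-1}] with Lj = log( P(Y <= j) / P(Y > j) )."""
    logits = []
    cum = 0.0
    for pj in p[:-1]:          # the last category has no logit of its own
        cum += pj              # running P(Y <= j)
        logits.append(math.log(cum / (1.0 - cum)))
    return logits

# Hypothetical probabilities for fine, sick, serious, dead
p = [0.4, 0.3, 0.2, 0.1]
L1, L2, L3 = cumulative_logits(p)
print(L1, L2, L3)
```

Because P(Y <= j) grows with j, the three logits are always increasing in j.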
  • 6. Let us simplify the model by specifying that the slope parameters are identical across the logit equations. Then:
          L1 = A1 + B1*X1 + B2*X2
          L2 = A2 + B1*X1 + B2*X2
          L3 = A3 + B1*X1 + B2*X2
      - This is the proportional odds cumulative logit model.
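To see how the shared-slope model turns into category probabilities, here is a minimal Python sketch. The intercepts A1 < A2 < A3 and the slopes B1, B2 are hypothetical values, not fitted estimates.

```python
import math

def expit(t):
    """Inverse logit."""
    return 1.0 / (1.0 + math.exp(-t))

def proportional_odds_probs(alphas, betas, x):
    """Category probabilities under Lj = Aj + B1*X1 + B2*X2.

    alphas: increasing intercepts A1..A_{J-1}; betas: slopes shared by all logits.
    """
    eta = sum(b * xi for b, xi in zip(betas, x))
    # P(Y <= j) for j = 1..J-1, then P(Y <= J) = 1
    cum = [expit(a + eta) for a in alphas] + [1.0]
    # Individual probabilities are differences of consecutive cumulative ones
    return [cum[0]] + [cum[j] - cum[j - 1] for j in range(1, len(cum))]

alphas = [-0.5, 1.0, 2.5]   # hypothetical A1, A2, A3
betas = [0.9, 1.7]          # hypothetical B1, B2

probs = proportional_odds_probs(alphas, betas, (1, 0))
n_params = len(alphas) + len(betas)   # 3 intercepts + 2 slopes
print(probs, n_params)
```

With two dummies this model has 5 parameters, compared with 9 when each logit gets its own slopes.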
  • 7. Suppose that the categorical outcome is actually a categorized version of an unobservable (latent) continuous variable Z which has a logistic distribution.
      - The continuous scale is divided into five regions by four cut-points c1, c2, c3, c4, which are determined by nature.
      - If Z ≤ c1 we observe Y = 1; if c1 < Z ≤ c2 we observe Y = 2; and so on.
      - Suppose that Z is related to the X's through a linear regression.
      - Then the coarsened categorical variable Y will be related to the X's by a proportional-odds cumulative logit model.
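The latent-variable argument can be checked by simulation. The Python sketch below assumes Z = gamma*x + e with e drawn from a standard logistic distribution; the cut-points, the slope gamma, and the sample size are hypothetical. Under these assumptions P(Y ≤ j) = P(Z ≤ cj), so the cumulative logit is cj - gamma*x: the intercept Aj is the cut-point cj and the common slope is -gamma.

```python
import math
import random

random.seed(0)
cuts = [-1.0, 0.0, 1.0, 2.0]   # hypothetical cut-points c1 < c2 < c3 < c4
gamma = 0.8                     # hypothetical slope of the latent Z on x
n = 100_000                     # simulated observations per value of x

def draw_logistic():
    """Standard logistic draw via the inverse CDF (u clamped away from 0 and 1)."""
    u = min(max(random.random(), 1e-12), 1.0 - 1e-12)
    return math.log(u / (1.0 - u))

def empirical_cumulative_logits(x):
    """Estimate log( P(Y <= j) / P(Y > j) ) for each cut-point at a given x."""
    counts = [0] * len(cuts)
    for _ in range(n):
        z = gamma * x + draw_logistic()
        for j, c in enumerate(cuts):
            if z <= c:             # Z <= cj means Y <= j
                counts[j] += 1
    return [math.log(k / (n - k)) for k in counts]

logits_x0 = empirical_cumulative_logits(0.0)
logits_x1 = empirical_cumulative_logits(1.0)
# Change in each cumulative logit for a one-unit increase in x
shifts = [b - a for a, b in zip(logits_x0, logits_x1)]
print(logits_x0, shifts)
```

Up to simulation noise, every cumulative logit shifts by the same amount when x changes, which is exactly the proportional odds property.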
  • 8. Let us go back to the model:
          L1 = A1 + B1*X1 + B2*X2
          L2 = A2 + B1*X1 + B2*X2
          L3 = A3 + B1*X1 + B2*X2
      - Note that Lj is the log-odds of falling into or below category j versus falling above it.
      - Aj is the log-odds of falling into or below category j when X1 = X2 = 0.
      - Bk is the increase in the log-odds of falling into or below any category associated with a one-unit increase in Xk, holding all the other X-variables constant.
      - Therefore, a positive slope indicates a tendency for the response level to decrease as the variable increases.
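The interpretation of the slope can be illustrated numerically. In this Python sketch (hypothetical intercepts and a single hypothetical slope B1, with X2 held at 0), the ratio of cumulative odds for a one-unit increase in X1 comes out as exp(B1) at every cut-point.

```python
import math

def expit(t):
    """Inverse logit."""
    return 1.0 / (1.0 + math.exp(-t))

alphas = [-1.0, 0.5, 2.0]   # hypothetical A1, A2, A3
b1 = 0.6                    # hypothetical positive slope B1

def cumulative_odds(alpha, x1):
    """Odds of falling into or below category j at a given X1 (X2 = 0)."""
    p = expit(alpha + b1 * x1)      # P(Y <= j | X1)
    return p / (1.0 - p)

# Odds ratio for X1 = 1 versus X1 = 0, one per cumulative logit
odds_ratios = [cumulative_odds(a, 1.0) / cumulative_odds(a, 0.0) for a in alphas]

# With B1 > 0, P(Y <= j) rises with X1, so lower response levels become more likely
p_low_x0 = [expit(a) for a in alphas]
p_low_x1 = [expit(a + b1) for a in alphas]
print(odds_ratios)
```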
  • 9. Our example: 4 levels of radiation impact corresponding to 3 levels of radiation.
        proc logistic data=radiation_impact;
          freq count;
          class radiation / order=data param=ref ref=first;
          model sickness (order=data descending) = radiation
                / link=logit aggregate=(radiation) scale=none;
        run;
  • 10. freq count
      - This is important for specifying grouped data.
      - count is the variable that contains the frequency of occurrence of each observation.
      - In its absence, each row would be treated as a single observation.
      class radiation
      - Specifies that radiation is a classification variable to be used in the analysis.
      - With the param=ref option, SAS automatically generates n-1 binary dummies for the n categories of radiation.
  • 11. order=data
      - Tells SAS to arrange the levels in the order they occur in the input data. In the class statement this applies to the levels of radiation; in the model statement it applies to the response categories 1, 2, 3, 4.
      param=ref
      - Requests dummy (reference) coding for the classification variable 'radiation' listed in the class statement.
      ref=first
      - Designates the first ordered level of the classification variable, here the first radiation level in the data, as the reference level.
  • 12. order=data descending
      - This tells SAS to reverse the order of the logits.
      - So, instead of the cumulative logit model being
          L1 = log( p1 / (p2 + p3 + p4) )
          L2 = log( (p1 + p2) / (p3 + p4) )
          L3 = log( (p1 + p2 + p3) / p4 )
        it becomes
          L1 = log( p4 / (p1 + p2 + p3) )
          L2 = log( (p4 + p3) / (p1 + p2) )
          L3 = log( (p4 + p3 + p2) / p1 )
      - Now a positive B1 indicates that a higher value of X1 leads to a greater chance of radiation sickness.
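The effect of reversing the response order can be verified directly: each descending logit is the negative of one of the ascending logits, which is why the sign of the slope flips. A short Python sketch with hypothetical probabilities:

```python
import math

def cumulative_logits(p):
    """Return Lj = log( P(first j categories) / P(remaining categories) )."""
    logits = []
    cum = 0.0
    for pj in p[:-1]:
        cum += pj
        logits.append(math.log(cum / (1.0 - cum)))
    return logits

# Hypothetical probabilities for fine, sick, serious, dead
p = [0.4, 0.3, 0.2, 0.1]

ascending = cumulative_logits(p)          # log(p1/(p2+p3+p4)), ...
descending = cumulative_logits(p[::-1])   # log(p4/(p1+p2+p3)), ...

# descending[k] is minus the ascending logit taken in reverse order
negated_reversed = [-l for l in reversed(ascending)]
print(ascending, descending)
```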
  • 13. link=logit
      - Fits the cumulative logit model when there are more than two response categories.
      aggregate=(radiation)
      - Indicates that the goodness-of-fit statistics are to be calculated on the subpopulations defined by the variable radiation.
      scale=none
      - No correction is needed for the dispersion parameter.
      - To understand this, read up on overdispersion: it arises when the goodness-of-fit statistic exceeds its degrees of freedom and then needs to be corrected for.
  • 14. When we fit this model, the first output we see is:
        Score Test for the Proportional Odds Assumption
        Chi-Square    DF    Pr > ChiSq
        17.2866       21    0.6936
      - The null hypothesis is that the current proportional-odds cumulative logit model is true.
      - It seems we fail to reject the null, so we can proceed to the rest of the output under the current assumption.
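To connect the reported p-value to the chi-square statistic and its degrees of freedom, the Python sketch below uses the Wilson-Hilferty normal approximation to the chi-square upper tail. This is an approximation chosen so the example needs only the standard library. It is not how SAS computes the value, and it only roughly reproduces the 0.6936 in the output.

```python
import math

def chi2_sf_wilson_hilferty(x, df):
    """Approximate P(chi-square with df degrees of freedom > x).

    Wilson-Hilferty: (X/df)^(1/3) is roughly normal with
    mean 1 - 2/(9*df) and variance 2/(9*df).
    """
    mean = 1.0 - 2.0 / (9.0 * df)
    sd = math.sqrt(2.0 / (9.0 * df))
    z = ((x / df) ** (1.0 / 3.0) - mean) / sd
    # Upper-tail probability of a standard normal at z
    return 0.5 * math.erfc(z / math.sqrt(2.0))

# Statistic and degrees of freedom taken from the score test output above
p_value = chi2_sf_wilson_hilferty(17.2866, 21)
reject_at_5pct = p_value < 0.05
print(round(p_value, 3), reject_at_5pct)
```

A p-value this far above 0.05 means the test gives no evidence against the proportional odds assumption.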
  • 15. Ultimately we are interested in the predicted probabilities:
        OUTPUT <OUT=SAS-data-set> <options>
      PREDICTED=
      - For a cumulative model, this is the predicted cumulative probability (that is, the probability that the response variable is less than or equal to the value of _LEVEL_).
      PREDPROBS=I or C
      - INDIVIDUAL | I requests the predicted probability of each response level.
      - CUMULATIVE | C requests the cumulative predicted probability of each response level.