SlideShare a Scribd company logo
1 of 7
Download to read offline
Limited Dependent Variable
--------------------------------------------------------------------------------------------------------------------
Introduction:
So far, we know how to handle linear estimation models of the type:
Y = β0 + β 1*X 1 + β 2*X 2 + … + ε ≡ Xβ + ε
Where in sometimes we had to transform or add variables to get the equation to be linear:
o Taking logs of Y and/or the X’s
o Adding squared terms
o Adding interactions
Then we can run our estimation, do model checking, visualize results, etc. In all these models Y, the
dependent variable, was continuous. Independent variables could be dichotomous (dummy
variables), but not the dependent variable.
However there are models wherein the dependent variable outcome is binary (takes only two
responses) or has limited responses i.e. the value of response variable is limited within a range. There
are lots of times when the dependent variable of interest takes on one of two values. For example,
suppose you are studying car purchase decisions. You survey consumers to find out if they have
purchased a car in the past year. Your observations can be coded 1 (if the 2 consumer bought a car)
or 0 (if the consumer did not). When your dependent variable can take on one of two values, then
you have a dichotomous dependent variable and the resulting models are sometimes called 0/1
models. Such models fall under limited dependent variables.
We consider models of limited dependent variables in which the economic agent’s response is limited
in some way. The dependent variable, rather than being continuous on the real line (or half–line), is
restricted. In some cases, we are dealing with discrete choice: the response variable may be
restricted to a Boolean or binary choice, indicating that a particular course of action was or was not
selected. In others, it may take on only integer values, such as the number of children per family, or
the ordered values on a Likert scale. Alternatively, it may appear to be a continuous variable with a
number of responses at a threshold value. For instance, the response to the question “how many
hours did you work last week?” will be recorded as zero for the nonworking respondents. None of
these measures are amenable to being modeled by the linear regression methods that we know.
Hence to handle such cases we use Limited Dependent variable models.
Definition: A Limited Dependent Variable, Y, is defined as a dependent variable whose range is
substantively restricted.
Yi *= β’Xi + Ui, where Yi* is a latent variable Ui ~N (0, 2)
Yi * =
The common cases are:
o binary: Y ϵ {0, 1}
o multinomial: Y ϵ {0, 1, 2, ..., k}
o integer: Y ϵ {0, 1, 2, ...}
o censored: Y ϵ {𝑌∗
: 𝑌∗
≥0 }
Categorical and limited dependent variables include dependent variables that are binary,
multicategorical, ordinal, counted, censored, or from truncated populations (Long, 1997). Binary
variables have two categories used to indicate that an event has occurred or that some characteristic
is present (for example, placement in foster care coded as "yes" or "no"). Multicatcgorical variables
have three or more unordered categories (for example, type of foster care placement coded as "kin
care," "non-kin family care," "group home care," or "institutional care"). Ordinal variables have
{
𝑌𝑖
∗
if 𝑌𝑖
∗
> 0
0 if 𝑌𝑖
∗
≤ 0
ranked categories with unknown distances between adjacent categories (for example, restrictiveness
of foster care placement rated on a five-point ordered scale). Count variables indicate the number of
characteristics or events during a given period (for example, number of foster care placements while
in state custody).
Based on the above cases, limited dependent variables can be classified into two categories as below
Methods used to model LDV:
There are many models of LDVs based on what the limitations are. Some of them are :
o 0-1 dependent variables (dummies) by Probit and Logit
o Ordered dependent variables by ordered Probit and Logit
o Categorical dependent variables (with more than two categories) by multinomial logit
o Truncated dependent variables by Heckman’s procedure
o Censored dependent variables by Tobit
o Count (integer) dependent variables by Poisson regression
o Hazard (length) dependent variables by hazard models
Limited Dependent Variable
Discrete dependent variable. Continuous dependent variable
Discrete Choice Models
(Logit /Probit models)
Truncated/Censored Regression
Models
Why LDV Models Are Different
To correctly analyze and interpret any LDV model, it is important to understand two fundamental
differences between LDV and OLS type models.
First, LDV models are intrinsically nonlinear, which means the relationship to be estimated cannot
be written as a summation of terms, where each term is a model coefficient times a model variable.
The intrinsic nonlinearity of LDV models has two major methodological ramifications. First, an
explanatory variable’s marginal effect –– the effect of a unit change in an explanatory variable on the
dependent variable –– does not equal the variable’s model coefficient. Second, the value of this
marginal effect varies over the value of all model variables. These facts imply that one cannot infer
the nature of the true relationship between an explanatory variable and the dependent variable
based solely on the estimated coefficient in a LDV model.
The second fundamental difference is that most LDV models are estimated using the method of
maximum likelihood which, unlike the method of least squares, is not based on minimizing error
variance. This means there is no measure of model “fit” directly comparable to the R-square in OLS
and, as a result, model assessment is largely restricted to testing the joint significance of all model
variables as is done in OLS using an F-test of overall model significance.
Model Estimation:
Basic Problems with OLS estimation for LDV is
o Heteroscedastic in error terms
o Predictions not constrained to match actual outcomes, real problem with actual outcomes,
real problem with predicted values being negative when a negative number isn’t possible
Although it is possible to estimate LDV models with OLS the model is likely to produce point
predictions outside the unit interval [0,1]. We could arbitrarily constrain them to either 0 or 1, but
this linear probability model has other problems: the error term cannot satisfy the assumption of
homoscedasticity. For a given set of X values, there are only two possible values for the disturbance:
−Xβ and (1 − Xβ): the disturbance follows a Binomial distribution. Given the properties of the
Binomial distribution, the variance of the disturbance process, conditioned on X, is Var (u|X) = Xβ (1
− Xβ).
There is no constraint to ensure that this quantity will be positive for arbitrary X values. Therefore,
it will rarely be productive to utilize regression with a binary response variable; we must follow a
different strategy.
Because of the limited ranges of the dependent variable, the standard additive normal error is not
tenable for these models. Instead we must model the probability of various discrete outcomes. LDV
models are usually estimated by maximum likelihood, given the assumed distribution of the
conditional probabilities of various outcomes.
Continuous limited dependent variable:
We turn now to a context where the response variable is not binary nor necessarily integer, but
continuous and subject to truncation. We must fully understand the context in which the data were
generated. Nevertheless, it is quite important that we identify situations of truncated or censored
response variables. Utilizing these variables as the dependent variable in a regression equation
without consideration of these qualities will be misleading.
Truncated Regression
In Truncated Regression model the dependent variable is truncated at a certain point. Observations
with the values of Y above a threshold are not included in the sample. The sample is drawn from a
subset of the population so that only certain values are included in the sample. A subset of
observations are dropped, thus, only the truncated data are available for the regression.
Definition: Case where
),( ii xy
is observed only when
ayi 
(left truncation) or when
byi 
(right
truncation) or when
dyc i 
(double truncation). Example: Let iy
= the profit of the i-th firm as
a percentage of assets and ix
= the four firm concentration ratio of industry the firm is in. Suppose
only firms with positive profit rates are observed and firms with negative profit rates are not
observed. In this case a= 0 and we have a problem where the dependent variable is left truncated.
iii xy   10 Where
0iy
for Ni ,,1  and
),0( 2
 Ni 
Neglecting the truncation can lead to biased estimates of 0 and 1 .
Truncated regression models are used for data where whole observations are missing so that the
values for the dependent and the independent variables are unknown
EXAMPLES:
1. A study of the determinants of incomes of the poor. Only households with income below a certain
poverty line are part of the sample.
2. Suppose we have a sample of AIEEE rejects-those who scored below the 30th percentile. We wish
to estimate an IQ equation: AIEEE=f (education, age, socio economic characteristics etc. ). We will
need to take into account the fact that dependent variable is truncated.
Censored Regression
Censored variables are those whose value is known over some range but unknown beyond a certain
value because they only were recorded or collected (that is, censored) as being at or beyond that
value. For example, exact income might be recorded for those with income less than $100,000.
However, for those with income greater than or equal to $100,000, exact income might be recorded
as "greater than or equal to $100,000" rather than an exact amount. So, exact values for income are
missing above $100,000, but the potential range of income is known. Typically censoring is used to
elicit more reliable data or, as with income, to ensure confidentiality
In the example given income is "censored from above." Censoring can occur from above ("right
censored," sometimes referred to as a "ceiling effect"), below ("left censored," sometimes referred to
as a "floor effect"), or both. Censoring can occur with continuous, ordinal, or count dependent
variables
Censoring might not be apparent until after data are collected and an examination of the distribution
of the dependent variable suggests an upper or lower value at which observations are clustered. For
example, the frequency of enjoyable time spent by study participants with their children in the past
30 days might be measured using a six-point scale, ranging from never to almost every day, and
results might indicate a large cluster of observations at the upper end of the scale. If such clustering
is due to the way the data were collected or recorded, it is reasonable to treat the variable as
censored. For example, the use of "almost every day" as the upper-scale anchor is analogous to using
an upper-income category of greater than or equal to $100,000 because, conceivably, an upper
category such as "every day" might have been used.
Definition: Consider the sample of size n , where Y* is recorded only for those values of Y* greater
than a constant C . For those values of Y*<= C we record the value of C. That is the observations are
Yi * = i=1, 2, 3……. n
The resulting sample y1, y2,…..,yn is said to be a censored sample .
The censored regression model is defined as
Yi * =
Where β is a kx1 vector if unknown parameters and Xi is a kx1 vector of known constants and Ui are
the disturbance that are independently and normally distributed with mean zero and variance 2.
This model was first studied by Tobin (1958) and is also called Tobit model
{
𝑌𝑖
∗
if 𝑌𝑖
∗
> C
𝐶 if 𝑌𝑖
∗
≤ C
{
β’Xi + Ui if 𝑅𝐻𝑆 > 𝑜
𝑜 otherwise

More Related Content

What's hot

Functional Forms of Regression Models | Eonomics
Functional Forms of Regression Models | EonomicsFunctional Forms of Regression Models | Eonomics
Functional Forms of Regression Models | EonomicsTransweb Global Inc
 
Dummy variable
Dummy variableDummy variable
Dummy variableAkram Ali
 
Multicollinearity1
Multicollinearity1Multicollinearity1
Multicollinearity1Muhammad Ali
 
Auto Correlation Presentation
Auto Correlation PresentationAuto Correlation Presentation
Auto Correlation PresentationIrfan Hussain
 
Econometric model ing
Econometric model ingEconometric model ing
Econometric model ingMatt Grant
 
Functional forms in regression
Functional forms in regressionFunctional forms in regression
Functional forms in regressionB SWAMINATHAN
 
Autocorrelation- Detection- part 2- Breusch-Godfrey Test and Durbin's h test
Autocorrelation- Detection- part 2- Breusch-Godfrey Test and Durbin's h testAutocorrelation- Detection- part 2- Breusch-Godfrey Test and Durbin's h test
Autocorrelation- Detection- part 2- Breusch-Godfrey Test and Durbin's h testShilpa Chaudhary
 
Rules for identification
Rules for identificationRules for identification
Rules for identificationGarimaGupta229
 
Autocorrelation- Concept, Causes and Consequences
Autocorrelation- Concept, Causes and ConsequencesAutocorrelation- Concept, Causes and Consequences
Autocorrelation- Concept, Causes and ConsequencesShilpa Chaudhary
 
Heteroscedasticity
HeteroscedasticityHeteroscedasticity
HeteroscedasticityMuhammad Ali
 
Econometrics lecture 1st
Econometrics lecture 1stEconometrics lecture 1st
Econometrics lecture 1stIshaq Ahmad
 
Forecasting with Vector Autoregression
Forecasting with Vector AutoregressionForecasting with Vector Autoregression
Forecasting with Vector AutoregressionBryan Butler, MBA, MS
 
Analysis of Time Series
Analysis of Time SeriesAnalysis of Time Series
Analysis of Time SeriesManu Antony
 

What's hot (20)

Dummy variables
Dummy variablesDummy variables
Dummy variables
 
The probit model
The probit modelThe probit model
The probit model
 
Autocorrelation
AutocorrelationAutocorrelation
Autocorrelation
 
Functional Forms of Regression Models | Eonomics
Functional Forms of Regression Models | EonomicsFunctional Forms of Regression Models | Eonomics
Functional Forms of Regression Models | Eonomics
 
Probit and logit model
Probit and logit modelProbit and logit model
Probit and logit model
 
Dummy variable
Dummy variableDummy variable
Dummy variable
 
Multicollinearity1
Multicollinearity1Multicollinearity1
Multicollinearity1
 
Auto Correlation Presentation
Auto Correlation PresentationAuto Correlation Presentation
Auto Correlation Presentation
 
Econometric model ing
Econometric model ingEconometric model ing
Econometric model ing
 
Functional forms in regression
Functional forms in regressionFunctional forms in regression
Functional forms in regression
 
Logistic Regression Analysis
Logistic Regression AnalysisLogistic Regression Analysis
Logistic Regression Analysis
 
Autocorrelation- Detection- part 2- Breusch-Godfrey Test and Durbin's h test
Autocorrelation- Detection- part 2- Breusch-Godfrey Test and Durbin's h testAutocorrelation- Detection- part 2- Breusch-Godfrey Test and Durbin's h test
Autocorrelation- Detection- part 2- Breusch-Godfrey Test and Durbin's h test
 
Rules for identification
Rules for identificationRules for identification
Rules for identification
 
Autocorrelation- Concept, Causes and Consequences
Autocorrelation- Concept, Causes and ConsequencesAutocorrelation- Concept, Causes and Consequences
Autocorrelation- Concept, Causes and Consequences
 
Introduction to Econometrics
Introduction to EconometricsIntroduction to Econometrics
Introduction to Econometrics
 
Heteroscedasticity
HeteroscedasticityHeteroscedasticity
Heteroscedasticity
 
Econometrics ch3
Econometrics ch3Econometrics ch3
Econometrics ch3
 
Econometrics lecture 1st
Econometrics lecture 1stEconometrics lecture 1st
Econometrics lecture 1st
 
Forecasting with Vector Autoregression
Forecasting with Vector AutoregressionForecasting with Vector Autoregression
Forecasting with Vector Autoregression
 
Analysis of Time Series
Analysis of Time SeriesAnalysis of Time Series
Analysis of Time Series
 

Viewers also liked

Daily Reports "PPL IAIN Salatiga" 2017
Daily Reports "PPL IAIN Salatiga" 2017Daily Reports "PPL IAIN Salatiga" 2017
Daily Reports "PPL IAIN Salatiga" 2017ND Arisanti
 
3 java - variable type
3  java - variable type3  java - variable type
3 java - variable typevinay arora
 
Types of variables in statistics
Types of variables in statisticsTypes of variables in statistics
Types of variables in statisticsZakaria Hossain
 
Importance of null and alternate hypothesis in a scientific research study
Importance of null and alternate hypothesis in a scientific research study Importance of null and alternate hypothesis in a scientific research study
Importance of null and alternate hypothesis in a scientific research study Charm Rammandala
 
Variables And Measurement Scales
Variables And Measurement ScalesVariables And Measurement Scales
Variables And Measurement Scalesguesta861fa
 
Research hypothesis....ppt
Research hypothesis....pptResearch hypothesis....ppt
Research hypothesis....pptRahul Dhaker
 
research-methodology-ppt
 research-methodology-ppt research-methodology-ppt
research-methodology-pptsheetal321
 
Variables
 Variables Variables
Variablesshoffma5
 
Types of Variables
Types of VariablesTypes of Variables
Types of VariablesAli Mustafa
 
Research Methodology
Research MethodologyResearch Methodology
Research Methodologysh_neha252
 

Viewers also liked (16)

Variable
VariableVariable
Variable
 
Daily Reports "PPL IAIN Salatiga" 2017
Daily Reports "PPL IAIN Salatiga" 2017Daily Reports "PPL IAIN Salatiga" 2017
Daily Reports "PPL IAIN Salatiga" 2017
 
3 java - variable type
3  java - variable type3  java - variable type
3 java - variable type
 
6 java - loop
6  java - loop6  java - loop
6 java - loop
 
Types of variables in statistics
Types of variables in statisticsTypes of variables in statistics
Types of variables in statistics
 
Importance of null and alternate hypothesis in a scientific research study
Importance of null and alternate hypothesis in a scientific research study Importance of null and alternate hypothesis in a scientific research study
Importance of null and alternate hypothesis in a scientific research study
 
Kinds Of Variable
Kinds Of VariableKinds Of Variable
Kinds Of Variable
 
Variables And Measurement Scales
Variables And Measurement ScalesVariables And Measurement Scales
Variables And Measurement Scales
 
Research hypothesis....ppt
Research hypothesis....pptResearch hypothesis....ppt
Research hypothesis....ppt
 
Basic variables ppt
Basic variables pptBasic variables ppt
Basic variables ppt
 
Research methodology
Research methodologyResearch methodology
Research methodology
 
research-methodology-ppt
 research-methodology-ppt research-methodology-ppt
research-methodology-ppt
 
Variables
 Variables Variables
Variables
 
Research methodology notes
Research methodology notesResearch methodology notes
Research methodology notes
 
Types of Variables
Types of VariablesTypes of Variables
Types of Variables
 
Research Methodology
Research MethodologyResearch Methodology
Research Methodology
 

Similar to Introduction to Limited Dependent variable

7. logistics regression using spss
7. logistics regression using spss7. logistics regression using spss
7. logistics regression using spssDr Nisha Arora
 
Logistic regression
Logistic regressionLogistic regression
Logistic regressionDrZahid Khan
 
ders 8 Quantile-Regression.ppt
ders 8 Quantile-Regression.pptders 8 Quantile-Regression.ppt
ders 8 Quantile-Regression.pptErgin Akalpler
 
Logistic regression
Logistic regressionLogistic regression
Logistic regressionRupak Roy
 
Logistic regression and analysis using statistical information
Logistic regression and analysis using statistical informationLogistic regression and analysis using statistical information
Logistic regression and analysis using statistical informationAsadJaved304231
 
Quantitative Methods for Lawyers - Class #22 - Regression Analysis - Part 5
Quantitative Methods for Lawyers - Class #22 - Regression Analysis - Part 5Quantitative Methods for Lawyers - Class #22 - Regression Analysis - Part 5
Quantitative Methods for Lawyers - Class #22 - Regression Analysis - Part 5Daniel Katz
 
Applications of regression analysis - Measurement of validity of relationship
Applications of regression analysis - Measurement of validity of relationshipApplications of regression analysis - Measurement of validity of relationship
Applications of regression analysis - Measurement of validity of relationshipRithish Kumar
 
Chapter6
Chapter6Chapter6
Chapter6Vu Vo
 
Logistic regression
Logistic regressionLogistic regression
Logistic regressionDrZahid Khan
 
Multinomial logisticregression basicrelationships
Multinomial logisticregression basicrelationshipsMultinomial logisticregression basicrelationships
Multinomial logisticregression basicrelationshipsAnirudha si
 
A presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.pptA presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.pptvigia41
 
Regression analysis ppt
Regression analysis pptRegression analysis ppt
Regression analysis pptElkana Rorio
 

Similar to Introduction to Limited Dependent variable (20)

Bus 173_6.pptx
Bus 173_6.pptxBus 173_6.pptx
Bus 173_6.pptx
 
Log reg pdf.pdf
Log reg pdf.pdfLog reg pdf.pdf
Log reg pdf.pdf
 
Logistic regression sage
Logistic regression sageLogistic regression sage
Logistic regression sage
 
Ders 2 ols .ppt
Ders 2 ols .pptDers 2 ols .ppt
Ders 2 ols .ppt
 
7. logistics regression using spss
7. logistics regression using spss7. logistics regression using spss
7. logistics regression using spss
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
ders 8 Quantile-Regression.ppt
ders 8 Quantile-Regression.pptders 8 Quantile-Regression.ppt
ders 8 Quantile-Regression.ppt
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
Logistic regression and analysis using statistical information
Logistic regression and analysis using statistical informationLogistic regression and analysis using statistical information
Logistic regression and analysis using statistical information
 
Linear regression theory
Linear regression theoryLinear regression theory
Linear regression theory
 
MModule 1 ppt.pptx
MModule 1 ppt.pptxMModule 1 ppt.pptx
MModule 1 ppt.pptx
 
Quantitative Methods for Lawyers - Class #22 - Regression Analysis - Part 5
Quantitative Methods for Lawyers - Class #22 - Regression Analysis - Part 5Quantitative Methods for Lawyers - Class #22 - Regression Analysis - Part 5
Quantitative Methods for Lawyers - Class #22 - Regression Analysis - Part 5
 
Applications of regression analysis - Measurement of validity of relationship
Applications of regression analysis - Measurement of validity of relationshipApplications of regression analysis - Measurement of validity of relationship
Applications of regression analysis - Measurement of validity of relationship
 
Lecture 4
Lecture 4Lecture 4
Lecture 4
 
Chapter6
Chapter6Chapter6
Chapter6
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
Machine Learning (Classification Models)
Machine Learning (Classification Models)Machine Learning (Classification Models)
Machine Learning (Classification Models)
 
Multinomial logisticregression basicrelationships
Multinomial logisticregression basicrelationshipsMultinomial logisticregression basicrelationships
Multinomial logisticregression basicrelationships
 
A presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.pptA presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.ppt
 
Regression analysis ppt
Regression analysis pptRegression analysis ppt
Regression analysis ppt
 

Recently uploaded

Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...ThinkInnovation
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxFurkanTasci3
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 

Recently uploaded (20)

Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptx
 
Decoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in ActionDecoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in Action
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 

Introduction to Limited Dependent variable

  • 1. Limited Dependent Variable -------------------------------------------------------------------------------------------------------------------- Introduction: So far, we know how to handle linear estimation models of the type: Y = β0 + β 1*X 1 + β 2*X 2 + … + ε ≡ Xβ + ε Where in sometimes we had to transform or add variables to get the equation to be linear: o Taking logs of Y and/or the X’s o Adding squared terms o Adding interactions Then we can run our estimation, do model checking, visualize results, etc. In all these models Y, the dependent variable, was continuous. Independent variables could be dichotomous (dummy variables), but not the dependent variable. However there are models wherein the dependent variable outcome is binary (takes only two responses) or has limited responses i.e. the value of response variable is limited within a range. There are lots of times when the dependent variable of interest takes on one of two values. For example, suppose you are studying car purchase decisions. You survey consumers to find out if they have purchased a car in the past year. Your observations can be coded 1 (if the 2 consumer bought a car) or 0 (if the consumer did not). When your dependent variable can take on one of two values, then you have a dichotomous dependent variable and the resulting models are sometimes called 0/1 models. Such models fall under limited dependent variables. We consider models of limited dependent variables in which the economic agent’s response is limited in some way. The dependent variable, rather than being continuous on the real line (or half–line), is restricted. In some cases, we are dealing with discrete choice: the response variable may be restricted to a Boolean or binary choice, indicating that a particular course of action was or was not selected. In others, it may take on only integer values, such as the number of children per family, or the ordered values on a Likert scale. Alternatively, it may appear to be a continuous variable with a number of responses at a threshold value. For instance, the response to the question “how many hours did you work last week?” will be recorded as zero for the nonworking respondents. None of
  • 2. these measures are amenable to being modeled by the linear regression methods that we know. Hence to handle such cases we use Limited Dependent variable models. Definition: A Limited Dependent Variable, Y, is defined as a dependent variable whose range is substantively restricted. Yi *= β’Xi + Ui, where Yi* is a latent variable Ui ~N (0, 2) Yi * = The common cases are: o binary: Y ϵ {0, 1} o multinomial: Y ϵ {0, 1, 2, ..., k} o integer: Y ϵ {0, 1, 2, ...} o censored: Y ϵ {𝑌∗ : 𝑌∗ ≥0 } Categorical and limited dependent variables include dependent variables that are binary, multicategorical, ordinal, counted, censored, or from truncated populations (Long, 1997). Binary variables have two categories used to indicate that an event has occurred or that some characteristic is present (for example, placement in foster care coded as "yes" or "no"). Multicatcgorical variables have three or more unordered categories (for example, type of foster care placement coded as "kin care," "non-kin family care," "group home care," or "institutional care"). Ordinal variables have { 𝑌𝑖 ∗ if 𝑌𝑖 ∗ > 0 0 if 𝑌𝑖 ∗ ≤ 0
  • 3. ranked categories with unknown distances between adjacent categories (for example, restrictiveness of foster care placement rated on a five-point ordered scale). Count variables indicate the number of characteristics or events during a given period (for example, number of foster care placements while in state custody). Based on the above cases, limited dependent variables can be classified into two categories as below Methods used to model LDV: There are many models of LDVs based on what the limitations are. Some of them are : o 0-1 dependent variables (dummies) by Probit and Logit o Ordered dependent variables by ordered Probit and Logit o Categorical dependent variables (with more than two categories) by multinomial logit o Truncated dependent variables by Heckman’s procedure o Censored dependent variables by Tobit o Count (integer) dependent variables by Poisson regression o Hazard (length) dependent variables by hazard models Limited Dependent Variable Discrete dependent variable. Continuous dependent variable Discrete Choice Models (Logit /Probit models) Truncated/Censored Regression Models
  • 4. Why LDV Models Are Different To correctly analyze and interpret any LDV model, it is important to understand two fundamental differences between LDV and OLS type models. First, LDV models are intrinsically nonlinear, which means the relationship to be estimated cannot be written as a summation of terms, where each term is a model coefficient times a model variable. The intrinsic nonlinearity of LDV models has two major methodological ramifications. First, an explanatory variable’s marginal effect –– the effect of a unit change in an explanatory variable on the dependent variable –– does not equal the variable’s model coefficient. Second, the value of this marginal effect varies over the value of all model variables. These facts imply that one cannot infer the nature of the true relationship between an explanatory variable and the dependent variable based solely on the estimated coefficient in a LDV model. The second fundamental difference is that most LDV models are estimated using the method of maximum likelihood which, unlike the method of least squares, is not based on minimizing error variance. This means there is no measure of model “fit” directly comparable to the R-square in OLS and, as a result, model assessment is largely restricted to testing the joint significance of all model variables as is done in OLS using an F-test of overall model significance. Model Estimation: Basic Problems with OLS estimation for LDV is o Heteroscedastic in error terms o Predictions not constrained to match actual outcomes, real problem with actual outcomes, real problem with predicted values being negative when a negative number isn’t possible Although it is possible to estimate LDV models with OLS the model is likely to produce point predictions outside the unit interval [0,1]. We could arbitrarily constrain them to either 0 or 1, but this linear probability model has other problems: the error term cannot satisfy the assumption of homoscedasticity. For a given set of X values, there are only two possible values for the disturbance: −Xβ and (1 − Xβ): the disturbance follows a Binomial distribution. Given the properties of the Binomial distribution, the variance of the disturbance process, conditioned on X, is Var (u|X) = Xβ (1 − Xβ).
  • 5. There is no constraint to ensure that this quantity will be positive for arbitrary X values. Therefore, it will rarely be productive to utilize regression with a binary response variable; we must follow a different strategy. Because of the limited ranges of the dependent variable, the standard additive normal error is not tenable for these models. Instead we must model the probability of various discrete outcomes. LDV models are usually estimated by maximum likelihood, given the assumed distribution of the conditional probabilities of various outcomes. Continuous limited dependent variable: We turn now to a context where the response variable is not binary nor necessarily integer, but continuous and subject to truncation. We must fully understand the context in which the data were generated. Nevertheless, it is quite important that we identify situations of truncated or censored response variables. Utilizing these variables as the dependent variable in a regression equation without consideration of these qualities will be misleading. Truncated Regression In Truncated Regression model the dependent variable is truncated at a certain point. Observations with the values of Y above a threshold are not included in the sample. The sample is drawn from a subset of the population so that only certain values are included in the sample. A subset of observations are dropped, thus, only the truncated data are available for the regression. Definition: Case where ),( ii xy is observed only when ayi  (left truncation) or when byi  (right truncation) or when dyc i  (double truncation). Example: Let iy = the profit of the i-th firm as a percentage of assets and ix = the four firm concentration ratio of industry the firm is in. Suppose only firms with positive profit rates are observed and firms with negative profit rates are not observed. In this case a= 0 and we have a problem where the dependent variable is left truncated. iii xy   10 Where 0iy for Ni ,,1  and ),0( 2  Ni  Neglecting the truncation can lead to biased estimates of 0 and 1 . Truncated regression models are used for data where whole observations are missing so that the values for the dependent and the independent variables are unknown
  • 6. EXAMPLES: 1. A study of the determinants of incomes of the poor. Only households with income below a certain poverty line are part of the sample. 2. Suppose we have a sample of AIEEE rejects-those who scored below the 30th percentile. We wish to estimate an IQ equation: AIEEE=f (education, age, socio economic characteristics etc. ). We will need to take into account the fact that dependent variable is truncated. Censored Regression Censored variables are those whose value is known over some range but unknown beyond a certain value because they only were recorded or collected (that is, censored) as being at or beyond that value. For example, exact income might be recorded for those with income less than $100,000. However, for those with income greater than or equal to $100,000, exact income might be recorded as "greater than or equal to $100,000" rather than an exact amount. So, exact values for income are missing above $100,000, but the potential range of income is known. Typically censoring is used to elicit more reliable data or, as with income, to ensure confidentiality In the example given income is "censored from above." Censoring can occur from above ("right censored," sometimes referred to as a "ceiling effect"), below ("left censored," sometimes referred to as a "floor effect"), or both. Censoring can occur with continuous, ordinal, or count dependent variables
  • 7. Censoring might not be apparent until after data are collected and an examination of the distribution of the dependent variable suggests an upper or lower value at which observations are clustered. For example, the frequency of enjoyable time spent by study participants with their children in the past 30 days might be measured using a six-point scale, ranging from never to almost every day, and results might indicate a large cluster of observations at the upper end of the scale. If such clustering is due to the way the data were collected or recorded, it is reasonable to treat the variable as censored. For example, the use of "almost every day" as the upper-scale anchor is analogous to using an upper-income category of greater than or equal to $100,000 because, conceivably, an upper category such as "every day" might have been used. Definition: Consider the sample of size n , where Y* is recorded only for those values of Y* greater than a constant C . For those values of Y*<= C we record the value of C. That is the observations are Yi * = i=1, 2, 3……. n The resulting sample y1, y2,…..,yn is said to be a censored sample . The censored regression model is defined as Yi * = Where β is a kx1 vector if unknown parameters and Xi is a kx1 vector of known constants and Ui are the disturbance that are independently and normally distributed with mean zero and variance 2. This model was first studied by Tobin (1958) and is also called Tobit model { 𝑌𝑖 ∗ if 𝑌𝑖 ∗ > C 𝐶 if 𝑌𝑖 ∗ ≤ C { β’Xi + Ui if 𝑅𝐻𝑆 > 𝑜 𝑜 otherwise