Path analysis with manifest variablesPresentation Transcript
PATH ANALYSIS WITH MANIFEST VARIABLES Carlos Gabriel Contreras Msc Department of statistics University of California Los Angeles.
Introduction: the basic of path analysis Path analysis can be used to test theoretical models that specify causal relationship between a number of observed variables. Path Analysis determines whether the theoretical model successfully accounts for the actual relationship observed in the sample data. In SAS, the ouput of the CALIS procedure provides indices that indicate whether the model, as a whole, fits the data, as quell as significance tests for specific causal paths. When a model provides a relatively poor fit to the data, additional retults from PROC CALIS can be used to modify the model and improve its fit.
Introduction: the basic of path analysis This presentation deals only with causal models in which all variables are manifest (observed) variables. It does not deal with path models that specify causal relationship between LATENT (unobserved) variables (Such models are often called LISREL type.
Some simple path diagrams When studying complex phenomena, if often becomes clear that a given outcome variable of interest is actually influenced by a variety of other variables, few outcome variables of any importance are causally determined by just one variable. a theorist in industrial psychology who believes that an employee’s work performance (the outcome variable of interest) is influenced by the following four variables The employee’s level of intelligence The employee’s level of work motivation. Work place norms Supervisory support.
Intelligence Motivation Work performance Work place norms Supervisory support
Continued. Intelligence, motivation, work place norms, and supervisory support are all antecedent variables within this framework, as they are clearly predicted to precedent and have a causal effect on work performance. Similarly, work performance is the consequent variable in the model, as it is said to be affected by these antecedent variables. In the path analysis literature, many refer to antecedent variables as independent variables, and consequent variables as dependent variables (although these terms obviously do not have the same meaning in path analysis research, which is correlational in nature, as they have in experimental research)
Continued. The boxes in the path are connected to one another by mean of straight, single-headed arrows and curved, double-headed arrows. In the path analysis literature, straight, single-headed arrow is generally used to represent a unidirectional causal path in a path diagram. The arrow originates at the variable exerting the causal influence (the independent variable), and the arrow points toward the variable being affected (the dependent variable). In contrast, a curved double-headed arrow connecting two variables represent a simple covariance or correlation, between the variables. A curved arrow connecting two variables means that the two variables are expected to co vary, but that no hypothesis is made regarding any causal influence between them.
Intelligence Motivation Work place norms Work performance Supervisory support
Continued. The anterior diagram includes the same four variables discussed earlier, but they are arranged in a somewhat different causal sequence. Most notably, worker motivation is now viewed as a mediator variable Mediator variable is a variable that mediates, or coveys, the effect of an antecedent variable onto a consequent variable.
ENDOFENOUS VERSUS EXOGENOUS VARIABLES. In path analysis, a distinction is made between endogenous variables and exogenous variables. An endogenous variables is one whose variability is predicted to be causally affected by other variables in the model. Any variables that has a straight, single-headed arrow pointing at it is an endogenous variable. Work performance in previous examples are clearly a endogenous variable. On the other hand, are constructs that are influenced only by variables that lie outside of the causal model. Exogenous variables do not have any straight, single-headed arrow pointing at them. Intelligence, norms, and support are all exogenous variables.
MANIFEST VERSUS LATENT VARIABLES. A manifest variables is one that is directly measured or observed in the course of an investigation, while a latent variable is a hypothetical construct that is no directly measured or observed. For example, scores on the Weschler Adult Intelligence Scale WAIS, is a manifest variable; it is possible to directly determinate exactly where each subject stands on this variable. On the other hand, intelligence may ne thought of as a latent variable; it is a construct that is presumed to exist, although is cannot be directly observed. In the diagram, manifest variables are represented by rectangles
RECURSIVE VERSUS NONRECURSIVE MODELS. A recursive model is on in which causation flows in only one direction. In a recursive model. A consequent variable never exerts causal influence (either directly or indirectly) on an antecedent variable that first exerts causal influence on it. In other words, recursive models are unidirectional. In contrast, in a no recursive path model, causation may flow in more that one direction, an a variable may have direct or indirect effect on another variable that preceded it in the causal chain.
Necessary conditions in path models: Interval or ratio level measurement. All endogenous variables should ne assessed on an interval or ratio level of measurement. Exogenous variables should also be on an interval or ratio, although exogenous variables may be assessed at a nominal level if they are dummy coded. Alternative procedures for situations in which these assumptions are violated have been discussed elsewhere.
Necessary conditions in path models: minimal number of values. Endogenous variables should be continuous and should assume a minimum of four values.
Necessary conditions in path models: Normally distributed data Although parameter estimated may be correct with non normal data, the statistical test used with PROC CALIS in SAS (Such as the model Chi square test and significance test for path coefficients) assume a multivariate normal distribution. Is has been argued, however, that the maximum likelihood and generalized least squares estimation procedures appear to be fairly robust against moderate violation of this assumption.
Necessary conditions in path models: Linear and additive relationships. Relationships between variables should be linear and additive (that is, relationships between independent and dependent variables should not be curvilinear or interactive)
Necessary conditions in path models: Absence of multicollinearity. Variables should be free of multicollinearity. Multicollinearity is a condition in which one or more variables exhibit very strong correlation.
Necessary conditions in path models: Absence of measurement error. All independent variables should be measured without error. This means that any independent manifest variable that is analyzed should be a perfectly reliable indicator of the underlying construct that it is with variables typically studied in the social sciences, this may be the most frequently violated assumption in the use of path analysis.
Necessary conditions in path models: Inclusion of all nontrivial causes. All know nontrivial causes of a model’s endogenous variables should be included in the model as independent variables.
Necessary conditions in path models: Minimal number of observation. Path analysis is a large sample procedure; it is best that the analysis is based on at least 200 subject (although results based on fewer subject have certainly been reported in the literature) In addition, there should be ratio of at least 5 subjects for each parameter to be estimated. The total number of parameters is the sum of the: path coefficients, variances, covariances.