The use of one of several methods for reducing a set of variables to a lesser number of new variables, each of which is a function of one or more of the original variables.
2. Basics
• Used for Date Reduction & Summarization
• Reduction to manageable level
• Interval or ratio scale
• Interdependence technique: Make no
difference between independent &
dependent variable.
• Multivariate analysis
Factor Analysis Vishal Sarin
3. Assumptions
• Normality
• Linearity
• Factorability of correlation matrix – there
should be sufficient correlations in the data
matrix of variables to justify the application of
factor analysis. Number of correlations at 0.01
level of significance can range from 30% to
50% according to the requirement.
Factor Analysis Vishal Sarin
4. When to apply Factor Analysis?
• To identify the underlying dimensions or
factors.
• To identify a new, smaller set of uncorrelated
variables to replace the original set of
correlated variables in subsequent
multivariate analysis.
• To identify a smaller set of salient variables
from a larger set for use in subsequent
multivariate analysis.
Factor Analysis Vishal Sarin
5. Where we can use?
• Market segmentation
• To determine the brand attributes
• Identify the characteristics of price-sensitive
consumers.
• Understand the media consumption habits of
the target market.
Factor Analysis Vishal Sarin
6. Premises
• Similar to multiple regression analysis i.e. each
variable is expressed in a linear combination.
• Factor model:
Xi = Ai1F1 + Ai2F2 + Ai3F3+ ……..+ AimFm+ViUi
Xi = ith standardized variable
Aij= standardized multiple regression coefficient of variable I on common factor j
F = Common factor
Vi= standardized regression coefficient of variable i on unique factor i
Ui = the unique factor for variable i
m = number of common factors
• The unique factors are uncorrelated with each
other and with the common factors.
Factor Analysis Vishal Sarin
7. Common factors
Fi = Wi1X1 + Wi2X2+ Wi3X3+ ……..+ WikXk
Fi = estimate of ith factor
Wi= weight or factor score coefficient
k = number of variable
• It is possible to select weights or factor score
coefficient so that the first factor explains the
largest portion of the total variance.
Factor Analysis Vishal Sarin
8. Process Formulate the problem
Construct the correlation matrix
Determine the method of factor analysis
Determine the number of factors
Rotate the factors
Interpret the factors
Calculate Select the
the factor score surrogate
variable
Determine the model Fit
Factor Analysis Vishal Sarin
9. Formulate the Problem
• Includes several tasks.
• Objective of Factor analysis should be identified.
• Variables to be specified properly.
• Variable: Interval or ratio scale
• Sample Size: 4 to 5 times as there are variables.
• In many marketing research, the sample size is
small and the ratio is considerably lower. In these
cases, the results should be interpreted
cautiously.
Factor Analysis Vishal Sarin
10. Problem
• Want to determine the underlying benefits
consumers seek from the purchase of a
toothpaste.
• Sample: 30 respondents
• 7-point scale used (1 – strongly disagree, 7 –
strongly disagree)
Factor Analysis Vishal Sarin
11. Questions?
V1 It is important to buy a toothpaste that prevents cavities.
V2 I like a toothpaste that gives shiny teeth.
V3 A toothpaste should strengthen your gums.
V4 I prefer a toothpaste that freshens breath.
V5 Prevention of tooth decay is not an important benefit offered
by a toothpaste.
V6 The most important consideration in buying a toothpaste is
attractive teeth.
Factor Analysis Vishal Sarin
13. Construct of correlation matrix
• The analytical process is based on a matrix of
correlations between the variables.
• For factor analysis, the variables must be
correlated.
• If correlations between all the variables are
small, factor analysis may not be appropriate.
Factor Analysis Vishal Sarin
14. Table shows there are relatively high correlations among
V1 (prevention of cavities), V3(strong gums), and V5
(prevention of tooth decay).
Likewise relatively high correlation among V2 (shiny
teeth), V4 (fresh breath), and V6 (attractive teeth).
Expectation is that these variables will correlate with
some set of factors.
Factor Analysis Vishal Sarin
15. Testing Appropriateness & Adequacy
BARTLETT’S TEST OF SPHERICITY
H0 = the variables are uncorrelated i.e. population
correlation matrix is an identity.
• If this hypothesis not rejected, appropriateness of factor
analysis should be questioned.
KAISER-MEYER-OLKIN (KMO) for SAMPLING ADEQUACY
• This index compares the magnitudes of the observed
correlation coefficients to the magnitude of the partial
correlation coefficient.
• Small value of KMO suggest not to apply factor analysis.
• KMO value greater than 0.5 is desirable.
Factor Analysis Vishal Sarin
16. • The null hypothesis of Bartlett’s test is
rejected. The approx. chi-square is 106.309
with 15 degree of freedom, which is
significant at the 0.05 level.
• The value of KMO is 0.659 (>0.5).
• Thus, factor analysis may be considered an
appropriate technique for analyzing the
correlation matrix.
Factor Analysis Vishal Sarin
17. Determine the method
• The approach used to derive the weights or factor score
coefficients differentiates the various method of factor
analysis.
• Two Approach:
1. Principal Component Analysis: Total variance in the data is
considered. It is recommended when the primary concern
is to determine the minimum number of factors that will
account for maximum variance in the data for use in
subsequent multivariate analysis.
2. Common Factor Analysis: Factors are estimated based only
on the common variance. This method is appropriate when
the primary concern is to identify the underlying
dimensions and the common variance is of interest.
3. Other Approach: methods of unweighted least squares,
generalized least squares, maximum likelihood, image
factoring, and alpha method.
Factor Analysis Vishal Sarin
18. • Communality is the amount of variance a
variable share with all the other variables
being considered. This is also the proportion
of variance explained by the common factors.
Factor Analysis Vishal Sarin
19. • Column shows the eigenvalues. The eigen values for a
factor indicates the total variance attributed to the
factor.
• The total variance accounted for by all six factor is
6.00.
• Factor 1 accounts for a variance of 2.719, which is
(2.719/6)= 45.32 % of total variance, and so on.
Several consideration are involved in
determining the number of factors.
Factor Analysis Vishal Sarin
20. Determine the number of factors
• It is possible to compute as many principal
components as there are variables, but in doing
so, no parsimony is gained.
• Small is good, but how many?
1. A priori determination.
2. Based on Eigenvalues (<1 ignore)
3. Based on Scree plot (see the steepness)
4. Based on %age variance (at least 60% variance)
5. Based on split half reliability (high correspondence included)
Factor Analysis Vishal Sarin
22. Rotate Factors
• An important output from
factor analysis is the factor
matrix.
• A factor matrix contains the
coefficents used to express
the standardized variables in
terms of the factors.
• These coefficients, factor loading, represent
the correlation between the factors and
variables.
• Also called Unrotated factor matrix.
Factor Analysis Vishal Sarin
23. • Although the unrotated factor matrix
indicates the relationship between the factors
and individual variables, it seldom results in
factors that can be interpreted, because
factors are correlated with many variables.
Factor Analysis Vishal Sarin
24. • In rotating the factors, we expect each factor
to have non-zero or significant loading of
coefficient for only some of the variables.
• Rotation does not affect the communalities
and the percentage of total variance
explained.
• Percentage of variance does change.
• Commonly used method is VARIMAX
procedure.
Factor Analysis Vishal Sarin
25. Interpret Factors
Factor 1 has high coefficients
for V1 (prevention of
cavities) and V3 (strong
gums), and a negative
coefficient for V5 (prevention
of tooth decay is
important)*.
Thus, Factor 1 leads to Health
Benefits
Factor 2 is highly related with variable V2 (shiny
teeth), V4 (fresh breath), and V6 (attractive
teeth). Factor 2 leads to Social Benefits.
Factor Analysis Vishal Sarin
27. • Factor Loading Plot confirms this
interpretation. Variable V1, V3, and V5 are at
the end of horizontal axis (factor 1), with V5 at
the end opposite to V1 and V3.
• Variables V2, V4, and V6 are at the end of the
vertical axis (factor 2).
• Thus, consumer appears to seek two major
Benefits from a toothpaste.
Health Benefits & Social Benefits
Factor Analysis Vishal Sarin
28. Calculate Factor Scores
• Factor analysis has its own stand-alone value.
• If goal of factor analysis is to reduce to a
smaller set of variables for multivariate
analysis.
• The factor score can be used instead of the
original variables in subsequent multivariate
analysis.
Factor Analysis Vishal Sarin
29. Select Surrogate Variables
• Used when variable are important over factors.
• By examining the factor matrix, one could select
for each factor the variable with the highest
loading on that factor.
• That variable could then be used as a surrogate
variable.
• This process works well if one factor loading for a
variable is clearly higher than all other factor
loading.
Factor Analysis Vishal Sarin
30. Determine the model Fit
• If there are many large residuals, the factor
model does not provide a good fit.
• We can see only 5 residuals are larger than
0.05, indicating an acceptable model fit.
Factor Analysis Vishal Sarin
31. Perhaps the most
valuable result of all
education is the ability
to make yourself do the
thing you have to do,
when it ought to be
done, whether you like
it or not.
Factor Analysis Vishal Sarin
Thank you.