SlideShare a Scribd company logo
P a g e 1 | 8
DESCRIPTION OF THE TOPIC
Items Description of the Topic
Course Data Analysis for Social Science Teachers
Topic Introduction to Data Analysis
Module Id 1.1
Introduction
In the recent past, quite a bit of importance has been given to data analysis in
research. One of the possible reasons is that empirical evidence establishes a firm grounding
to either accept or reject the proposed hypotheses. The choice of the statistical technique
depends on the nature of the research problem or question and also on the nature of the data
set.
The research questions to solve a research gap or problem may be related to
identifying the degree of relationships among variables, checking for the significance of
group differences, predicting of group memberships or structure, or it could be time-related.
In order to identify associations between two or more variables, depending on
whether their nature of being parametric or non-parametric, correlation, and regression or chi-
square techniques may be adopted. This can be done as a Bi-variate correlation and
regression, multiple correlation and regression, Canonical correlation, Multiple Discriminant
Analysis, and Log-it regression. The bi-variate correlation is a good starting point to identify
the degree of relationship between two continuous variables, such as job and family
satisfaction where either of them can be treated as a DV and IV as the research question may
be. But bi-variate regression would require one of them to be defined as the DV and the other
as the IV. Although these are not multivariate techniques, they form the basis of the
Multivariate Analysis (MVA).
1. Importance of Multivariate Analysis (MVA)
If watching a movie needs to be a pleasant experience, the lighting, the projected film
light and sound effects in the theatre must be optimum. The other factors that may contribute
to a pleasant viewing experience may include, but not limited to, seating arrangements, air-
conditioning and hall odour. If one has to study or measure the pleasantness of watching a
movie in a theatre, all of the above factors must be studied together and not in isolation.
There is a possibility of an unpleasant parking experience that may negatively impact the
pleasantness of watching a movie. So, the real value of measuring the pleasantness of a
movie-watching experience lies in measuring all the influencing factors together.
This is exactly what Multivariate analysis is all about. So, analysis of multiple
variables simultaneously would result in a better picture to arrive at inferences instead of
multiple uni-variate analyses done with the individual variables. Statistical Techniques that
simultaneously analyse multiple measurements of the observed variables are known as
Multivariate Analysis (MVA). We may perform MVA by using multiple variables in a single
relationship or in multiple relationships.
P a g e 2 | 8
In a truly multivariate scenario, all variables must be:
i. Random in nature,
ii. Inter-related, and
iii. Interpreted in unison.
Reading the paper related to testing the Greenhaus and Allen model by Pattusamy and
Jacob (2015) will help in understanding our forthcoming discussions and answering a few
questions in the end. The theoretical model is shown in Figure 1.
Figure 1 - Theoretical Model
From Figure 1, it is seen that family-work conflict (FWC) will have a negative effect
on job satisfaction (JS) while family-work facilitation (FWF) will have a positive effect on
job satisfaction. Similarly, work-family facilitation (WFF) will have a positive effect on
family satisfaction (FS) while work-family conflict (WFC) will have a negative effect on
family satisfaction. Both job and family satisfaction will influence feelings of work-family
balance positively which in turn will positively influence life satisfaction (LS). All the above
statements have been hypothesized and can be stated conclusively if we have empirical data
to establish the stated hypotheses. The use of appropriate statistical methods will facilitate the
data analysis to arrive at well-grounded inferences and conclusions.
Univariate statistical tests involve one dependent variable. Examples include, but are
not limited to, t-tests of means, analysis of variance (ANOVA), analysis of covariance and
simple linear regression (with one dependent and one independent variable). Having said so
much about the importance of data analysis, let us have a quick look at a few multivariate
techniques that we are likely to study in detail during the course of this study.
The next section leads us to the classification of MVA.
P a g e 3 | 8
2. Classification of MVA
MVA can be classified as Dependence techniques and Interdependence techniques.
2.1 Dependence techniques (used when there are one or more dependent variables and
independent variables. Eg. Multiple regression analysis)
i. Multiple regression and multiple correlation
ii. Multiple Discriminant Analysis (MDA) and Logistic Regression
iii. Canonical Correlation Analysis
iv. Multivariate Analysis of Variance and Covariance
v. Conjoint Analysis
vi. Structural Equation Modelling (SEM) and Confirmatory Factor Analysis (CFA)
2.1.1 Multiple Regression
Let us presume that some previous research has established that cars with higher
engine capacity and higher unladen weight offer lesser fuel efficiency (possibly validated
using a correlation analysis). If a researcher wants to predict the fuel efficiency based on
engine capacity and unladen weight, then fuel efficiency is treated as the dependent variable
while engine capacity and unladen weight are treated as the independent variables. The
researcher collects data on fuel efficiency, engine capacity and an unladen weight of about
100 cars or more (that run on the same type of fuel) and would possibly use the multiple
regression (MR) method to predict fuel efficiency. In order to use the MR method the
dependent and the independent variables (two or more) must be metric data.
2.1.2 Multiple Discriminant Analysis (MDA) and Logit Analysis
If the dependent variable is dichotomous (Yes/No, Men / Women) type, then MDA is
an appropriate technique. The independent variables need to be metric data. MDA helps to
understand group differences and to predict the possibility that an observation or object
would belong to a specific group. An example that we had discussed in MR in the previous
section, suppose we had data on the engine capacity and unladen weight of about 100 plus
cars (that run on the same type of fuel) and if we want to classify them as Big and Small cars,
then MDA would be a relevant technique.
Logit Analysis also is known as Logistics regression is a combination of MR and
MDA. Although the regression principle is similar to that of MR, the DV in Logit regression
need not be metric as in the case of MR but can be a dichotomous variable as in MDA.
Another distinguishing fact of Logit regression is that it can accommodate both metric and
non-metric IVs and overlook the multivariate normality assumption.
P a g e 4 | 8
2.1.3 Canonical Correlation Analysis
If there are multiple metric dependent and metric independent variables to be
correlated and regressed, then the right tool is Canonical Correlation Analysis. We actually
try to determine the associations between two sets of variables. For example, we might study
the relationship between a number of indices of fuel efficiency (the DVs such as Indicated
Horse Power (IHP) and Brake Horse Power (BHP)) and the IVs (such as engine capacity,
unladen weight of the car, and age of the car).
2.1.4 Multivariate Analysis of Variance and Covariance
In-order to simultaneously explore the relationship between multiple categorical
independent variables, which are also called treatments, and two or more metric dependent
variables, an ideal technique would be the Multivariate Analysis of Variance and Covariance
(MANOVA). If the analysis requires the elimination of the effect of the uncontrolled metric
independent variables, which are known as covariates, on the dependent variables, then the
multivariate analysis of covariance (MANCOVA) is used. Both MANOVA and MANCOVA
may be done as one way or factorial. In our car example with fuel efficiency as the DV, age
of the car can be treated as a covariate.
2.1.5 Conjoint Analysis
Conjoint Analysis is a contemporary dependence technique that would help a decision-maker
(product design head) evaluate the importance of attributes (typically product attributes)
along with its levels. Let us say we have three attributes of a car, namely, airbags (2, 4, or 6
airbags), speakers for infotainment (2, 4 or 6 speakers) and steering wheel height adjustment
(low, medium and high). If we want to know popular combinations preferred by car
enthusiasts, we may have to ask them to rate all of the 27 combinations. For example a car
enthusiast may prefer 6 airbags, 4 speakers and medium height for his steering wheel.
Likewise, there are 27 possible combinations. However, using conjoint analysis it is possible
to capture the ratings of the prospective car buyer with just 9 or more combinations. The
conjoint analysis helps a great deal is product design simulation studies.
2.1.6 Structural Equation Modelling (SEM) and Confirmatory Factor Analysis
(CFA)
While multiple regression examines a single relationship between a DV and multiple
IVs in an SEM, it is possible to examine multiple relationships simultaneously. Generally, a
CFA is done prior to the SEM. The SEM consists of the structural and the measurement
model. The structural model may have one or more DVs and one or more IVs with all
relationships defined. Each of the DVs and IVS may be either uni- or multi-dimensional and
each of the dimensions may be measured using scale items for indicators. The CFA will show
the contribution of each scale item to its dimension and the extent to which it measures the
same. By this the measurement model is evaluated. After the validity and reliability of the
P a g e 5 | 8
measurement model are established, the structural model is evaluated to establish and prove
or disprove hypotheses. Hence, SEM supports simultaneous assessment of relationships and
accommodates multi-item scales.
2. 2 Interdependence techniques (absence of dependent or independent variables but
involves techniques to simultaneously analyze all variables together in the set. Eg. Factor
Analysis).
a) Factor Analysis (both Principal Component Analysis and Common Factor Analysis)
b) Cluster Analysis
c) Perceptual Mapping (also called as Multidimensional Scaling)
d) Correspondence Analysis
2.2.1 Factor Analysis
The objective of factor analysis is to reduce the number of measured variables into
meaningful factors (or variates) with minimal loss of information. This can either be done by
the PCA method or by common factor analysis. Suppose a prospective car buyer is
considering the color of the car, the aerodynamic design, body-colored bumpers, height-
adjustable steering column, driver seat height adjustment, touch screen for infotainment, ABS
and Airbags. If the opinion of the car buyer is captured using a 7 point Likert scale, either
PCA or common factor analysis may group these eight variables in three groups, namely,
external features (colour of the car, the aerodynamic design, body-colored bumpers), internal
features (height-adjustable steering column, driver seat height adjustment, touch screen for
infotainment) and safety features (ABS and Airbags). So factor analysis helps us to reduce
eight variables into three meaningful factors (variates).
2.2.2 Cluster Analysis
In the car example that we have been discussing so far, suppose we have the data on
engine capacities of about 130 cars with the engine capacities ranging from a minimum of
799cc to 2399cc and we want these 130 cars to be placed in three groups, namely, small,
medium and large cars, cluster analysis would be a recommended technique. The Cluster
analysis algorithm places the objects in homogeneous groups depending on the characteristics
specified by the researcher. In our example, the cars would be placed in groups based on
engine capacity. Clustering can be done based on multiple characteristics too. Either
hierarchical or non-hierarchical clustering procedures may be adopted. Basically hierarchical
methods could be either agglomerative or divisive. The algorithms followed in the
hierarchical methods are single, complete and average linkage methods. The other methods
are the Centroid and Ward methods. Alternatively the non-hierarchical clustering popularly
follows the k-means algorithm and places objects in cluster groups once the number of
clusters is specified. The decision on whether to adopt the hierarchical or non-hierarchical
procedure depends on the choice of the researcher and the problem defined.
P a g e 6 | 8
2.2.3 Perceptual Mapping
If we consider two dimensions of the car, namely, fuel efficiency and driving comfort
and we want to know how the brands of cars currently available in the market are positioned
in the minds of the car enthusiasts and perceived by the car enthusiasts, the right technique is
Perceptual Mapping (PM) also known as Multi-dimensional Scaling (MDS). MDS
typically helps a researcher to determine the perceived relative image of the cars (in this case)
considering the two dimensions. In MDS, unlike in factor or cluster analysis, a solution can
be obtained for each respondent and there is no variate. The researcher makes choices
between similarity and preference data, disaggregate and aggregate analysis and on whether
to use the Compositional or decompositional methods. Although earlier MDS programs were
predominantly non-metric in output, the contemporary programs provide metric output.
2.2.4 Correspondence Analysis
If we have non-metric data such as colors of the cars, classification of car size such as
small, medium and large and we want to position the cars in a perceptual map, then the
technique to be adopted is the Correspondence Analysis (CA). It starts with a cross-tabulation
of the two attributes, namely, colors and car size; after that it carries out a non-metric to
metric conversion, and then leads to dimension reduction and finally the perceptual map is
prepared. CA is the best option for a multivariate representation of interdependence for non-
metric data.
3. Nature of Data
The following table gives a summary of the nature of data:
Name of the Multivariate
Technique
Nature of the Data
DV IV
Canonical Correlation Metric, Non-metric Metric, Non-metric
MANOVA Metric Non-metric
ANOVA Metric Non-metric
MDA Non-metric Metric
Multiple Regression Metric Metric, Non-metric
Conjoint Analysis Non-metric, Metric Non-metric
SEM Metric Metric, Non-metric
4 Some Generic Tips to Perform Multivariate Analysis
P a g e 7 | 8
While performing MVA on the research problem, it would help if the researcher observes the
following tips:
1. Ensure that both statistical and practical significance exists in the research being done.
2. The sample size should be adequate but neither under sized nor over sized.
3. Clearly, understand the nature of the data.
4. Use a minimum number of variables in the model to obtain the desired results.
5. Identify and eliminate errors.
6. Ensure a fool-proof validation of the results.
I hope the above content gives you a fair idea of the existing multivariate techniques
that we would be covering in our course and a snapshot of their applications. For further
learning, may I also suggest the open courseware by Cynthia et al., (2011), titled “Statistical
Thinking and Data Analysis”.
Although at the beginning of this discussion, I had suggested the reading of the paper
by Pattusamy and Jacob (2015), throughout the discussion I used examples relating to cars. If
you have understood the application of the discussed MVA tests with the variables in the car
example, you should be able to answer a few fundamental questions relating to data analysis
with respect to the variables in the paper. Here are your challenges.
Self-Assessment:
You could suggest appropriate statistical tests to answer the following research
questions. It does help if you could also justify your choice of the technique.
1. Are men more satisfied with their jobs than women?
2. Does life satisfaction vary with age?
3. Will feelings of work-life balance influence the relationship between job
satisfaction and life satisfaction?
4. Would there be a difference in the strength of the relationship between family
satisfaction and life satisfaction between men and women?
5. Would it be possible to categorize men who are highly and moderately satisfied in
their lives?
P a g e 8 | 8
References
1. Barbara G.T and Linda S.F, Using Multivariate Statistics, 6th Edition, Pearson Education
Inc, pp. 612-680.
2. Cynthia Rudin, Allison Chang, and Dimitrios Bisias. 15.075J Statistical Thinking and
Data Analysis. Fall 2011. Massachusetts Institute of Technology: MIT
OpenCourseWare, https://ocw.mit.edu. License: Creative Commons BY-NC-SA.
3. Hair J.F, Black W.C, Babin B.J and Anderson R.E, Multivariate Data Analysis, 7th
Edition, Pearson Education (South Asia), pp. 89-149.
4. Murugan Pattusamy and Jayanth Jacob, A test of Greenhaus and Allen (2011) model on
Work Family Balance, Current Psychology, Springer, 2015.
5. Zumbo B.D. (2014) Univariate Tests. In: Michalos A.C. (eds) Encyclopedia of Quality
of Life and Well-Being Research. Springer, Dordrecht
***************************************************************************

More Related Content

What's hot

Otto_Elmgart_Noise_Vol_Struct
Otto_Elmgart_Noise_Vol_StructOtto_Elmgart_Noise_Vol_Struct
Otto_Elmgart_Noise_Vol_StructOtto Elmgart
 
Unit iv statistical tools
Unit iv statistical toolsUnit iv statistical tools
Unit iv statistical tools
sujianush
 
Econometrics notes (Introduction, Simple Linear regression, Multiple linear r...
Econometrics notes (Introduction, Simple Linear regression, Multiple linear r...Econometrics notes (Introduction, Simple Linear regression, Multiple linear r...
Econometrics notes (Introduction, Simple Linear regression, Multiple linear r...
Muhammad Ali
 
Slides sem on pls-complete
Slides sem on pls-completeSlides sem on pls-complete
Slides sem on pls-complete
Dr Hemant Sharma
 
Overview of Multivariate Statistical Methods
Overview of Multivariate Statistical MethodsOverview of Multivariate Statistical Methods
Overview of Multivariate Statistical Methods
ThomasUttaro1
 
Ash bus 308 week 4 quiz
Ash bus 308 week 4 quizAsh bus 308 week 4 quiz
Ash bus 308 week 4 quiz
Faarooqkhaann
 
Ash bus 308 week 4 quiz
Ash bus 308 week 4 quizAsh bus 308 week 4 quiz
Ash bus 308 week 4 quiz
Noahliamwilliam
 
Cb sem and pls-sem
Cb sem and pls-semCb sem and pls-sem
Cb sem and pls-sem
Noreen Hassan
 
Ash bus 308 week 4 quiz
Ash bus 308 week 4 quizAsh bus 308 week 4 quiz
Ash bus 308 week 4 quiz
uopassignment
 
Discriminant analysis
Discriminant analysisDiscriminant analysis
Discriminant analysis
Amritashish Bagchi
 
s.analysis
s.analysiss.analysis
s.analysis
kavi ...
 
Multi-dimensional time series based approach for Banking Regulatory Stress Te...
Multi-dimensional time series based approach for Banking Regulatory Stress Te...Multi-dimensional time series based approach for Banking Regulatory Stress Te...
Multi-dimensional time series based approach for Banking Regulatory Stress Te...
Genpact Ltd
 
Factor analysis in R by Aman Chauhan
Factor analysis in R by Aman ChauhanFactor analysis in R by Aman Chauhan
Factor analysis in R by Aman Chauhan
Aman Chauhan
 
Discriminant analysis
Discriminant analysisDiscriminant analysis
Discriminant analysis
Wansuklangk
 
Optimised random mutations for
Optimised random mutations forOptimised random mutations for
Optimised random mutations for
ijaia
 
Evaluating competing predictive distributions
Evaluating competing predictive distributionsEvaluating competing predictive distributions
Evaluating competing predictive distributionsAndreas Collett
 
Chapter 05
Chapter 05Chapter 05
Chapter 05bmcfad01
 

What's hot (19)

Otto_Elmgart_Noise_Vol_Struct
Otto_Elmgart_Noise_Vol_StructOtto_Elmgart_Noise_Vol_Struct
Otto_Elmgart_Noise_Vol_Struct
 
Unit iv statistical tools
Unit iv statistical toolsUnit iv statistical tools
Unit iv statistical tools
 
Evaluacion cuatro
Evaluacion cuatroEvaluacion cuatro
Evaluacion cuatro
 
Econometrics notes (Introduction, Simple Linear regression, Multiple linear r...
Econometrics notes (Introduction, Simple Linear regression, Multiple linear r...Econometrics notes (Introduction, Simple Linear regression, Multiple linear r...
Econometrics notes (Introduction, Simple Linear regression, Multiple linear r...
 
Slides sem on pls-complete
Slides sem on pls-completeSlides sem on pls-complete
Slides sem on pls-complete
 
Overview of Multivariate Statistical Methods
Overview of Multivariate Statistical MethodsOverview of Multivariate Statistical Methods
Overview of Multivariate Statistical Methods
 
Ash bus 308 week 4 quiz
Ash bus 308 week 4 quizAsh bus 308 week 4 quiz
Ash bus 308 week 4 quiz
 
Geert van Kollenburg-masterthesis
Geert van Kollenburg-masterthesisGeert van Kollenburg-masterthesis
Geert van Kollenburg-masterthesis
 
Ash bus 308 week 4 quiz
Ash bus 308 week 4 quizAsh bus 308 week 4 quiz
Ash bus 308 week 4 quiz
 
Cb sem and pls-sem
Cb sem and pls-semCb sem and pls-sem
Cb sem and pls-sem
 
Ash bus 308 week 4 quiz
Ash bus 308 week 4 quizAsh bus 308 week 4 quiz
Ash bus 308 week 4 quiz
 
Discriminant analysis
Discriminant analysisDiscriminant analysis
Discriminant analysis
 
s.analysis
s.analysiss.analysis
s.analysis
 
Multi-dimensional time series based approach for Banking Regulatory Stress Te...
Multi-dimensional time series based approach for Banking Regulatory Stress Te...Multi-dimensional time series based approach for Banking Regulatory Stress Te...
Multi-dimensional time series based approach for Banking Regulatory Stress Te...
 
Factor analysis in R by Aman Chauhan
Factor analysis in R by Aman ChauhanFactor analysis in R by Aman Chauhan
Factor analysis in R by Aman Chauhan
 
Discriminant analysis
Discriminant analysisDiscriminant analysis
Discriminant analysis
 
Optimised random mutations for
Optimised random mutations forOptimised random mutations for
Optimised random mutations for
 
Evaluating competing predictive distributions
Evaluating competing predictive distributionsEvaluating competing predictive distributions
Evaluating competing predictive distributions
 
Chapter 05
Chapter 05Chapter 05
Chapter 05
 

Similar to Introduction to data analysis

Data Analytics Notes
Data Analytics NotesData Analytics Notes
Data Analytics Notes
TRANJAY CHANDEL
 
2012-Nathans-PARE-RegressionGuidebook.pdf
2012-Nathans-PARE-RegressionGuidebook.pdf2012-Nathans-PARE-RegressionGuidebook.pdf
2012-Nathans-PARE-RegressionGuidebook.pdf
s_p2000
 
A researcher in attempting to run a regression model noticed a neg.docx
A researcher in attempting to run a regression model noticed a neg.docxA researcher in attempting to run a regression model noticed a neg.docx
A researcher in attempting to run a regression model noticed a neg.docx
evonnehoggarth79783
 
Data-Analysis.pptx
Data-Analysis.pptxData-Analysis.pptx
Data-Analysis.pptx
RobertRevamonte1
 
7Repeated Measures Designs for Interval DataLearnin.docx
7Repeated Measures Designs  for Interval DataLearnin.docx7Repeated Measures Designs  for Interval DataLearnin.docx
7Repeated Measures Designs for Interval DataLearnin.docx
evonnehoggarth79783
 
Introduction to regression
Introduction to regressionIntroduction to regression
Introduction to regression
Dr. C.V. Suresh Babu
 
Advanced StatisticsUnit 5There are several r.docx
Advanced StatisticsUnit 5There are several r.docxAdvanced StatisticsUnit 5There are several r.docx
Advanced StatisticsUnit 5There are several r.docx
nettletondevon
 
STRUCTURAL EQUATION MODEL (SEM)
STRUCTURAL EQUATION MODEL (SEM)STRUCTURAL EQUATION MODEL (SEM)
STRUCTURAL EQUATION MODEL (SEM)
AJHSSR Journal
 
Brm unit iv - cheet sheet
Brm   unit iv - cheet sheetBrm   unit iv - cheet sheet
Brm unit iv - cheet sheet
Hallmark B-school
 
ders 6 Panel data analysis.pptx
ders 6 Panel data analysis.pptxders 6 Panel data analysis.pptx
ders 6 Panel data analysis.pptx
Ergin Akalpler
 
Factor analysis using spss 2005
Factor analysis using spss 2005Factor analysis using spss 2005
Factor analysis using spss 2005
jamescupello
 
Factor analysis (1)
Factor analysis (1)Factor analysis (1)
Factor analysis (1)
CVA170032STUDENT
 
Factor Analysis (Marketing Research)
Factor Analysis (Marketing Research)Factor Analysis (Marketing Research)
Factor Analysis (Marketing Research)
Mohammad Saif Alam
 
Multivariate Approaches in Nursing Research Assignment.pdf
Multivariate Approaches in Nursing Research Assignment.pdfMultivariate Approaches in Nursing Research Assignment.pdf
Multivariate Approaches in Nursing Research Assignment.pdf
bkbk37
 
Chapter 12Choosing an Appropriate Statistical TestiStockph.docx
Chapter 12Choosing an Appropriate Statistical TestiStockph.docxChapter 12Choosing an Appropriate Statistical TestiStockph.docx
Chapter 12Choosing an Appropriate Statistical TestiStockph.docx
mccormicknadine86
 
Multinomial Logistic Regression.pdf
Multinomial Logistic Regression.pdfMultinomial Logistic Regression.pdf
Multinomial Logistic Regression.pdf
AlemAyahu
 
AHP technique a way to show preferences amongst alternatives
AHP technique a way to show preferences amongst alternativesAHP technique a way to show preferences amongst alternatives
AHP technique a way to show preferences amongst alternatives
ijsrd.com
 
Edison S Statistics
Edison S StatisticsEdison S Statistics
Edison S Statisticsteresa_soto
 
A review of statistics
A review of statisticsA review of statistics
A review of statistics
edisonre
 

Similar to Introduction to data analysis (20)

Data Analytics Notes
Data Analytics NotesData Analytics Notes
Data Analytics Notes
 
2012-Nathans-PARE-RegressionGuidebook.pdf
2012-Nathans-PARE-RegressionGuidebook.pdf2012-Nathans-PARE-RegressionGuidebook.pdf
2012-Nathans-PARE-RegressionGuidebook.pdf
 
A researcher in attempting to run a regression model noticed a neg.docx
A researcher in attempting to run a regression model noticed a neg.docxA researcher in attempting to run a regression model noticed a neg.docx
A researcher in attempting to run a regression model noticed a neg.docx
 
Data-Analysis.pptx
Data-Analysis.pptxData-Analysis.pptx
Data-Analysis.pptx
 
7Repeated Measures Designs for Interval DataLearnin.docx
7Repeated Measures Designs  for Interval DataLearnin.docx7Repeated Measures Designs  for Interval DataLearnin.docx
7Repeated Measures Designs for Interval DataLearnin.docx
 
Introduction to regression
Introduction to regressionIntroduction to regression
Introduction to regression
 
Advanced StatisticsUnit 5There are several r.docx
Advanced StatisticsUnit 5There are several r.docxAdvanced StatisticsUnit 5There are several r.docx
Advanced StatisticsUnit 5There are several r.docx
 
STRUCTURAL EQUATION MODEL (SEM)
STRUCTURAL EQUATION MODEL (SEM)STRUCTURAL EQUATION MODEL (SEM)
STRUCTURAL EQUATION MODEL (SEM)
 
Brm unit iv - cheet sheet
Brm   unit iv - cheet sheetBrm   unit iv - cheet sheet
Brm unit iv - cheet sheet
 
Manova Report
Manova ReportManova Report
Manova Report
 
ders 6 Panel data analysis.pptx
ders 6 Panel data analysis.pptxders 6 Panel data analysis.pptx
ders 6 Panel data analysis.pptx
 
Factor analysis using spss 2005
Factor analysis using spss 2005Factor analysis using spss 2005
Factor analysis using spss 2005
 
Factor analysis (1)
Factor analysis (1)Factor analysis (1)
Factor analysis (1)
 
Factor Analysis (Marketing Research)
Factor Analysis (Marketing Research)Factor Analysis (Marketing Research)
Factor Analysis (Marketing Research)
 
Multivariate Approaches in Nursing Research Assignment.pdf
Multivariate Approaches in Nursing Research Assignment.pdfMultivariate Approaches in Nursing Research Assignment.pdf
Multivariate Approaches in Nursing Research Assignment.pdf
 
Chapter 12Choosing an Appropriate Statistical TestiStockph.docx
Chapter 12Choosing an Appropriate Statistical TestiStockph.docxChapter 12Choosing an Appropriate Statistical TestiStockph.docx
Chapter 12Choosing an Appropriate Statistical TestiStockph.docx
 
Multinomial Logistic Regression.pdf
Multinomial Logistic Regression.pdfMultinomial Logistic Regression.pdf
Multinomial Logistic Regression.pdf
 
AHP technique a way to show preferences amongst alternatives
AHP technique a way to show preferences amongst alternativesAHP technique a way to show preferences amongst alternatives
AHP technique a way to show preferences amongst alternatives
 
Edison S Statistics
Edison S StatisticsEdison S Statistics
Edison S Statistics
 
A review of statistics
A review of statisticsA review of statistics
A review of statistics
 

More from RajaKrishnan M

Shortcomings of Demat Account
Shortcomings of Demat AccountShortcomings of Demat Account
Shortcomings of Demat Account
RajaKrishnan M
 
Demat Account Services
Demat Account ServicesDemat Account Services
Demat Account Services
RajaKrishnan M
 
Depository Participant
Depository ParticipantDepository Participant
Depository Participant
RajaKrishnan M
 
Services provided in Mobile Banking
Services provided in Mobile BankingServices provided in Mobile Banking
Services provided in Mobile Banking
RajaKrishnan M
 
Ombudsman scheme
Ombudsman scheme Ombudsman scheme
Ombudsman scheme
RajaKrishnan M
 
Factors affecting share price
Factors affecting share priceFactors affecting share price
Factors affecting share price
RajaKrishnan M
 
Rights of investors
Rights of investorsRights of investors
Rights of investors
RajaKrishnan M
 
Loss of Confidence of small investors
Loss of Confidence of small investorsLoss of Confidence of small investors
Loss of Confidence of small investors
RajaKrishnan M
 
Facilities by BSE
Facilities by BSEFacilities by BSE
Facilities by BSE
RajaKrishnan M
 
Technological forces fueling e-commerce
Technological forces fueling e-commerceTechnological forces fueling e-commerce
Technological forces fueling e-commerce
RajaKrishnan M
 
Encryption and Decryption
Encryption and DecryptionEncryption and Decryption
Encryption and Decryption
RajaKrishnan M
 
Meaning, Anatomy and Forces Fueling e-commerce
Meaning, Anatomy and Forces Fueling e-commerceMeaning, Anatomy and Forces Fueling e-commerce
Meaning, Anatomy and Forces Fueling e-commerce
RajaKrishnan M
 
Forces Fueling e-commerce
Forces Fueling e-commerceForces Fueling e-commerce
Forces Fueling e-commerce
RajaKrishnan M
 
Inter Organizational e-commerce
Inter Organizational e-commerceInter Organizational e-commerce
Inter Organizational e-commerce
RajaKrishnan M
 
Factors for the success of m-commerce
Factors for the success of m-commerceFactors for the success of m-commerce
Factors for the success of m-commerce
RajaKrishnan M
 
Advantages of E-Commerce
Advantages of E-CommerceAdvantages of E-Commerce
Advantages of E-Commerce
RajaKrishnan M
 
Types of E-Commerce
Types of E-CommerceTypes of E-Commerce
Types of E-Commerce
RajaKrishnan M
 
E-Commerce and E- Businesss
E-Commerce and E- BusinesssE-Commerce and E- Businesss
E-Commerce and E- Businesss
RajaKrishnan M
 
Electronic Data Interchange & Internet
Electronic Data Interchange & InternetElectronic Data Interchange & Internet
Electronic Data Interchange & Internet
RajaKrishnan M
 

More from RajaKrishnan M (20)

Shortcomings of Demat Account
Shortcomings of Demat AccountShortcomings of Demat Account
Shortcomings of Demat Account
 
Demat Account Services
Demat Account ServicesDemat Account Services
Demat Account Services
 
Depository Participant
Depository ParticipantDepository Participant
Depository Participant
 
Services provided in Mobile Banking
Services provided in Mobile BankingServices provided in Mobile Banking
Services provided in Mobile Banking
 
Ombudsman scheme
Ombudsman scheme Ombudsman scheme
Ombudsman scheme
 
Factors affecting share price
Factors affecting share priceFactors affecting share price
Factors affecting share price
 
Rights of investors
Rights of investorsRights of investors
Rights of investors
 
Loss of Confidence of small investors
Loss of Confidence of small investorsLoss of Confidence of small investors
Loss of Confidence of small investors
 
Facilities by BSE
Facilities by BSEFacilities by BSE
Facilities by BSE
 
Technological forces fueling e-commerce
Technological forces fueling e-commerceTechnological forces fueling e-commerce
Technological forces fueling e-commerce
 
Encryption and Decryption
Encryption and DecryptionEncryption and Decryption
Encryption and Decryption
 
Meaning, Anatomy and Forces Fueling e-commerce
Meaning, Anatomy and Forces Fueling e-commerceMeaning, Anatomy and Forces Fueling e-commerce
Meaning, Anatomy and Forces Fueling e-commerce
 
Forces Fueling e-commerce
Forces Fueling e-commerceForces Fueling e-commerce
Forces Fueling e-commerce
 
Inter Organizational e-commerce
Inter Organizational e-commerceInter Organizational e-commerce
Inter Organizational e-commerce
 
Factors for the success of m-commerce
Factors for the success of m-commerceFactors for the success of m-commerce
Factors for the success of m-commerce
 
Advantages of E-Commerce
Advantages of E-CommerceAdvantages of E-Commerce
Advantages of E-Commerce
 
Types of E-Commerce
Types of E-CommerceTypes of E-Commerce
Types of E-Commerce
 
E-Commerce and E- Businesss
E-Commerce and E- BusinesssE-Commerce and E- Businesss
E-Commerce and E- Businesss
 
RFID
RFIDRFID
RFID
 
Electronic Data Interchange & Internet
Electronic Data Interchange & InternetElectronic Data Interchange & Internet
Electronic Data Interchange & Internet
 

Recently uploaded

678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
CarlosHernanMontoyab2
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
GeoBlogs
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
Jheel Barad
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
MIRIAMSALINAS13
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
Anna Sz.
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
EugeneSaldivar
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
timhan337
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
EduSkills OECD
 
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdfAdversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Po-Chuan Chen
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
Vivekanand Anglo Vedic Academy
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
camakaiclarkmusic
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
Levi Shapiro
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th SemesterGuidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Atul Kumar Singh
 

Recently uploaded (20)

678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
 
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdfAdversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th SemesterGuidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th Semester
 

Introduction to data analysis

  • 1. P a g e 1 | 8 DESCRIPTION OF THE TOPIC Items Description of the Topic Course Data Analysis for Social Science Teachers Topic Introduction to Data Analysis Module Id 1.1 Introduction In the recent past, quite a bit of importance has been given to data analysis in research. One of the possible reasons is that empirical evidence establishes a firm grounding to either accept or reject the proposed hypotheses. The choice of the statistical technique depends on the nature of the research problem or question and also on the nature of the data set. The research questions to solve a research gap or problem may be related to identifying the degree of relationships among variables, checking for the significance of group differences, predicting of group memberships or structure, or it could be time-related. In order to identify associations between two or more variables, depending on whether their nature of being parametric or non-parametric, correlation, and regression or chi- square techniques may be adopted. This can be done as a Bi-variate correlation and regression, multiple correlation and regression, Canonical correlation, Multiple Discriminant Analysis, and Log-it regression. The bi-variate correlation is a good starting point to identify the degree of relationship between two continuous variables, such as job and family satisfaction where either of them can be treated as a DV and IV as the research question may be. But bi-variate regression would require one of them to be defined as the DV and the other as the IV. Although these are not multivariate techniques, they form the basis of the Multivariate Analysis (MVA). 1. Importance of Multivariate Analysis (MVA) If watching a movie needs to be a pleasant experience, the lighting, the projected film light and sound effects in the theatre must be optimum. The other factors that may contribute to a pleasant viewing experience may include, but not limited to, seating arrangements, air- conditioning and hall odour. If one has to study or measure the pleasantness of watching a movie in a theatre, all of the above factors must be studied together and not in isolation. There is a possibility of an unpleasant parking experience that may negatively impact the pleasantness of watching a movie. So, the real value of measuring the pleasantness of a movie-watching experience lies in measuring all the influencing factors together. This is exactly what Multivariate analysis is all about. So, analysis of multiple variables simultaneously would result in a better picture to arrive at inferences instead of multiple uni-variate analyses done with the individual variables. Statistical Techniques that simultaneously analyse multiple measurements of the observed variables are known as Multivariate Analysis (MVA). We may perform MVA by using multiple variables in a single relationship or in multiple relationships.
  • 2. P a g e 2 | 8 In a truly multivariate scenario, all variables must be: i. Random in nature, ii. Inter-related, and iii. Interpreted in unison. Reading the paper related to testing the Greenhaus and Allen model by Pattusamy and Jacob (2015) will help in understanding our forthcoming discussions and answering a few questions in the end. The theoretical model is shown in Figure 1. Figure 1 - Theoretical Model From Figure 1, it is seen that family-work conflict (FWC) will have a negative effect on job satisfaction (JS) while family-work facilitation (FWF) will have a positive effect on job satisfaction. Similarly, work-family facilitation (WFF) will have a positive effect on family satisfaction (FS) while work-family conflict (WFC) will have a negative effect on family satisfaction. Both job and family satisfaction will influence feelings of work-family balance positively which in turn will positively influence life satisfaction (LS). All the above statements have been hypothesized and can be stated conclusively if we have empirical data to establish the stated hypotheses. The use of appropriate statistical methods will facilitate the data analysis to arrive at well-grounded inferences and conclusions. Univariate statistical tests involve one dependent variable. Examples include, but are not limited to, t-tests of means, analysis of variance (ANOVA), analysis of covariance and simple linear regression (with one dependent and one independent variable). Having said so much about the importance of data analysis, let us have a quick look at a few multivariate techniques that we are likely to study in detail during the course of this study. The next section leads us to the classification of MVA.
  • 3. P a g e 3 | 8 2. Classification of MVA MVA can be classified as Dependence techniques and Interdependence techniques. 2.1 Dependence techniques (used when there are one or more dependent variables and independent variables. Eg. Multiple regression analysis) i. Multiple regression and multiple correlation ii. Multiple Discriminant Analysis (MDA) and Logistic Regression iii. Canonical Correlation Analysis iv. Multivariate Analysis of Variance and Covariance v. Conjoint Analysis vi. Structural Equation Modelling (SEM) and Confirmatory Factor Analysis (CFA) 2.1.1 Multiple Regression Let us presume that some previous research has established that cars with higher engine capacity and higher unladen weight offer lesser fuel efficiency (possibly validated using a correlation analysis). If a researcher wants to predict the fuel efficiency based on engine capacity and unladen weight, then fuel efficiency is treated as the dependent variable while engine capacity and unladen weight are treated as the independent variables. The researcher collects data on fuel efficiency, engine capacity and an unladen weight of about 100 cars or more (that run on the same type of fuel) and would possibly use the multiple regression (MR) method to predict fuel efficiency. In order to use the MR method the dependent and the independent variables (two or more) must be metric data. 2.1.2 Multiple Discriminant Analysis (MDA) and Logit Analysis If the dependent variable is dichotomous (Yes/No, Men / Women) type, then MDA is an appropriate technique. The independent variables need to be metric data. MDA helps to understand group differences and to predict the possibility that an observation or object would belong to a specific group. An example that we had discussed in MR in the previous section, suppose we had data on the engine capacity and unladen weight of about 100 plus cars (that run on the same type of fuel) and if we want to classify them as Big and Small cars, then MDA would be a relevant technique. Logit Analysis also is known as Logistics regression is a combination of MR and MDA. Although the regression principle is similar to that of MR, the DV in Logit regression need not be metric as in the case of MR but can be a dichotomous variable as in MDA. Another distinguishing fact of Logit regression is that it can accommodate both metric and non-metric IVs and overlook the multivariate normality assumption.
  • 4. P a g e 4 | 8 2.1.3 Canonical Correlation Analysis If there are multiple metric dependent and metric independent variables to be correlated and regressed, then the right tool is Canonical Correlation Analysis. We actually try to determine the associations between two sets of variables. For example, we might study the relationship between a number of indices of fuel efficiency (the DVs such as Indicated Horse Power (IHP) and Brake Horse Power (BHP)) and the IVs (such as engine capacity, unladen weight of the car, and age of the car). 2.1.4 Multivariate Analysis of Variance and Covariance In-order to simultaneously explore the relationship between multiple categorical independent variables, which are also called treatments, and two or more metric dependent variables, an ideal technique would be the Multivariate Analysis of Variance and Covariance (MANOVA). If the analysis requires the elimination of the effect of the uncontrolled metric independent variables, which are known as covariates, on the dependent variables, then the multivariate analysis of covariance (MANCOVA) is used. Both MANOVA and MANCOVA may be done as one way or factorial. In our car example with fuel efficiency as the DV, age of the car can be treated as a covariate. 2.1.5 Conjoint Analysis Conjoint Analysis is a contemporary dependence technique that would help a decision-maker (product design head) evaluate the importance of attributes (typically product attributes) along with its levels. Let us say we have three attributes of a car, namely, airbags (2, 4, or 6 airbags), speakers for infotainment (2, 4 or 6 speakers) and steering wheel height adjustment (low, medium and high). If we want to know popular combinations preferred by car enthusiasts, we may have to ask them to rate all of the 27 combinations. For example a car enthusiast may prefer 6 airbags, 4 speakers and medium height for his steering wheel. Likewise, there are 27 possible combinations. However, using conjoint analysis it is possible to capture the ratings of the prospective car buyer with just 9 or more combinations. The conjoint analysis helps a great deal is product design simulation studies. 2.1.6 Structural Equation Modelling (SEM) and Confirmatory Factor Analysis (CFA) While multiple regression examines a single relationship between a DV and multiple IVs in an SEM, it is possible to examine multiple relationships simultaneously. Generally, a CFA is done prior to the SEM. The SEM consists of the structural and the measurement model. The structural model may have one or more DVs and one or more IVs with all relationships defined. Each of the DVs and IVS may be either uni- or multi-dimensional and each of the dimensions may be measured using scale items for indicators. The CFA will show the contribution of each scale item to its dimension and the extent to which it measures the same. By this the measurement model is evaluated. After the validity and reliability of the
  • 5. P a g e 5 | 8 measurement model are established, the structural model is evaluated to establish and prove or disprove hypotheses. Hence, SEM supports simultaneous assessment of relationships and accommodates multi-item scales. 2. 2 Interdependence techniques (absence of dependent or independent variables but involves techniques to simultaneously analyze all variables together in the set. Eg. Factor Analysis). a) Factor Analysis (both Principal Component Analysis and Common Factor Analysis) b) Cluster Analysis c) Perceptual Mapping (also called as Multidimensional Scaling) d) Correspondence Analysis 2.2.1 Factor Analysis The objective of factor analysis is to reduce the number of measured variables into meaningful factors (or variates) with minimal loss of information. This can either be done by the PCA method or by common factor analysis. Suppose a prospective car buyer is considering the color of the car, the aerodynamic design, body-colored bumpers, height- adjustable steering column, driver seat height adjustment, touch screen for infotainment, ABS and Airbags. If the opinion of the car buyer is captured using a 7 point Likert scale, either PCA or common factor analysis may group these eight variables in three groups, namely, external features (colour of the car, the aerodynamic design, body-colored bumpers), internal features (height-adjustable steering column, driver seat height adjustment, touch screen for infotainment) and safety features (ABS and Airbags). So factor analysis helps us to reduce eight variables into three meaningful factors (variates). 2.2.2 Cluster Analysis In the car example that we have been discussing so far, suppose we have the data on engine capacities of about 130 cars with the engine capacities ranging from a minimum of 799cc to 2399cc and we want these 130 cars to be placed in three groups, namely, small, medium and large cars, cluster analysis would be a recommended technique. The Cluster analysis algorithm places the objects in homogeneous groups depending on the characteristics specified by the researcher. In our example, the cars would be placed in groups based on engine capacity. Clustering can be done based on multiple characteristics too. Either hierarchical or non-hierarchical clustering procedures may be adopted. Basically hierarchical methods could be either agglomerative or divisive. The algorithms followed in the hierarchical methods are single, complete and average linkage methods. The other methods are the Centroid and Ward methods. Alternatively the non-hierarchical clustering popularly follows the k-means algorithm and places objects in cluster groups once the number of clusters is specified. The decision on whether to adopt the hierarchical or non-hierarchical procedure depends on the choice of the researcher and the problem defined.
  • 6. P a g e 6 | 8 2.2.3 Perceptual Mapping If we consider two dimensions of the car, namely, fuel efficiency and driving comfort and we want to know how the brands of cars currently available in the market are positioned in the minds of the car enthusiasts and perceived by the car enthusiasts, the right technique is Perceptual Mapping (PM) also known as Multi-dimensional Scaling (MDS). MDS typically helps a researcher to determine the perceived relative image of the cars (in this case) considering the two dimensions. In MDS, unlike in factor or cluster analysis, a solution can be obtained for each respondent and there is no variate. The researcher makes choices between similarity and preference data, disaggregate and aggregate analysis and on whether to use the Compositional or decompositional methods. Although earlier MDS programs were predominantly non-metric in output, the contemporary programs provide metric output. 2.2.4 Correspondence Analysis If we have non-metric data such as colors of the cars, classification of car size such as small, medium and large and we want to position the cars in a perceptual map, then the technique to be adopted is the Correspondence Analysis (CA). It starts with a cross-tabulation of the two attributes, namely, colors and car size; after that it carries out a non-metric to metric conversion, and then leads to dimension reduction and finally the perceptual map is prepared. CA is the best option for a multivariate representation of interdependence for non- metric data. 3. Nature of Data The following table gives a summary of the nature of data: Name of the Multivariate Technique Nature of the Data DV IV Canonical Correlation Metric, Non-metric Metric, Non-metric MANOVA Metric Non-metric ANOVA Metric Non-metric MDA Non-metric Metric Multiple Regression Metric Metric, Non-metric Conjoint Analysis Non-metric, Metric Non-metric SEM Metric Metric, Non-metric 4 Some Generic Tips to Perform Multivariate Analysis
  • 7. P a g e 7 | 8 While performing MVA on the research problem, it would help if the researcher observes the following tips: 1. Ensure that both statistical and practical significance exists in the research being done. 2. The sample size should be adequate but neither under sized nor over sized. 3. Clearly, understand the nature of the data. 4. Use a minimum number of variables in the model to obtain the desired results. 5. Identify and eliminate errors. 6. Ensure a fool-proof validation of the results. I hope the above content gives you a fair idea of the existing multivariate techniques that we would be covering in our course and a snapshot of their applications. For further learning, may I also suggest the open courseware by Cynthia et al., (2011), titled “Statistical Thinking and Data Analysis”. Although at the beginning of this discussion, I had suggested the reading of the paper by Pattusamy and Jacob (2015), throughout the discussion I used examples relating to cars. If you have understood the application of the discussed MVA tests with the variables in the car example, you should be able to answer a few fundamental questions relating to data analysis with respect to the variables in the paper. Here are your challenges. Self-Assessment: You could suggest appropriate statistical tests to answer the following research questions. It does help if you could also justify your choice of the technique. 1. Are men more satisfied with their jobs than women? 2. Does life satisfaction vary with age? 3. Will feelings of work-life balance influence the relationship between job satisfaction and life satisfaction? 4. Would there be a difference in the strength of the relationship between family satisfaction and life satisfaction between men and women? 5. Would it be possible to categorize men who are highly and moderately satisfied in their lives?
  • 8. P a g e 8 | 8 References 1. Barbara G.T and Linda S.F, Using Multivariate Statistics, 6th Edition, Pearson Education Inc, pp. 612-680. 2. Cynthia Rudin, Allison Chang, and Dimitrios Bisias. 15.075J Statistical Thinking and Data Analysis. Fall 2011. Massachusetts Institute of Technology: MIT OpenCourseWare, https://ocw.mit.edu. License: Creative Commons BY-NC-SA. 3. Hair J.F, Black W.C, Babin B.J and Anderson R.E, Multivariate Data Analysis, 7th Edition, Pearson Education (South Asia), pp. 89-149. 4. Murugan Pattusamy and Jayanth Jacob, A test of Greenhaus and Allen (2011) model on Work Family Balance, Current Psychology, Springer, 2015. 5. Zumbo B.D. (2014) Univariate Tests. In: Michalos A.C. (eds) Encyclopedia of Quality of Life and Well-Being Research. Springer, Dordrecht ***************************************************************************