SlideShare a Scribd company logo
Page 1 of 15
DESCRIPTION OF THE TOPIC
Choosing the right statistical method for data analysis is always a challenge as it dependent on a
host of things.
Before we discuss the major determinants of choice of a method in detail, it is also important to
understand that one should have a Research/Data Analysis blueprint of the study one is
undertaking.
1. Research/Data Analysis Blueprint
Generally, the research starts with a broad research question that is often divided into more
measurable, narrower objectives (See Figure 1). Each objective is achieved by splitting the subject
matter into certain statistically testable hypotheses.
Items Description of Topic
Course Data Analysis for Social Science Teachers
Topic
Choosing the Right Statistical Method for
Data Analysis
Page 2 of 15
Figure 1: The Research Blueprint--Objective Hypotheses Mapping
There is no standard rule as to how many hypotheses a research objective can have. One research
objective might have one or two or more hypotheses. However, it is important that each objective
be split into one or more testable hypotheses.
In order that one is clear about how a hypothesis is tested, one must identify the variables
associated with each of the hypotheses (see Figure 2). There is no rule as to how many variables a
hypothesis will have. There could be a hypothesis with just one variable (such as test of population
mean to be equal to a number) or there could be two variables (like tests of hypothesis of
association or difference) or even more (like factor analysis/multiple regression).
Each of the variables is then identified as a Dependent or Independent variable given the nature of
the hypothesis being tested. Further against each variable, its level of measurement is noted. We
shall have them noted as Nominal, Ordinal, Interval or Ratio. Often the nominal and ordinal levels
are be combined into Categorical whereas the Interval and Ratio levels are labeled as Numerical.
Page 3 of 15
The categorical variable is also called Non-Metric or non-Parametric variable. The Numerical
Variables are also called metric or parametric or sometimes even as a continuous variable by some
authors.
Figure 2: The Research Blueprint—Objective-Hypothesis-Variable-Test Mapping
2. Major Determinants of Choice of s Statistical Method
The choice of particular statistical method is generally determined the following:
a) Number and Level of Measurement of Variables
b) Distribution of the variable
c) Dependence and Independence Structure
d) Nature of the Hypothesis
e) Sample Size
We shall now briefly discuss the above:
Page 4 of 15
2.1. Level of Measurement of Variables
We know that there are four levels of measurement:
a) Nominal
b) Ordinal
c) Interval
d) Ratio
Often the nominal and ordinal levels are to be combined into Categorical whereas the Interval and
Ratio levels are to be labeled as Numerical. The categorical variable is also called Non-Metric or
non-Parametric variable. The Numerical Variables are also called metric or parametric or
sometimes it is even called a continuous variable by some authors.
While choosing a particular test, we shall be asking the question:
What is the level of measurement of the data?
--Nominal/Ordinal/interval/Ration
Or simply Categorical or Numerical?
2.2. Distribution of Underlying Variables
Based on the level of measurement, the data might follow a distribution like Normal, Binominal,
Poisson etc. and it might not have a distribution. The variables measured on nominal and ordinal
scales generally do not have any distribution whereas the numerical variables might follow a
normal distribution or other distribution. The tests that are used when the categorical variables are
involved are called non-parametric or distribution-free tests. The tests that are used with numerical
variables will be called parametric tests.
While choosing a particular test we shall be asking the question:
Page 5 of 15
Is the data parametric (measured on a numerical scale) or non-parametric (measured on
a categorical scale)?
2.3. Nature of Hypothesis
Broadly a hypothesis can be categorized as:
a) Hypothesis of Association/Causation and
b) Hypothesis of Differences
The hypothesis of association/causation examines the nature and strength of the relationship
between variables. Correlation, Regression are such examples.
The hypothesis of difference examines whether the two populations differ on a parameter like
mean. Using hypothesis of difference, we generally test the equality of two or more population
means.
While choosing a particular test, we shall be asking the question:
What is the nature of the hypothesis?
---Hypothesis of Association/Causation OR Hypothesis of Differences
2.4. No. of Variables in the Hypothesis
The number of variables associated with a hypothesis is also an important determinant of the
choice of a statistical technique.
Based on the number of variables, we sometimes even classify the statistical techniques as
Univariate (involving one variable) /Bi-variate (two variables)/ Multivariate (more than two)
techniques.
Page 6 of 15
While choosing a particular test, we shall be asking the question:
How many Variables are there in the hypothesis?
-- One or two or more than two
3. An approach for Choosing a Statistical Method
Several authors present different approaches to choose a statistical method. An approach generally
involves starting with one of the above determinants and drilling down with other determinants.
For instance, we might start with the question: What is the nature of the hypothesis? Then, ask the
question: How many variables are involved? And then ask: What is the level of measurement of
each of the variables? And so on. Alternatively, we might start with, say, the number of variables
in the hypothesis, then the nature of the hypothesis and so on.
We suggest starting with the question of a number of variables. The following sections present the
self-explanatory flow charts of how to choose a test once you started with the question: How many
variables are involved in the hypothesis? One or two or more than two. Accordingly, the sections
are titled as Statistical Methods for Univariate /Bi-variate /Multivariate data
3.1. Statistical Methods for Univariate Data
Figure 3 presents the flowchart of how a method can be chosen when the hypothesis involves just
one variable.
Page 7 of 15
Figure 3: Statistical Methods for Univariate Data
We will ask what is it that we are trying to do. Are we trying to describe the data or Are we trying
to make an inference? Trying to make an inference with univariate data generally involves testing
whether the population mean equals a particular numeral like whether µ =3..
Let us look at the first wing: Descriptive statistics.
The kind of descriptive statistics we can use to describe the univariate data straight away depends
on the level of measurement of the variable.
● For nominal data, the measure of central tendency is always mode and mode is the only
choice if your data is nominal. Further, we don't have any measure of spread or variance
when data is on a nominal scale.
● When data is on an ordinal scale, we have two choices of central tendency that is mode and
median. We can use the interquartile range as a measure of dispersion or variance.
Page 8 of 15
● When data is measured in interval or ratio scale, we can use all the three measures of central
tendency, i.e. mean, median and mode. And we can also use several measures of dispersion
such as interquartile range, range, variance and standard deviation.
On the other hand, if we are interested in the hypothesis whether the population mean equals a
particular numeral like µ =3?So, in this case, we call it a hypothesis of difference involving a single
variable and the test is one-sample t-test. Our univariate data is on the numerical scale (interval or
ratio), so we use the one-sample t-test.
3.2. Statistical Methods for Bi-variate Data
Quite often, we will be interested in testing the hypothesis that involves two variables or
sometimes we also have one variable measured across two samples.
Figure 4 presents the flowchart of how a method can be chosen when the hypothesis involves two
variables or two samples measured on one variable.
Page 9 of 15
Figure 4: Statistical Methods for Bi-variate Data
We will start with the question:
What is the nature of the hypothesis?
---Hypothesis of Association/Causation OR Hypothesis of Differences
1. Hypothesis of Difference: A hypothesis of difference in this context generally involves testing
for the equality of two population means (whether µ1=µ2?).
Page 10 of 15
Then, we can ask this question :
Is this data parametric or non-parametric?
When the data is parametric(meaning the underlying variable has a distribution), we will ask this
question whether the samples are independent or dependent. In independent samples, we measure
one variable on two samples whereas a dependent sample generally involves repeated
measurements(twice) of the same variables on a single sample.
If the samples happen to be independent, we use an independent sample t-test, otherwise we use a
paired sample t-test.
And for non-parametric data, we use the Mann-Whitney U test to test the hypothesis of differences.
2. Hypothesis of Association: In Hypothesis of Association again we ask this question whether
the data is parametric or non-parametric. And if the data is parametric, the next level question is
whether we want to look at the association between the two variables or there is a cause-effect
relationship. In Association between the variables, we simply try to know whether two variables
are related. Whereas in causation one of the variables is dependent and the other will be
independent and we just want to see to what extent the independent variable explains the changes
in the dependent variable.
For parametric data, when we are examining the association; the test will be the Pearson
coefficient correlation. And for causation we use Regression.
For non-parametric data, we ask a next level question: whether the data is measured on a nominal
or ordinal scale. If it measured on a nominal scale we use Chi-square test of association. If the data
is measured on an ordinal scale, we use Spearman’s Rank correlation.
Page 11 of 15
3.3. Statistical Methods for Multivariate Data
Figure 5 presents the flowchart of how a method can be chosen when the hypothesis involves more
than two variables.
Figure 5: Statistical Methods for Multivariate Data (1)
In multivariate data, again we will start with the same question whether it is the hypothesis of
difference or the hypothesis of association.
Under the hypothesis of difference again we need to know that data is parametric or non-
parametric. When the data happened to be parametric, we use ANOVA and if the data is
nonparametric, we use Kruskal-Wallis.
Page 12 of 15
Testing a Hypothesis of Association we can ask the question: What is the level of measurement
of the dependent variable, i.e., numerical or categorical?
When the dependent variable is numerical, the next question is to look at whether all independent
variables are also numerical? If all the independent variables are also numerical, then we use
Multiple Regression.
When the dependent variable is categorical, then we look at the type of independent variables. If
all the independent variables are numerical, then we use Multiple Discriminant Analysis. We may
have a case where one or two independent variables are categorical and other variables are
numerical. In this case we use Logistic Regression.
Figure 6 presents the flowchart of how a method is chosen in some special cases involving more
than two variables.
Page 13 of 15
Figure 6: Statistical Methods for Univariate Data (2)
When we are interested in variable/Dimension Reduction that means we don’t have dependent and
independent relation between the variables or when we are working at the item level and we would
like to group the items into certain variables, we use the Factor Analysis. And, of course, the factor
analysis has two variants: exploratory analysis and conformity analysis.
And sometimes we are interested, based on some criteria, to group the cases or respondents(not
the variables) of our study then in such case we will use Cluster Analysis.
The major difference between the factor Analysis and Cluster analysis is:
In Factor Analysis, several variables or several items are grouped into fewer Dimensions or fewer
Variables. In Cluster Analysis, the respondents or subjects in the study are grouped into certain
clusters.
Page 14 of 15
We might also have a situation where you examine several relationships and there are multiple
dependencies. Then, we use Structural Equation Modelling.
4. Choosing between the Z Test and t-test
One more important confusion normally people have is when to use Z -test and when to use t-test.
In the previous discussion, wherever we used t-test that could be a possibility, Z-test can be used.
Figure 7 presents the flow chart of how to choose between a t-test and z-test.
Figure 7: Choosing between Z test and t-test
We start with the question: Is population normal? If the population is normal, then we go with
another question: Is the standard deviation of the population known?
Page 15 of 15
If population is normal and the standard deviation of the population is known, we use Z-test. If
the standard deviation of the population is not known, then we use t-test.
If the population is not normal, then we ask the question as to whether the sample size is more than
or equal to 30. If the sample size is more than or equal to 30, then we go back to the same logic of
asking the question: Is the standard deviation of the population known? If the standard deviation
of the population is known, we use Z-test. If the standard deviation of the population is not known
then we use t-test.
If the sample size is not more than 30, we need to ask whether it is a large population. If it is a
large population, we use Binomial test; if it is not a large population, we use Hyper Geometric
Test.
References
1. Hair, J. F., Black, W. C., Babin, B. J., & Anderson, R. E. (2013). Multivariate data analysis:
Pearson new international edition. Pearson Higher Ed.
2. Field, A. (2013). Discovering statistics using IBM SPSS statistics. sage.

More Related Content

What's hot

Basic Statistics & Data Analysis
Basic Statistics & Data AnalysisBasic Statistics & Data Analysis
Basic Statistics & Data Analysis
Ajendra Sharma
 
Analysis of data in research
Analysis of data in researchAnalysis of data in research
Analysis of data in research
Abhijeet Birari
 
Statistics in research
Statistics in researchStatistics in research
Statistics in research
Balaji P
 
Types of data
Types of data Types of data
Types of data
Kiran Rawat
 
Brm (one tailed and two tailed hypothesis)
Brm (one tailed and two tailed hypothesis)Brm (one tailed and two tailed hypothesis)
Brm (one tailed and two tailed hypothesis)Upama Dwivedi
 
Data analysis and Presentation
Data analysis and PresentationData analysis and Presentation
Data analysis and Presentation
Jignesh Kariya
 
Multinomial logisticregression basicrelationships
Multinomial logisticregression basicrelationshipsMultinomial logisticregression basicrelationships
Multinomial logisticregression basicrelationships
Anirudha si
 
DATA Types
DATA TypesDATA Types
DATA Types
Aniruddha Deshmukh
 
Chapter 6 formulation of hypothesis
Chapter 6 formulation of hypothesisChapter 6 formulation of hypothesis
Chapter 6 formulation of hypothesis
NiranjanHN3
 
Data analysis
Data analysisData analysis
Data analysis
Mira K Desai
 
statistic
statisticstatistic
statistic
Pwalmiki
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
Attaullah Khan
 
Parametric & non parametric
Parametric & non parametricParametric & non parametric
Parametric & non parametric
ANCYBS
 
Non parametric tests by meenu
Non parametric tests by meenuNon parametric tests by meenu
Non parametric tests by meenu
meenu saharan
 
Sampling techniques and types
Sampling techniques and typesSampling techniques and types
Sampling techniques and types
NITISH SADOTRA
 
TYPES OF RESEARCH
TYPES OF RESEARCHTYPES OF RESEARCH
TYPES OF RESEARCH
vaishnavi vishwanath
 
Methods of Data Collection in Quantitative Research (Biostatistik)
Methods of Data Collection in Quantitative Research (Biostatistik)Methods of Data Collection in Quantitative Research (Biostatistik)
Methods of Data Collection in Quantitative Research (Biostatistik)
AKak Long
 
descriptive and inferential statistics
descriptive and inferential statisticsdescriptive and inferential statistics
descriptive and inferential statisticsMona Sajid
 

What's hot (20)

Basic Statistics & Data Analysis
Basic Statistics & Data AnalysisBasic Statistics & Data Analysis
Basic Statistics & Data Analysis
 
Spss an introduction
Spss  an introductionSpss  an introduction
Spss an introduction
 
Analysis of data in research
Analysis of data in researchAnalysis of data in research
Analysis of data in research
 
Statistics in research
Statistics in researchStatistics in research
Statistics in research
 
Types of data
Types of data Types of data
Types of data
 
Brm (one tailed and two tailed hypothesis)
Brm (one tailed and two tailed hypothesis)Brm (one tailed and two tailed hypothesis)
Brm (one tailed and two tailed hypothesis)
 
Data analysis and Presentation
Data analysis and PresentationData analysis and Presentation
Data analysis and Presentation
 
Multinomial logisticregression basicrelationships
Multinomial logisticregression basicrelationshipsMultinomial logisticregression basicrelationships
Multinomial logisticregression basicrelationships
 
DATA Types
DATA TypesDATA Types
DATA Types
 
Chapter 6 formulation of hypothesis
Chapter 6 formulation of hypothesisChapter 6 formulation of hypothesis
Chapter 6 formulation of hypothesis
 
Data analysis
Data analysisData analysis
Data analysis
 
statistic
statisticstatistic
statistic
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Parametric & non parametric
Parametric & non parametricParametric & non parametric
Parametric & non parametric
 
Research process
Research processResearch process
Research process
 
Non parametric tests by meenu
Non parametric tests by meenuNon parametric tests by meenu
Non parametric tests by meenu
 
Sampling techniques and types
Sampling techniques and typesSampling techniques and types
Sampling techniques and types
 
TYPES OF RESEARCH
TYPES OF RESEARCHTYPES OF RESEARCH
TYPES OF RESEARCH
 
Methods of Data Collection in Quantitative Research (Biostatistik)
Methods of Data Collection in Quantitative Research (Biostatistik)Methods of Data Collection in Quantitative Research (Biostatistik)
Methods of Data Collection in Quantitative Research (Biostatistik)
 
descriptive and inferential statistics
descriptive and inferential statisticsdescriptive and inferential statistics
descriptive and inferential statistics
 

Similar to Selection of appropriate data analysis technique

Parametric vs non parametric test
Parametric vs non parametric testParametric vs non parametric test
Parametric vs non parametric test
ar9530
 
Inferential Statistics - DAY 4 - B.Ed - AIOU
Inferential Statistics - DAY 4 - B.Ed - AIOUInferential Statistics - DAY 4 - B.Ed - AIOU
Inferential Statistics - DAY 4 - B.Ed - AIOU
EqraBaig
 
When to use, What Statistical Test for data Analysis modified.pptx
When to use, What Statistical Test for data Analysis modified.pptxWhen to use, What Statistical Test for data Analysis modified.pptx
When to use, What Statistical Test for data Analysis modified.pptx
Asokan R
 
statistical analysis.pptx
statistical analysis.pptxstatistical analysis.pptx
statistical analysis.pptx
hayatalakoum1
 
Need a nonplagiarised paper and a form completed by 1006015 before.docx
Need a nonplagiarised paper and a form completed by 1006015 before.docxNeed a nonplagiarised paper and a form completed by 1006015 before.docx
Need a nonplagiarised paper and a form completed by 1006015 before.docx
lea6nklmattu
 
© 2014 Laureate Education, Inc. Page 1 of 5 Week 4 A.docx
© 2014 Laureate Education, Inc.   Page 1 of 5  Week 4 A.docx© 2014 Laureate Education, Inc.   Page 1 of 5  Week 4 A.docx
© 2014 Laureate Education, Inc. Page 1 of 5 Week 4 A.docx
gerardkortney
 
F unit 5.pptx
F unit 5.pptxF unit 5.pptx
F unit 5.pptx
agreshgupta
 
Descriptive Analysis.pptx
Descriptive Analysis.pptxDescriptive Analysis.pptx
Descriptive Analysis.pptx
Parveen Vashisth
 
Presentation1
Presentation1Presentation1
Presentation1
Nalini Singh
 
t-test Parametric test Biostatics and Research Methodology
t-test Parametric test Biostatics and Research Methodologyt-test Parametric test Biostatics and Research Methodology
t-test Parametric test Biostatics and Research Methodology
Nigar Kadar Mujawar,Womens College of Pharmacy,Peth Vadgaon,Kolhapur,416112
 
Basic of Statistical Inference Part-V: Types of Hypothesis Test (Parametric)
Basic of Statistical Inference Part-V: Types of Hypothesis Test (Parametric) Basic of Statistical Inference Part-V: Types of Hypothesis Test (Parametric)
Basic of Statistical Inference Part-V: Types of Hypothesis Test (Parametric)
Dexlab Analytics
 
Data analysis powerpoint
Data analysis powerpointData analysis powerpoint
Data analysis powerpointjamiebrandon
 
Correlation and Regression - ANOVA - DAY 5 - B.Ed - 8614 - AIOU
Correlation and Regression - ANOVA - DAY 5 - B.Ed - 8614 - AIOUCorrelation and Regression - ANOVA - DAY 5 - B.Ed - 8614 - AIOU
Correlation and Regression - ANOVA - DAY 5 - B.Ed - 8614 - AIOU
EqraBaig
 
HYPOTHESES.pptx
HYPOTHESES.pptxHYPOTHESES.pptx
HYPOTHESES.pptx
TalhaKhan420569
 
April Heyward Research Methods Class Session - 8-5-2021
April Heyward Research Methods Class Session - 8-5-2021April Heyward Research Methods Class Session - 8-5-2021
April Heyward Research Methods Class Session - 8-5-2021
April Heyward
 
Methods of Statistical Analysis & Interpretation of Data..pptx
Methods of Statistical Analysis & Interpretation of Data..pptxMethods of Statistical Analysis & Interpretation of Data..pptx
Methods of Statistical Analysis & Interpretation of Data..pptx
heencomm
 
Analyzing quantitative data
Analyzing quantitative dataAnalyzing quantitative data
Analyzing quantitative data
mostafasharafiye
 
Chapter 13 Data Analysis Inferential Methods and Analysis of Time Series
Chapter 13 Data Analysis Inferential Methods and Analysis of Time SeriesChapter 13 Data Analysis Inferential Methods and Analysis of Time Series
Chapter 13 Data Analysis Inferential Methods and Analysis of Time Series
International advisers
 
Kinds Of Variables Kato Begum
Kinds Of Variables Kato BegumKinds Of Variables Kato Begum
Kinds Of Variables Kato BegumDr. Cupid Lucid
 

Similar to Selection of appropriate data analysis technique (20)

Parametric vs non parametric test
Parametric vs non parametric testParametric vs non parametric test
Parametric vs non parametric test
 
Inferential Statistics - DAY 4 - B.Ed - AIOU
Inferential Statistics - DAY 4 - B.Ed - AIOUInferential Statistics - DAY 4 - B.Ed - AIOU
Inferential Statistics - DAY 4 - B.Ed - AIOU
 
When to use, What Statistical Test for data Analysis modified.pptx
When to use, What Statistical Test for data Analysis modified.pptxWhen to use, What Statistical Test for data Analysis modified.pptx
When to use, What Statistical Test for data Analysis modified.pptx
 
statistical analysis.pptx
statistical analysis.pptxstatistical analysis.pptx
statistical analysis.pptx
 
Need a nonplagiarised paper and a form completed by 1006015 before.docx
Need a nonplagiarised paper and a form completed by 1006015 before.docxNeed a nonplagiarised paper and a form completed by 1006015 before.docx
Need a nonplagiarised paper and a form completed by 1006015 before.docx
 
© 2014 Laureate Education, Inc. Page 1 of 5 Week 4 A.docx
© 2014 Laureate Education, Inc.   Page 1 of 5  Week 4 A.docx© 2014 Laureate Education, Inc.   Page 1 of 5  Week 4 A.docx
© 2014 Laureate Education, Inc. Page 1 of 5 Week 4 A.docx
 
F unit 5.pptx
F unit 5.pptxF unit 5.pptx
F unit 5.pptx
 
Descriptive Analysis.pptx
Descriptive Analysis.pptxDescriptive Analysis.pptx
Descriptive Analysis.pptx
 
Presentation1
Presentation1Presentation1
Presentation1
 
t-test Parametric test Biostatics and Research Methodology
t-test Parametric test Biostatics and Research Methodologyt-test Parametric test Biostatics and Research Methodology
t-test Parametric test Biostatics and Research Methodology
 
Basic of Statistical Inference Part-V: Types of Hypothesis Test (Parametric)
Basic of Statistical Inference Part-V: Types of Hypothesis Test (Parametric) Basic of Statistical Inference Part-V: Types of Hypothesis Test (Parametric)
Basic of Statistical Inference Part-V: Types of Hypothesis Test (Parametric)
 
Advanced statistics
Advanced statisticsAdvanced statistics
Advanced statistics
 
Data analysis powerpoint
Data analysis powerpointData analysis powerpoint
Data analysis powerpoint
 
Correlation and Regression - ANOVA - DAY 5 - B.Ed - 8614 - AIOU
Correlation and Regression - ANOVA - DAY 5 - B.Ed - 8614 - AIOUCorrelation and Regression - ANOVA - DAY 5 - B.Ed - 8614 - AIOU
Correlation and Regression - ANOVA - DAY 5 - B.Ed - 8614 - AIOU
 
HYPOTHESES.pptx
HYPOTHESES.pptxHYPOTHESES.pptx
HYPOTHESES.pptx
 
April Heyward Research Methods Class Session - 8-5-2021
April Heyward Research Methods Class Session - 8-5-2021April Heyward Research Methods Class Session - 8-5-2021
April Heyward Research Methods Class Session - 8-5-2021
 
Methods of Statistical Analysis & Interpretation of Data..pptx
Methods of Statistical Analysis & Interpretation of Data..pptxMethods of Statistical Analysis & Interpretation of Data..pptx
Methods of Statistical Analysis & Interpretation of Data..pptx
 
Analyzing quantitative data
Analyzing quantitative dataAnalyzing quantitative data
Analyzing quantitative data
 
Chapter 13 Data Analysis Inferential Methods and Analysis of Time Series
Chapter 13 Data Analysis Inferential Methods and Analysis of Time SeriesChapter 13 Data Analysis Inferential Methods and Analysis of Time Series
Chapter 13 Data Analysis Inferential Methods and Analysis of Time Series
 
Kinds Of Variables Kato Begum
Kinds Of Variables Kato BegumKinds Of Variables Kato Begum
Kinds Of Variables Kato Begum
 

More from RajaKrishnan M

Shortcomings of Demat Account
Shortcomings of Demat AccountShortcomings of Demat Account
Shortcomings of Demat Account
RajaKrishnan M
 
Demat Account Services
Demat Account ServicesDemat Account Services
Demat Account Services
RajaKrishnan M
 
Depository Participant
Depository ParticipantDepository Participant
Depository Participant
RajaKrishnan M
 
Services provided in Mobile Banking
Services provided in Mobile BankingServices provided in Mobile Banking
Services provided in Mobile Banking
RajaKrishnan M
 
Ombudsman scheme
Ombudsman scheme Ombudsman scheme
Ombudsman scheme
RajaKrishnan M
 
Factors affecting share price
Factors affecting share priceFactors affecting share price
Factors affecting share price
RajaKrishnan M
 
Rights of investors
Rights of investorsRights of investors
Rights of investors
RajaKrishnan M
 
Loss of Confidence of small investors
Loss of Confidence of small investorsLoss of Confidence of small investors
Loss of Confidence of small investors
RajaKrishnan M
 
Facilities by BSE
Facilities by BSEFacilities by BSE
Facilities by BSE
RajaKrishnan M
 
Technological forces fueling e-commerce
Technological forces fueling e-commerceTechnological forces fueling e-commerce
Technological forces fueling e-commerce
RajaKrishnan M
 
Encryption and Decryption
Encryption and DecryptionEncryption and Decryption
Encryption and Decryption
RajaKrishnan M
 
Meaning, Anatomy and Forces Fueling e-commerce
Meaning, Anatomy and Forces Fueling e-commerceMeaning, Anatomy and Forces Fueling e-commerce
Meaning, Anatomy and Forces Fueling e-commerce
RajaKrishnan M
 
Forces Fueling e-commerce
Forces Fueling e-commerceForces Fueling e-commerce
Forces Fueling e-commerce
RajaKrishnan M
 
Inter Organizational e-commerce
Inter Organizational e-commerceInter Organizational e-commerce
Inter Organizational e-commerce
RajaKrishnan M
 
Factors for the success of m-commerce
Factors for the success of m-commerceFactors for the success of m-commerce
Factors for the success of m-commerce
RajaKrishnan M
 
Advantages of E-Commerce
Advantages of E-CommerceAdvantages of E-Commerce
Advantages of E-Commerce
RajaKrishnan M
 
Types of E-Commerce
Types of E-CommerceTypes of E-Commerce
Types of E-Commerce
RajaKrishnan M
 
E-Commerce and E- Businesss
E-Commerce and E- BusinesssE-Commerce and E- Businesss
E-Commerce and E- Businesss
RajaKrishnan M
 
Electronic Data Interchange & Internet
Electronic Data Interchange & InternetElectronic Data Interchange & Internet
Electronic Data Interchange & Internet
RajaKrishnan M
 

More from RajaKrishnan M (20)

Shortcomings of Demat Account
Shortcomings of Demat AccountShortcomings of Demat Account
Shortcomings of Demat Account
 
Demat Account Services
Demat Account ServicesDemat Account Services
Demat Account Services
 
Depository Participant
Depository ParticipantDepository Participant
Depository Participant
 
Services provided in Mobile Banking
Services provided in Mobile BankingServices provided in Mobile Banking
Services provided in Mobile Banking
 
Ombudsman scheme
Ombudsman scheme Ombudsman scheme
Ombudsman scheme
 
Factors affecting share price
Factors affecting share priceFactors affecting share price
Factors affecting share price
 
Rights of investors
Rights of investorsRights of investors
Rights of investors
 
Loss of Confidence of small investors
Loss of Confidence of small investorsLoss of Confidence of small investors
Loss of Confidence of small investors
 
Facilities by BSE
Facilities by BSEFacilities by BSE
Facilities by BSE
 
Technological forces fueling e-commerce
Technological forces fueling e-commerceTechnological forces fueling e-commerce
Technological forces fueling e-commerce
 
Encryption and Decryption
Encryption and DecryptionEncryption and Decryption
Encryption and Decryption
 
Meaning, Anatomy and Forces Fueling e-commerce
Meaning, Anatomy and Forces Fueling e-commerceMeaning, Anatomy and Forces Fueling e-commerce
Meaning, Anatomy and Forces Fueling e-commerce
 
Forces Fueling e-commerce
Forces Fueling e-commerceForces Fueling e-commerce
Forces Fueling e-commerce
 
Inter Organizational e-commerce
Inter Organizational e-commerceInter Organizational e-commerce
Inter Organizational e-commerce
 
Factors for the success of m-commerce
Factors for the success of m-commerceFactors for the success of m-commerce
Factors for the success of m-commerce
 
Advantages of E-Commerce
Advantages of E-CommerceAdvantages of E-Commerce
Advantages of E-Commerce
 
Types of E-Commerce
Types of E-CommerceTypes of E-Commerce
Types of E-Commerce
 
E-Commerce and E- Businesss
E-Commerce and E- BusinesssE-Commerce and E- Businesss
E-Commerce and E- Businesss
 
RFID
RFIDRFID
RFID
 
Electronic Data Interchange & Internet
Electronic Data Interchange & InternetElectronic Data Interchange & Internet
Electronic Data Interchange & Internet
 

Recently uploaded

Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Thiyagu K
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
Vivekanand Anglo Vedic Academy
 
Group Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana BuscigliopptxGroup Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana Buscigliopptx
ArianaBusciglio
 
A Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptxA Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptx
thanhdowork
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
heathfieldcps1
 
Chapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdfChapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdf
Kartik Tiwari
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
MysoreMuleSoftMeetup
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
Thiyagu K
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
Vikramjit Singh
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
EugeneSaldivar
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
vaibhavrinwa19
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
timhan337
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
SACHIN R KONDAGURI
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
EduSkills OECD
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
DhatriParmar
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
Peter Windle
 
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBCSTRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
kimdan468
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 

Recently uploaded (20)

Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
 
Group Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana BuscigliopptxGroup Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana Buscigliopptx
 
A Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptxA Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptx
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
 
Chapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdfChapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdf
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
 
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBCSTRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 

Selection of appropriate data analysis technique

  • 1. Page 1 of 15 DESCRIPTION OF THE TOPIC Choosing the right statistical method for data analysis is always a challenge as it dependent on a host of things. Before we discuss the major determinants of choice of a method in detail, it is also important to understand that one should have a Research/Data Analysis blueprint of the study one is undertaking. 1. Research/Data Analysis Blueprint Generally, the research starts with a broad research question that is often divided into more measurable, narrower objectives (See Figure 1). Each objective is achieved by splitting the subject matter into certain statistically testable hypotheses. Items Description of Topic Course Data Analysis for Social Science Teachers Topic Choosing the Right Statistical Method for Data Analysis
  • 2. Page 2 of 15 Figure 1: The Research Blueprint--Objective Hypotheses Mapping There is no standard rule as to how many hypotheses a research objective can have. One research objective might have one or two or more hypotheses. However, it is important that each objective be split into one or more testable hypotheses. In order that one is clear about how a hypothesis is tested, one must identify the variables associated with each of the hypotheses (see Figure 2). There is no rule as to how many variables a hypothesis will have. There could be a hypothesis with just one variable (such as test of population mean to be equal to a number) or there could be two variables (like tests of hypothesis of association or difference) or even more (like factor analysis/multiple regression). Each of the variables is then identified as a Dependent or Independent variable given the nature of the hypothesis being tested. Further against each variable, its level of measurement is noted. We shall have them noted as Nominal, Ordinal, Interval or Ratio. Often the nominal and ordinal levels are be combined into Categorical whereas the Interval and Ratio levels are labeled as Numerical.
  • 3. Page 3 of 15 The categorical variable is also called Non-Metric or non-Parametric variable. The Numerical Variables are also called metric or parametric or sometimes even as a continuous variable by some authors. Figure 2: The Research Blueprint—Objective-Hypothesis-Variable-Test Mapping 2. Major Determinants of Choice of s Statistical Method The choice of particular statistical method is generally determined the following: a) Number and Level of Measurement of Variables b) Distribution of the variable c) Dependence and Independence Structure d) Nature of the Hypothesis e) Sample Size We shall now briefly discuss the above:
  • 4. Page 4 of 15 2.1. Level of Measurement of Variables We know that there are four levels of measurement: a) Nominal b) Ordinal c) Interval d) Ratio Often the nominal and ordinal levels are to be combined into Categorical whereas the Interval and Ratio levels are to be labeled as Numerical. The categorical variable is also called Non-Metric or non-Parametric variable. The Numerical Variables are also called metric or parametric or sometimes it is even called a continuous variable by some authors. While choosing a particular test, we shall be asking the question: What is the level of measurement of the data? --Nominal/Ordinal/interval/Ration Or simply Categorical or Numerical? 2.2. Distribution of Underlying Variables Based on the level of measurement, the data might follow a distribution like Normal, Binominal, Poisson etc. and it might not have a distribution. The variables measured on nominal and ordinal scales generally do not have any distribution whereas the numerical variables might follow a normal distribution or other distribution. The tests that are used when the categorical variables are involved are called non-parametric or distribution-free tests. The tests that are used with numerical variables will be called parametric tests. While choosing a particular test we shall be asking the question:
  • 5. Page 5 of 15 Is the data parametric (measured on a numerical scale) or non-parametric (measured on a categorical scale)? 2.3. Nature of Hypothesis Broadly a hypothesis can be categorized as: a) Hypothesis of Association/Causation and b) Hypothesis of Differences The hypothesis of association/causation examines the nature and strength of the relationship between variables. Correlation, Regression are such examples. The hypothesis of difference examines whether the two populations differ on a parameter like mean. Using hypothesis of difference, we generally test the equality of two or more population means. While choosing a particular test, we shall be asking the question: What is the nature of the hypothesis? ---Hypothesis of Association/Causation OR Hypothesis of Differences 2.4. No. of Variables in the Hypothesis The number of variables associated with a hypothesis is also an important determinant of the choice of a statistical technique. Based on the number of variables, we sometimes even classify the statistical techniques as Univariate (involving one variable) /Bi-variate (two variables)/ Multivariate (more than two) techniques.
  • 6. Page 6 of 15 While choosing a particular test, we shall be asking the question: How many Variables are there in the hypothesis? -- One or two or more than two 3. An approach for Choosing a Statistical Method Several authors present different approaches to choose a statistical method. An approach generally involves starting with one of the above determinants and drilling down with other determinants. For instance, we might start with the question: What is the nature of the hypothesis? Then, ask the question: How many variables are involved? And then ask: What is the level of measurement of each of the variables? And so on. Alternatively, we might start with, say, the number of variables in the hypothesis, then the nature of the hypothesis and so on. We suggest starting with the question of a number of variables. The following sections present the self-explanatory flow charts of how to choose a test once you started with the question: How many variables are involved in the hypothesis? One or two or more than two. Accordingly, the sections are titled as Statistical Methods for Univariate /Bi-variate /Multivariate data 3.1. Statistical Methods for Univariate Data Figure 3 presents the flowchart of how a method can be chosen when the hypothesis involves just one variable.
  • 7. Page 7 of 15 Figure 3: Statistical Methods for Univariate Data We will ask what is it that we are trying to do. Are we trying to describe the data or Are we trying to make an inference? Trying to make an inference with univariate data generally involves testing whether the population mean equals a particular numeral like whether µ =3.. Let us look at the first wing: Descriptive statistics. The kind of descriptive statistics we can use to describe the univariate data straight away depends on the level of measurement of the variable. ● For nominal data, the measure of central tendency is always mode and mode is the only choice if your data is nominal. Further, we don't have any measure of spread or variance when data is on a nominal scale. ● When data is on an ordinal scale, we have two choices of central tendency that is mode and median. We can use the interquartile range as a measure of dispersion or variance.
  • 8. Page 8 of 15 ● When data is measured in interval or ratio scale, we can use all the three measures of central tendency, i.e. mean, median and mode. And we can also use several measures of dispersion such as interquartile range, range, variance and standard deviation. On the other hand, if we are interested in the hypothesis whether the population mean equals a particular numeral like µ =3?So, in this case, we call it a hypothesis of difference involving a single variable and the test is one-sample t-test. Our univariate data is on the numerical scale (interval or ratio), so we use the one-sample t-test. 3.2. Statistical Methods for Bi-variate Data Quite often, we will be interested in testing the hypothesis that involves two variables or sometimes we also have one variable measured across two samples. Figure 4 presents the flowchart of how a method can be chosen when the hypothesis involves two variables or two samples measured on one variable.
  • 9. Page 9 of 15 Figure 4: Statistical Methods for Bi-variate Data We will start with the question: What is the nature of the hypothesis? ---Hypothesis of Association/Causation OR Hypothesis of Differences 1. Hypothesis of Difference: A hypothesis of difference in this context generally involves testing for the equality of two population means (whether µ1=µ2?).
  • 10. Page 10 of 15 Then, we can ask this question : Is this data parametric or non-parametric? When the data is parametric(meaning the underlying variable has a distribution), we will ask this question whether the samples are independent or dependent. In independent samples, we measure one variable on two samples whereas a dependent sample generally involves repeated measurements(twice) of the same variables on a single sample. If the samples happen to be independent, we use an independent sample t-test, otherwise we use a paired sample t-test. And for non-parametric data, we use the Mann-Whitney U test to test the hypothesis of differences. 2. Hypothesis of Association: In Hypothesis of Association again we ask this question whether the data is parametric or non-parametric. And if the data is parametric, the next level question is whether we want to look at the association between the two variables or there is a cause-effect relationship. In Association between the variables, we simply try to know whether two variables are related. Whereas in causation one of the variables is dependent and the other will be independent and we just want to see to what extent the independent variable explains the changes in the dependent variable. For parametric data, when we are examining the association; the test will be the Pearson coefficient correlation. And for causation we use Regression. For non-parametric data, we ask a next level question: whether the data is measured on a nominal or ordinal scale. If it measured on a nominal scale we use Chi-square test of association. If the data is measured on an ordinal scale, we use Spearman’s Rank correlation.
  • 11. Page 11 of 15 3.3. Statistical Methods for Multivariate Data Figure 5 presents the flowchart of how a method can be chosen when the hypothesis involves more than two variables. Figure 5: Statistical Methods for Multivariate Data (1) In multivariate data, again we will start with the same question whether it is the hypothesis of difference or the hypothesis of association. Under the hypothesis of difference again we need to know that data is parametric or non- parametric. When the data happened to be parametric, we use ANOVA and if the data is nonparametric, we use Kruskal-Wallis.
  • 12. Page 12 of 15 Testing a Hypothesis of Association we can ask the question: What is the level of measurement of the dependent variable, i.e., numerical or categorical? When the dependent variable is numerical, the next question is to look at whether all independent variables are also numerical? If all the independent variables are also numerical, then we use Multiple Regression. When the dependent variable is categorical, then we look at the type of independent variables. If all the independent variables are numerical, then we use Multiple Discriminant Analysis. We may have a case where one or two independent variables are categorical and other variables are numerical. In this case we use Logistic Regression. Figure 6 presents the flowchart of how a method is chosen in some special cases involving more than two variables.
  • 13. Page 13 of 15 Figure 6: Statistical Methods for Univariate Data (2) When we are interested in variable/Dimension Reduction that means we don’t have dependent and independent relation between the variables or when we are working at the item level and we would like to group the items into certain variables, we use the Factor Analysis. And, of course, the factor analysis has two variants: exploratory analysis and conformity analysis. And sometimes we are interested, based on some criteria, to group the cases or respondents(not the variables) of our study then in such case we will use Cluster Analysis. The major difference between the factor Analysis and Cluster analysis is: In Factor Analysis, several variables or several items are grouped into fewer Dimensions or fewer Variables. In Cluster Analysis, the respondents or subjects in the study are grouped into certain clusters.
  • 14. Page 14 of 15 We might also have a situation where you examine several relationships and there are multiple dependencies. Then, we use Structural Equation Modelling. 4. Choosing between the Z Test and t-test One more important confusion normally people have is when to use Z -test and when to use t-test. In the previous discussion, wherever we used t-test that could be a possibility, Z-test can be used. Figure 7 presents the flow chart of how to choose between a t-test and z-test. Figure 7: Choosing between Z test and t-test We start with the question: Is population normal? If the population is normal, then we go with another question: Is the standard deviation of the population known?
  • 15. Page 15 of 15 If population is normal and the standard deviation of the population is known, we use Z-test. If the standard deviation of the population is not known, then we use t-test. If the population is not normal, then we ask the question as to whether the sample size is more than or equal to 30. If the sample size is more than or equal to 30, then we go back to the same logic of asking the question: Is the standard deviation of the population known? If the standard deviation of the population is known, we use Z-test. If the standard deviation of the population is not known then we use t-test. If the sample size is not more than 30, we need to ask whether it is a large population. If it is a large population, we use Binomial test; if it is not a large population, we use Hyper Geometric Test. References 1. Hair, J. F., Black, W. C., Babin, B. J., & Anderson, R. E. (2013). Multivariate data analysis: Pearson new international edition. Pearson Higher Ed. 2. Field, A. (2013). Discovering statistics using IBM SPSS statistics. sage.