2. HYPOTHESIS
TESTING
Hypothesis Testing refers to
1. Making an assumption , called hypothesis , about a population
parameter .
2. Collecting sample data
3. Calculating sample statistics
4. Using the sample statistics to evaluate hypothesis ( how likely it is
that our hypothesized parameter is correct . To test the validity of our
assumption we determine the difference between the hypothesized
parameter value and sample value . )
3. NULL
HYPOTHESIS
The basic assumption regarding population parameter which
can be tested is called null hypothesis .
In other words the statement that may difference between
observed sample statistics and specified population parameter
is due to a sampling error called null hypothesis .
Therefore the null hypothesis means hypothesis of no difference
and it is denoted by Ho .
4. ALTERNATE
HYPOTHESIS
• When the null hypothesis is rejected than the
assumption taken as true is called alternate
hypothesis .
• It is denoted by Ha .
• The alternative hypothesis is a statement used in statistical
inference experiment. It is contradictory to the null hypothesis .
5. STEPS OF TESTING OF
HYPOTHESIS
1.State the null and alternate hypothesis
.
2.Choose the level of significance at size
α .
3.Determine the critical region
4.Use test statistics
5.Making decision or conclusion
6. PARAMETRIC
TEST
• The basic principle behind the parametric tests is that we have a fixed set of
parameters that are used to determine a probabilistic model .
• Parametric tests are those tests for which we have prior knowledge of the
population distribution .
• The parameters used in these test includes mean, standard deviation &
variance.
• The different parametric test are :
T-test
F-test
Z-test
Anova test
7. T-
TEST
• It is a parametric test of hypothesis testing based on Student’s T distribution.
• It is essentially, testing the significance of the difference of the mean values when the sample
size is small (i.e, less than 30)
Assumptions :
•Population distribution is normal, and
•Samples are random and independent
•The sample size is small.
•Population standard deviation is not known.
One Sample T-test: To compare a sample mean
with that of the population mean.
Two-Sample T-test: To compare the
means of two different samples.
t=
𝑥1−𝑥2
𝑠1
2
𝑁1
+
𝑠2
2
𝑁2
𝑧 =
𝑥 − 𝑢
𝜎 𝑛
8. Z-
TEST
1. It is a parametric test of hypothesis testing.
2. It is used to determine whether the means are different when the population variance is
known and the sample size is large ( i.e. , greater than 30).
Assumptions :
•Population distribution is normal
•Samples are random and independent.
•The sample size is large.
•Population standard deviation is known.
One Sample Z-test : To compare a sample mean
with that of the population mean.
Two Sample Z-test : To compare the means
of two different samples.
𝑧 =
𝑥1 + 𝑥2
𝜎1
2
𝑛1
+
𝜎2
2
𝑛2
𝑧 =
𝑥 − 𝑢
𝜎 𝑛
9. F-
TEST
1. It is a parametric test of hypothesis testing based on Snedecor F-distribution.
2. It is a test for the null hypothesis that two normal populations have the same variance.
3. An F-test is regarded as a comparison of equality of sample variances.
4. F-statistic is simply a ratio of two variances.
5. It is calculated :
F = s1
2/s2
2
𝑠2 = 𝑖=1
𝑛
𝑥𝑖 − 𝑥 2
𝑛 − 1
Assumptions :
•Population distribution is normal, and
•Samples are drawn randomly and independently.
10. ANOV
A
1. Also called as Analysis of variance, it is a parametric test of hypothesis testing.
2. It is an extension of the T-Test and Z-test.
3. It is used to test the significance of the differences in the mean values among more than two
sample groups.
4. It uses F-test to statistically test the equality of means and the relative variance between them.
Assumptions :
•Population distribution is normal, and
•Samples are random and independent.
•Homogeneity of sample variance.
11. NON PARAMETRIC
TEST
• Non-parametric test is a statistical analysis method that does not assume the population
data belongs to some prescribed distribution which is determined by some parameters.
• When the data does not meet the requirements to perform a parametric test, a non-
parametric test is used to analyze it.
When the distribution is skewed, a non-parametric test is used.
If the size of the data is too small then validating the distribution of the data
becomes difficult.
If the data is nominal or ordinal, a non-parametric test is used.
• Types of non parametric test :
Chi-square test
Mann-Whitney U Test
Wilcoxon Signed Rank Test
Sign Test
Kruskal Wallis Test
12. CHI-SQUARE TEST
A chi-squared test (symbolically represented as χ2) is basically a data analysis on the basis of
observations of a random set of variables.
It is a comparison of two statistical data sets.
When we consider, the null speculation is true, the sampling distribution of the test statistic is called
as chi-squared distribution.
Finding P-Value :
• P stands for probability here.
• To calculate the p-value, the chi-square test is used in statistics.
•P≤ 0.05; Hypothesis rejected
•P>.05; Hypothesis Accepted
Formula Used :
𝑥2
=
𝑜𝑖 − 𝜀𝑖
2
𝜀𝑖
13. MANN-WHITNEY U
TEST
• This non-parametric test is analogous to t-tests for independent samples. To conduct
such a test the distribution must contain ordinal data. It is also known as the
Wilcoxon rank sum test.
• Null Hypothesis: H0: The two populations under consideration must be equal.
• Test Statistic: U should be smaller of
OR
• where, R1R1 is the sum of ranks in group 1 and R2R2 is the sum of ranks in group
2.
• Decision Criteria: Reject the null hypothesis if U < critical value.
𝑈 = 𝑛1𝑛2 +
𝑛1 𝑛1 + 1
2
− 𝑅1 𝑈 = 𝑛1𝑛2 +
𝑛2 𝑛2 + 1
2
− 𝑅2
14. WILCOXON SIGNED RANK
TEST
• This is the non-parametric test whose counterpart is the
parametric paired t-test.
• It is used to compare two samples that contain ordinal data
and are dependent.
• The Wilcoxon signed rank test assumes that the data comes
from a symmetric distribution.
• Null Hypothesis: H0: The difference in the median is 0.
• Test Statistic: W. W is defined as the smaller of the sums of
the negative and positive ranks.
• Decision Criteria: Reject the null hypothesis if W < critical
value.
15. SIGN
TEST
• This non-parametric test is the parametric counterpart to
the paired samples t-test.
• The sign test is similar to the Wilcoxon sign test.
• Null Hypothesis: H0: The difference in the median is 0.
• Test Statistic: The smaller value among the number of
positive and negative signs.
• Decision Criteria: Reject the null hypothesis if the test
statistic < critical value.
16. KRUSKAL WALLIS
TEST
• The parametric one-way ANOVA test is analogous to the non-parametric
Kruskal Wallis test. It is used for comparing more than two groups of data
that are independent and ordinal.
• Null Hypothesis: H0H0: m population medians are equal
• Test Statistic:
𝐻 =
12
𝑁 𝑁 + 1
𝛴1
𝑚
𝑅𝑖
2
𝑛𝑖
− 3 𝑁 + 1
• where, N = total sample size, nj and Rj are the sample size and the sum
of ranks of the jth group
• Decision Criteria: Reject the null hypothesis if H > critical value