The chi-square test is used to determine if an observed frequency distribution differs from an expected theoretical distribution. It can test goodness of fit, independence of attributes, and homogeneity. The test involves calculating chi-square by taking the sum of the squares of the differences between observed and expected frequencies divided by expected frequencies. For the test to be valid, certain conditions must be met regarding sample size, expected frequencies, independence, and randomness. The test has some limitations such as not measuring strength of association and being unreliable with small expected frequencies.
Assumptions of parametric and non-parametric tests
Testing the assumption of normality
Commonly used non-parametric tests
Applying tests in SPSS
Advantages of non-parametric tests
Limitations
The error (or disturbance) of an observed value is the deviation of the observed value from the (unobservable) true value of a quantity of interest (for example, a population mean), and the residual of an observed value is the difference between the observed value and the estimated value of the quantity of interest (for example, a sample mean).
Suppose there is a series of observations from a univariate distribution and we want to estimate the mean of that distribution (the so-called location model). In this case, the errors are the deviations of the observations from the population mean, while the residuals are the deviations of the observations from the sample mean.
A statistical error (or disturbance) is the amount by which an observation differs from its expected value, the latter being based on the whole population from which the statistical unit was chosen randomly. For example, if the mean height in a population of 21-year-old men is 1.75 meters, and one randomly chosen man is 1.80 meters tall, then the "error" is 0.05 meters; if the randomly chosen man is 1.70 meters tall, then the "error" is −0.05 meters. The expected value, being the mean of the entire population, is typically not observable, and hence the statistical error cannot be observed either.
It is most useful for the students of BBA for the subject of "Data Analysis and Modeling"/
It has covered the content of chapter- Data regression Model
Visit for more on www.ramkumarshah.com.np/
Assumptions of parametric and non-parametric tests
Testing the assumption of normality
Commonly used non-parametric tests
Applying tests in SPSS
Advantages of non-parametric tests
Limitations
The error (or disturbance) of an observed value is the deviation of the observed value from the (unobservable) true value of a quantity of interest (for example, a population mean), and the residual of an observed value is the difference between the observed value and the estimated value of the quantity of interest (for example, a sample mean).
Suppose there is a series of observations from a univariate distribution and we want to estimate the mean of that distribution (the so-called location model). In this case, the errors are the deviations of the observations from the population mean, while the residuals are the deviations of the observations from the sample mean.
A statistical error (or disturbance) is the amount by which an observation differs from its expected value, the latter being based on the whole population from which the statistical unit was chosen randomly. For example, if the mean height in a population of 21-year-old men is 1.75 meters, and one randomly chosen man is 1.80 meters tall, then the "error" is 0.05 meters; if the randomly chosen man is 1.70 meters tall, then the "error" is −0.05 meters. The expected value, being the mean of the entire population, is typically not observable, and hence the statistical error cannot be observed either.
It is most useful for the students of BBA for the subject of "Data Analysis and Modeling"/
It has covered the content of chapter- Data regression Model
Visit for more on www.ramkumarshah.com.np/
The Indian Dental Academy is the Leader in continuing dental education , training dentists in all aspects of dentistry and offering a wide range of dental certified courses in different formats.
Indian dental academy provides dental crown & Bridge,rotary endodontics,fixed orthodontics,
Dental implants courses.for details pls visit www.indiandentalacademy.com ,or call
0091-9248678078
Functional matrix revisited /certified fixed orthodontic courses by Indian...Indian dental academy
The Indian Dental Academy is the Leader in continuing dental education , training dentists in all aspects of dentistry and offering a wide range of dental certified courses in different formats.
Indian dental academy provides dental crown & Bridge,rotary endodontics,fixed orthodontics,
Dental implants courses.for details pls visit www.indiandentalacademy.com ,or call
00919248678078
This test (as a non-parametric test) is based on frequencies and not on the parameters like mean and standard deviation.
The test is used for testing the hypothesis and is not useful for estimation.
This test possesses the additive property as has already been explained.
Please Subscribe to this Channel for more solutions and lectures
http://www.youtube.com/onlineteaching
Chapter 11: Goodness-of-Fit and Contingency Tables
11.2: Contingency Tables
Chi square test is a technique to determine the significant difference between observed and expected frequency.
After the calculation is done by substituting the values in chi square formula, the calculated value is compared to the tabulated value.
The result helps to state whether the given data supports null hypothesis or alternate hypothesis.
Chi square test is a non-parametric and was developed by Karl Pearson 1900.
36086 Topic Discussion3Number of Pages 2 (Double Spaced).docxrhetttrevannion
36086 Topic: Discussion3
Number of Pages: 2 (Double Spaced)
Number of sources: 1
Writing Style: APA
Type of document: Essay
Academic Level:Master
Category: Psychology
Language Style: English (U.S.)
Order Instructions: Attached
I will upload the instructions
Reference/Module
Learning Objectives
•Explain what the x2 goodness-of-fit test is and what it does.
•Calculate a x2 goodness-of-fit test.
•List the assumptions of the x2 goodness-of-fit test.
•Calculate the x2 test of independence.
•Interpret the x2 test of independence.
•Explain the assumptions of the x2 test of independence.
The Chi-Square (x2) Goodness-of-Fit test: What It Is and What It Does
The chi-square (x2) goodness-of-fit test is used for comparing categorical information against what we would expect based on previous knowledge. As such, it tests what are called observed frequencies (the frequency with which participants fall into a category) against expected frequencies (the frequency expected in a category if the sample data represent the population). It is a nondirectional test, meaning that the alternative hypothesis is neither one-tailed nor two-tailed. The alternative hypothesis for a x2 goodness-of-fit test is that the observed data do not fit the expected frequencies for the population, and the null hypothesis is that they do fit the expected frequencies for the population. There is no conventional way to write these hypotheses in symbols, as we have done with the previous statistical tests. To illustrate the x2 goodness-of-fit test, let's look at a situation in which its use would be appropriate.
chi-square (x2) goodness-of-fit test A nonparametric inferential procedure that determines how well an observed frequency distribution fits an expected distribution.
observed frequencies The frequency with which participants fall into a category.
expected frequencies The frequency expected in a category if the sample data represent the population.
Calculations for the x2 Goodness-of-Fit Test
Suppose that a researcher is interested in determining whether the teenage pregnancy rate at a particular high school is different from the rate statewide. Assume that the rate statewide is 17%. A random sample of 80 female students is selected from the target high school. Seven of the students are either pregnant now or have been pregnant previously. The χ2goodness-of-fit test measures the observed frequencies against the expected frequencies. The observed and expected frequencies are presented in Table 21.1.
TABLE 21.1Observed and expected frequencies for χ2 goodness-of-fit example
FREQUENCIES
PREGNANT
NOT PREGNANT
Observed
7
73
Expected
14
66
As can be seen in the table, the observed frequencies represent the number of high school females in the sample of 80 who were pregnant versus not pregnant. The expected frequencies represent what we would expect based on chance, given what is known about the population. In this case, we would expect 17% of the females to be pregnant .
The ppt cover General Introduction to the topic,
Description of CHI-SQUARE TEST, Contingency table, Degree of Freedom, Determination of Chi – square test, Assumption for validity of chi - square test, Characteristics , Applications, Limitations
2. CONTENTS:
•IMPORTANT TERMS
•INTRODUCTION
•CHARACTERISTICS OF THE TEST
•CHI SQUARE DISTRIBUTION
•APPLICATIONS OF CHI SQUARE TEST
•CALCULATION OF THE CHI SQUARE
•CONDITION FOR THE APPLICATION OF THE TEST
•EXAMPLE
•YATE’S CORRECTION FOR CONTINUITY
•LIMITATIONS OF THE TEST.
3. IMPORTANT TERMS
1) PARAMETRIC TEST: The test in which, the population
constants like mean,std deviation, std error, correlation
coefficient, proportion etc. and data tend to follow one
assumed or established distribution such as normal,
binomial, poisson etc.
2) NON PARAMETRIC TEST: the test in which no constant of a
population is used. Data do not follow any specific
distribution and no assumption are made in these tests. E.g.
to classify good, better and best we just allocate arbitrary
numbers or marks to each category.
3) HYPOTHESIS: It is a definite statement about the population
parameters.
4. 4) NULL HYPOTHESIS: (H0) states that no association exists
between the two cross-tabulated variables in the population,
and therefore the variables are statistically independent. E.g.
if we want to compare 2 methods method A and method B for
its superiority, and if the assumption is that both methods are
equally good, then this assumption is called as NULL
HYPOTHESIS.
5) ALTERNATIVE HYPOTHESIS: (H1) proposes that the two
variables are related in the population. If we assume that
from 2 methods, method A is superior than method B, then
this assumption is called as ALTERNATIVE HYPOTHESIS.
5. 6) DEGREE OF FREEDOM: It denotes the extent of
independence (freedom) enjoyed by a given set of observed
frequencies Suppose we are given a set of n observed
frequencies which are subjected to k independent
constraints(restrictions) then,
d.f. = (number of frequencies) – (number of independent
constraints on them)
In other terms,
df = (r – 1)(c – 1)
where
r = the number of rows
c = the number of columns
7) CONTINGENCY TABLE: When the table is prepared by
enumeration of qualitative data by entering the actual
frequencies, and if that table represents occurance of two
sets of events, that table is called the contingency table.
(Latin, con- together, tangere- to touch). It is also called as
an association table.
6. INTRODUCTION
The chi-square test is an important test amongst the several
tests of significance developed by statisticians.
Is was developed by Karl Pearson in1900.
CHI SQUARE TEST is a non parametric test not based on any
assumption or distribution of any variable.
This statistical test follows a specific distribution known as chi
square distribution.
In general The test we use to measure the differences between
what is observed and what is expected according to an
assumed hypothesis is called the chi-square test.
7. IMPORTANT CHARACTERISTICS OF A CHI
SQUARE TEST
This test (as a non-parametric test) is based on
frequencies and not on the parameters like mean and
standard deviation.
The test is used for testing the hypothesis and is not
useful for estimation.
This test can also be applied to a complex contingency
table with several classes and as such is a very useful
test in research work.
This test is an important non-parametric test as no rigid
assumptions are necessary in regard to the type of
population, no need of parameter values and relatively
less mathematical details are involved.
8. CHI SQUARE DISTRIBUTION:
If X1, X2,….Xn are independent normal variates and each is
distributed normally with mean zero and standard deviation
unity, then X12+X22+……+Xn2= ∑ Xi2 is distributed as chi square (c2
)with n degrees of freedom (d.f.) where n is large. The chi square
curve for d.f. N=1,5 and 9 is as follows.
9. If degree of freedom > 2 : Distribution is bell shaped
If degree of freedom = 2 : Distribution is L shaped with
maximum ordinate at zero
If degree of freedom <2 (>0) : Distribution L shaped with
infinite ordinate at the origin.
10. APPLICATIONS OF A CHI SQUARE TEST.
This test can be used in
1) Goodness of fit of distributions
2) test of independence of attributes
3) test of homogenity.
11. 1) TEST OF GOODNESS OF FIT OF DISTRIBUTIONS:
This test enables us to see how well does the assumed
theoretical distribution (such as Binomial distribution,
Poisson distribution or Normal distribution) fit to the
observed data.
The c2 test formula for goodness of fit is:
(o e)
2
2
e
Where,
o = observed frequency
e = expected frequency
If c2 (calculated) > c2 (tabulated), with (n-1) d.f, then null
hypothesis is rejected otherwise accepted.
And if null hypothesis is accepted, then it can be concluded
that the given distribution follows theoretical distribution.
12. 2) TEST OF INDEPENDENCE OF ATTRIBUTES
Test enables us to explain whether or not two attributes are
associated.
For instance, we may be interested in knowing whether a new
medicine is effective in controlling fever or not, c2 test is
useful.
In such a situation, we proceed with the null hypothesis that
the two attributes (viz., new medicine and control of fever) are
independent which means that new medicine is not effective
in controlling fever.
c2 (calculated) > c2 (tabulated) at a certain level of
significancfe for given degrees of freedom, the null hypothesis
is rejected, i.e. two variables are dependent.(i.e., the new
medicine is effective in controlling the fever) and if, c2
(calculated) <c2 (tabulated) ,the null hypothesis is accepted,
i.e. 2 variables are independent.(i.e., the new medicine is not
effective in controlling the fever).
when null hypothesis is rejected, it can be concluded that there is
a significant association between two attributes.
13. 3) TEST OF HOMOGENITY
This test can also be used to test whether the occurance of
events follow uniformity or not e.g. the admission of
patients in government hospital in all days of week is
uniform or not can be tested with the help of chi square
test.
c2 (calculated) < c2 (tabulated), then null hypothesis is
accepted, and it can be concluded that there is a uniformity
in the occurance of the events. (uniformity in the admission
of patients through out the week)
14. CALCULATION OF CHI SQUARE
(o e)
2
2
e
Where,
O = observed frequency
E = expected frequency
If two distributions (observed and theoretical) are exactly alike,
c2 = 0; (but generally due to sampling errors, c2 is not equal to
zero)
15. STEPS INVOLVED IN CALCULATING c2
1) Calculate the expected frequencies and the observed
frequencies:
Expected frequencies fe : the cell frequencies that would be
expected in a contingency table if the two variables were
statistically independent.
Observed frequencies fo: the cell frequencies actually
observed in a contingency table.
fe = (column total)(row total)
N
To obtain the expected frequencies for any cell in any cross-
tabulation in which the two variables are assumed
independent, multiply the row and column totals for that cell
and divide the product by the total number of cases in the
table.
16. 2) Then c2 is calculated as follows:
( fe fo )
2
c
2
fe
17. CONDITIONS FOR THE APPLICATION OF c2
TEST
The following conditions should be satisfied before X2 test can
be applied:
1) The data must be in the form of frequencies
2) The frequency data must have a precise numerical value and
must be organised into categories or groups.
3) Observations recorded and used are collected on a random
basis.
4) All the itmes in the sample must be independent.
5) No group should contain very few items, say less than 10. In
case where the frequencies are less than 10, regrouping is
done by combining the frequencies of adjoining groups so
that the new frequencies become greater than 10. (Some
statisticians take this number as 5, but 10 is regarded as
better by most of the statisticians.)
6) The overall number of items must also be reasonably large.
It should normally be at least 50.
20. YATE’S CORRECTION
If in the 2*2 contingency table, the expected frequencies are small
say less than 5, then c2 test can’t be used. In that case, the direct
formula of the chi square test is modified and given by Yate’s
correction for continuity
R1R2C1C2
21. LIMITATIONS OF A CHI SQUARE TEST
1) The data is from a random sample.
2) This test applied in a four fould table, will not give a reliable
result with one degree of freedom if the expected value in any
cell is less than 5.
in such case, Yate’s correction is necessry. i.e. reduction of
the mode of (o – e) by half.
3) Even if Yate’s correction, the test may be misleading if any
expected frequency is much below 5. in that case another
appropriate test should be applied.
4) In contingency tables larger than 2*2, Yate’s correction
cannot be applied.
5) Interprit this test with caution if sample total or total of
values in all the cells is less than 50.
22. 6) This test tells the presence or absence of an association
between the events but doesn’t measure the strength of
association.
7) This test doesn’t indicate the cause and effect, it only tells
the probability of occurance of association by chance.
8) the test is to be applied only when the individual
observations of sample are independent which means that
the occurrence of one individual observation
(event) has no effect upon the occurrence of any other
observation (event) in the sample under consideration.