SlideShare a Scribd company logo
1 of 56
Chi Square distribution and
Analysis of frequency
Dr Inn Kynn Khaing
Lecturer, Department of Biostatistics
M.B.,B.S, MPH, MSc (Healthcare Administration) (Japan)
Outline
 Introduction
 Mathematical properties of the chi-square distribution
 Tests of Goodness-of-fit
 Tests of Independence
 Tests of Homogeneity
 Fisher exact test
 McNemar’s test
2
 Most frequently employed statistical technique for the analysis
of count or frequency data
 E.g. a sample of hospitalized patients how many are male and
how many are female
 E.g. the type of insurance coverage differs according to gender.
 E.g. There is a relationship between area of residence and
diagnosis.
3
 The chi-square distribution may be derived from normal distributions
Chi-square distribution with 1 degree of freedom
Mathematical form of the chi-square distribution
4
5
 Chi-square distribution : in testing hypotheses where the data available
for analysis are in the form of frequencies.
 tests of goodness-of-fit, tests of independence, and tests of homogeneity…
 Goodness-of-fit tests: they test the goodness-of-fit of observed
frequencies to frequencies that one would expect if the data were
generated under some particular theory or hypothesis.
 It refers to a comparison of a sample distribution to some theoretical
distribution that it is assumed describes the population from which the
sample came.
 Karl Pearson, who showed that the chi-square distribution may be used as
a test of the agreement between observation and hypothesis whenever
the data are in the form of frequencies. 6
 Observed frequencies - the number of subjects or objects in our sample
that fall into the various categories of the variable of interest.
For example, if we have a sample of 100 hospital patients, we may
observe that 50 are married, 30 are single, 15 are widowed, and 5 are
divorced.
 Expected frequencies - the number of subjects or objects in our sample
that we would expect to observe if some null hypothesis about the
variable is true.
For example, our null hypothesis might be that the four categories of
marital status are equally represented in the population from which we
drew our sample. In that case we would expect our sample to contain 25
married, 25 single, 25 widowed, and 25 divorced patients. 7
 When the null hypothesis is true, X2 is distributed approximately as x2 with
k- r degrees of freedom.
 k = the number of groups for which observed and expected frequencies are
available
 r = the number of restrictions or constraints imposed on the given
comparison.
 The nature of X2 is such that when there is close agreement between
observed and expected frequencies, it is small,
 When the agreement is poor, it is large.
 There is perfect agreement between the observed frequencies and the
frequencies that one would expect, given that H0 is true, Oi - Ei = 0
8
 The quantity ∑ [ (Oi - Ei )2/ Ei ] will be small if the observed and expected
frequencies are close together and will be large if the differences are
large.
 The computed value of X2 is compared with the tabulated value of x2 with
k – r degrees of freedom.
 The decision rule, then, is: Reject H0 if X2 is greater than or equal to the
tabulated x2 for the chosen value of a.
9
 A goodness-of-fit test is appropriate when one wishes to decide
if an observed distribution of frequencies is incompatible with
some preconceived or hypothesized distribution.
 In other words, one determines if actual numbers are similar
to the expected or theoretical numbers i.e. goodness of fit to a
theory.
10
Cranor and Christensen (A-1) conducted a study to assess short-term clinical,
economic, and humanistic outcomes of pharmaceutical care services for
patients with diabetes in community pharmacies. For 47 of the subjects in the
study, cholesterol levels are summarized in Table.
Cholesterol Level (mg/dl) Number of Subjects
100.0–124.9 1
125.0–149.9 3
150.0–174.9 8
175.0–199.9 18
200.0–224.9 6
225.0–249.9 4
250.0–274.9 4
275.0–299.9 3 11
Solution:
1. Data. See Table
2. Assumptions. We assume that the sample available for analysis is a
simple random sample.
3. Hypotheses.
H0: In the population from which the sample was drawn, cholesterol
levels are normally distributed.
HA: The sampled population is not normally distributed.
4. Test statistic. The test statistic is
5. Distribution of test statistic. If H0 is true, the test statistic is distributed
approximately as chi-square with k-r degrees of freedom. The appropriate
degrees of freedom are 8 (the number of groups or class intervals) - 3 (for
the three restrictions: making ∑Ei = ∑Oi, and estimating μ and σ from the
sample data) = 5. 12
Solution:
6. Decision rule. We will reject H0 if the computed value of X2 is equal to
or greater than the critical value of chi-square.
7. Calculation of test statistic.
13
14
Solution:
15
Solution:
16
17
Solution:
8. Statistical decision. When we compare X2 = 10.566 with values of X2 in
Appendix Table F, we see that it is less than x2
.95 = 11.070, so that we
cannot reject the null hypothesis that the sample came from a normally
distributed population.
9. Conclusion. We conclude that in the sampled population, cholesterol levels
may follow a normal distribution.
10. p value. Since 11.070 > 10.566 > 9.236, .05 < p < .10. In other words, the
probability of obtaining a value of X2 as large as 10.566, when the null
hypothesis is true, is between .05 and .10.
We conclude that such an event is not sufficiently rare to reject the null
hypothesis that the data come from a normal distribution. 18
In a study designed to determine patient acceptance of a new pain
reliever, 100 physicians each selected a sample of 25 patients to
participate in the study. Each patient, after trying the new pain reliever for
a specified period of time, was asked whether it was preferable to the pain
reliever used regularly in the past.
We are interested in determining whether or not these data are
compatible with the hypothesis that they were drawn from a population
that follows a binomial distribution.
19
20
Solution:
A total of 500 patients out of the 2500 patients participating in the study
said they preferred the new pain reliever, so that our point estimate of p is
ṗ = 500/2500 = 0.20.
The expected relative frequencies can be obtained by evaluating the
binomial function for x = 0, 1, . . ., 25.
21
22
The first expected frequency is less than 1, so that we follow
Cochran’s suggestion and combine this group with the second group.
When we do this, all the expected frequencies are greater than 1.
23
Solution:
From the data, we compute
The appropriate degrees of freedom are 10 less 2, or 8. One degree of
freedom is lost because we force the total of the expected frequencies to equal
the total observed frequencies, and one degree of freedom is sacrificed
because we estimated p from the sample data.
We compare our computed X2 with the tabulated x2 with 8 degrees of
freedom and find that it is significant at the .005 level of significance; p < .005.
We reject the null hypothesis that the data came from a binomial
distribution. 24
A hospital administrator wishes to test the null hypothesis that
emergency admissions follow Poisson distribution with λ= 3. Suppose
that over a period of 90 days the numbers of emergency admissions
were as shown in Table.
25
26
Solution:
First expected relative frequency is obtained by evaluating
Each of the expected relative frequencies is multiplied by 90 to obtain the
corresponding expected frequencies.
Since the parameter, λ, was specified in the null hypothesis, we
do not lose a degree of freedom for reasons of estimation, so that the
appropriate degrees of freedom are 9 – 1 = 8.
27
28
We also note that the last three expected frequencies are less than
1, so that they must be combined to avoid having any expected
frequencies less than 1.
Solution:
we find that the critical value of x2 for 8 degrees of freedom α= .05 is
15.507, so that we cannot reject the null hypothesis at the .05 level, or for
that matter any reasonable level, of significance (p > .10).
Therefore, we conclude that emergency admissions at this hospital may
follow a Poisson distribution with λ= 3.
29
See in Text
30
 Another the most frequent use of the chi-square distribution
 To test the null hypothesis that two criteria of classification, when
applied to the same set of entities, are independent.
 Two criteria of classification are independent if the distribution of one
criterion is the same no matter what the distribution of the other
criterion
 For example, if socioeconomic status and area of residence of the
inhabitants of a certain city are independent, we would expect to find
the same proportion of families in the low, medium, and high
socioeconomic groups in all areas of the city.
31
Contingency Table
 The classification, according to two criteria, of a set of entities, say,
people, can be shown by a table in which the r rows represent the various
levels of one criterion of classification and the c columns represent the
various levels of the second criterion.
 Such a table is generally called a contingency table, with dimension r x c.
the two criteria of
classification are not
independent
32
In 1992, the U.S. Public Health Service and the Centers for Disease Control and
Prevention recommended that all women of childbearing age consume 400mg
of folic acid daily to reduce the risk of having a pregnancy that is affected by a
neural tube defect such as spina bifida or anencephaly. 693 pregnant women
called a teratology information service about their use of folic acid
supplementation. The researchers wished to determine if preconceptional use
of folic acid and race are independent.
33
Solution:
1. Data. See Table
2. Assumptions. We assume that the sample available for analysis is equivalent
to a simple random sample drawn from the population of interest.
3. Hypotheses.
H0: Race and preconceptional use of folic acid are independent.
HA: The two variables are not independent.
Let α=.05.
4. Test statistic. The test statistic is
5. Distribution of test statistic. When H0 is true, X2 is distributed approximately
as x2 with (r-1) (c-1) = (3-1) (2-1) = 2 x 1= 2 degrees of freedom.
6. Decision rule. Reject H0 if the computed value of X2 is equal to or greater
than 5.991.
34
Solution:
7. Calculation of test statistic. The expected frequency for the first cell is
(559 x 282)/636 = 247.86.
35
Solution:
8. Statistical decision. We reject H0 since 9.08960 > 5.991.
9. Conclusion. We conclude that H0 is false, and that there is a relationship
between race and preconceptional use of folic acid.
10. p value. Since 7.378 < 9.08960 < 9.210, .01 < p < .025.
36
Small Expected Frequencies
 The problem of small expected frequencies may be encountered when
analyzing the data of contingency tables.
 Although there is a lack of consensus on how to handle this problem, many
authors currently follow the rule given by Cochran. He suggests that for
contingency tables with more than 1 degree of freedom a minimum
expectation of 1 is allowable if no more than 20 percent of the cells have
expected frequencies of less than 5.
 To meet this rule, adjacent rows and/or adjacent columns may be combined
when to do so is logical in light of other considerations.
37
 The 2 x 2 Contingency Table Sometimes each of two criteria of classification
may be broken down into only two categories, or levels.
 When data are cross classified in this manner, the result is a contingency
table consisting of two rows and two columns. Such a table is commonly
referred to as a 2x2 table.
 The value of X2 may be computed by first calculating the expected cell
frequencies. In the case of a 2x2 contingency table, however, X2 may be
calculated by the following shortcut formula:
38
According to Silver and Aiello (A-4), falls are of major concern among polio
survivors. Researchers wanted to determine the impact of a fall on lifestyle
changes. Table shows the results of a study of 233 polio survivors on whether
fear of falling resulted in lifestyle changes.
39
Solution:
1. Data. From the information given we may construct the 2x2 contingency table
2. Assumptions. We assume that the sample is equivalent to a simple random
sample.
3. Hypotheses.
H0: Fall status and lifestyle change because of fear of falling are independent.
H1: The two variables are not independent.
Let α=.05.
4. Test statistic. The test statistic is
40
Solution:
5. Distribution of test statistic. When H0 is true, X2 is distributed approximately as
x2 with (r-1) (c-1) = (2-1) (2-1) = 1 x 1 =1 degree of freedom.
6. Decision rule. Reject H0 if the computed value of X2 is equal to or > than 3.841.
7. Calculation of test statistic. By Equation
8. Statistical decision. We reject H0 since 31.7391 > 3.841.
9. Conclusion. We conclude that H0 is false, and that there is a relationship
between experiencing a fall and changing one’s lifestyle because of fear of
falling.
10. p value. Since 31.7391 > 7.879, p < .005.
41
Tests of Independence - Characteristics
1. A single sample is selected from a population of interest, and the subjects
or objects are cross-classified on the basis of the two variables of interest.
2. The rationale for calculating expected cell frequencies is based on the
probability law, which states that if two events (here the two criteria of
classification) are independent, the probability of their joint occurrence is
equal to the product of their individual probabilities.
3. The hypotheses and conclusions are stated in terms of the independence
(or lack of independence) of two variables.
44
 one set of marginal totals is said to be fixed, while the other set,
corresponding to the criterion of classification applied to the samples, is
random
 The situation leads to a chi-square test of homogeneity
 The two situations not only involve different sampling procedures; they
lead to different questions and null hypotheses
 The homogeneity test is concerned with the question “Are the samples
drawn from populations that are homogeneous with respect to some
criterion of classification?”
45
Narcolepsy is a disease involving disturbances of the sleep–wake cycle.
Members of the German Migraine and Headache Society studied the
relationship between migraine headaches in 96 subjects diagnosed with
narcolepsy and 96 healthy controls. The results are shown in Table. We wish
to know if we may conclude, on the basis of these data, that the narcolepsy
population and healthy populations represented by the samples are not
homogeneous with respect to migraine frequency.
46
Solution:
1. Data. See Table
2. Assumptions. We assume that we have a simple random sample from each
of the two populations of interest.
3. Hypotheses.
H0: The two populations are homogeneous with respect to migraine
frequency.
HA: The two populations are not homogeneous with respect to migraine
frequency.
Let α = .05.
4. Test statistic. The test statistic is
5. Distribution of test statistic. If H0 is true, X2 is distributed approximately
as x2 with (2-1) (2-1) = 1 x 1 = 1 degree of freedom.
47
Solution:
6. Decision rule. Reject H0 if the computed value of X2 is equal to or greater
than 3.841.
7. Calculation of test statistic.
Subject
Reported
migraine
headache
Observed
Frequency
(Oi)
Expected
Frequency
(Ei)
Oi - Ei
(Oi – Ei )2 (Oi – Ei )2/Ei
Narcoleptic
subjects
Yes 21
No 75
Healthy
controls
Yes 19
No 77
Total 192
48
Reported Migraine Headaches
Yes No Total
Narcoleptic subjects 20 76 96
Healthy controls 20 76 96
Total 40 152 192
49
Solution:
6. Decision rule. Reject H0 if the computed value of X2 is equal to or greater
than 3.841.
7. Calculation of test statistic.
Subject
Reported
migraine
headache
Observed
Frequency
(Oi)
Expected
Frequency
(Ei)
Oi - Ei
(Oi – Ei )2 (Oi – Ei )2/Ei
Narcoleptic
subjects
Yes 21 20
No 75 76
Healthy
controls
Yes 19 20
No 77 76
Total 192 192
50
Solution:
6. Decision rule. Reject H0 if the computed value of X2 is equal to or greater
than 3.841.
7. Calculation of test statistic.
Subject
Reported
migraine
headache
Observed
Frequency
(Oi)
Expected
Frequency
(Ei)
Oi - Ei
(Oi – Ei )2 (Oi – Ei )2/Ei
Narcoleptic
subjects
Yes 21 20 -1
No 75 76 1
Healthy
controls
Yes 19 20 -1
No 77 76 1
Total 192 192
51
Solution:
6. Decision rule. Reject H0 if the computed value of X2 is equal to or greater
than 3.841.
7. Calculation of test statistic.
Subject
Reported
migraine
headache
Observed
Frequency
(Oi)
Expected
Frequency
(Ei)
Oi - Ei
(Oi – Ei )2 (Oi – Ei )2/Ei
Narcoleptic
subjects
Yes 21 20 -1 1
No 75 76 1 1
Healthy
controls
Yes 19 20 -1 1
No 77 76 1 1
Total 192 192
52
Solution:
6. Decision rule. Reject H0 if the computed value of X2 is equal to or greater
than 3.841.
7. Calculation of test statistic.
Subject
Reported
migraine
headache
Observed
Frequency
(Oi)
Expected
Frequency
(Ei)
Oi - Ei
(Oi – Ei )2 (Oi – Ei )2/Ei
Narcoleptic
subjects
Yes 21 20 -1 1 0.05
No 75 76 1 1 0.0132
Healthy
controls
Yes 19 20 -1 1 0.05
No 77 76 1 1 0.0132
Total 192 192 0.1263
53
Solution:
8. Statistical decision. Since .126 is less than the critical value of 3.841, we are
unable to reject the null hypothesis.
9. Conclusion. We conclude that the two populations may be homogeneous
with respect to migraine frequency.
10. p value. p = .722.
54
Tests of Homogeneity - Characteristics
1. Two or more populations are identified in advance, and an independent
sample is drawn from each.
2. Sample subjects or objects are placed in appropriate categories of the
variable of interest.
3. The calculation of expected cell frequencies is based on the rationale that
if the populations are homogeneous as stated in the null hypothesis, the
best estimate of the probability that a subject or object will fall into a
particular category of the variable of interest can be obtained by pooling
the sample data.
4. The hypotheses and conclusions are stated in terms of homogeneity (with
respect to the variable of interest) of populations.
55
Tests of Homogeneity
 It assesses the significance of difference in two or more than two proportions.
i.e, testing the homogenous distribution of particular characteristics among
groups.
 It is concerned with the question “Are the sample drawn from population that
are homogenous with respect to some criteria of classification?”
Tests of Independence
 It tests the association between two events in binomial or multinomial
samples. For example, if socio-economic status and area of residence of
inhabitants of a certain city are independent, we would expect to find the
same proportions of families in the low, medium and high socio-economic
groups in all areas of the city.
 It is concerned with the question “Are the two criteria of classification
independent?” 56
 Although difference in concept and sampling procedures, the two tests are
mathematically identical.
57
58

More Related Content

What's hot

Introduction to Statistics and Probability
Introduction to Statistics and ProbabilityIntroduction to Statistics and Probability
Introduction to Statistics and ProbabilityBhavana Singh
 
Point and Interval Estimation
Point and Interval EstimationPoint and Interval Estimation
Point and Interval EstimationShubham Mehta
 
Mat 255 chapter 3 notes
Mat 255 chapter 3 notesMat 255 chapter 3 notes
Mat 255 chapter 3 notesadrushle
 
Ch4 Confidence Interval
Ch4 Confidence IntervalCh4 Confidence Interval
Ch4 Confidence IntervalFarhan Alfin
 
Statistical Estimation
Statistical Estimation Statistical Estimation
Statistical Estimation Remyagharishs
 
4 2 continuous probability distributionn
4 2 continuous probability    distributionn4 2 continuous probability    distributionn
4 2 continuous probability distributionnLama K Banna
 
Random variables and probability distributions
Random variables and probability distributionsRandom variables and probability distributions
Random variables and probability distributionsAntonio F. Balatar Jr.
 
Introduction to probability distributions-Statistics and probability analysis
Introduction to probability distributions-Statistics and probability analysis Introduction to probability distributions-Statistics and probability analysis
Introduction to probability distributions-Statistics and probability analysis Vijay Hemmadi
 
L10 confidence intervals
L10 confidence intervalsL10 confidence intervals
L10 confidence intervalsLayal Fahad
 
Basic concept of probability
Basic concept of probabilityBasic concept of probability
Basic concept of probabilityIkhlas Rahman
 
F Distribution
F  DistributionF  Distribution
F Distributionjravish
 
Inferential statistics powerpoint
Inferential statistics powerpointInferential statistics powerpoint
Inferential statistics powerpointkellula
 

What's hot (20)

Introduction to Statistics and Probability
Introduction to Statistics and ProbabilityIntroduction to Statistics and Probability
Introduction to Statistics and Probability
 
Point and Interval Estimation
Point and Interval EstimationPoint and Interval Estimation
Point and Interval Estimation
 
Mat 255 chapter 3 notes
Mat 255 chapter 3 notesMat 255 chapter 3 notes
Mat 255 chapter 3 notes
 
Ch4 Confidence Interval
Ch4 Confidence IntervalCh4 Confidence Interval
Ch4 Confidence Interval
 
The Standard Normal Distribution
The Standard Normal Distribution  The Standard Normal Distribution
The Standard Normal Distribution
 
Statistical Estimation
Statistical Estimation Statistical Estimation
Statistical Estimation
 
4 2 continuous probability distributionn
4 2 continuous probability    distributionn4 2 continuous probability    distributionn
4 2 continuous probability distributionn
 
Random variables and probability distributions
Random variables and probability distributionsRandom variables and probability distributions
Random variables and probability distributions
 
Introduction to probability distributions-Statistics and probability analysis
Introduction to probability distributions-Statistics and probability analysis Introduction to probability distributions-Statistics and probability analysis
Introduction to probability distributions-Statistics and probability analysis
 
Point Estimation
Point EstimationPoint Estimation
Point Estimation
 
L10 confidence intervals
L10 confidence intervalsL10 confidence intervals
L10 confidence intervals
 
Basic concept of probability
Basic concept of probabilityBasic concept of probability
Basic concept of probability
 
Parameter estimation
Parameter estimationParameter estimation
Parameter estimation
 
Sampling & Sampling Distribtutions
Sampling & Sampling DistribtutionsSampling & Sampling Distribtutions
Sampling & Sampling Distribtutions
 
F Distribution
F  DistributionF  Distribution
F Distribution
 
Confidence interval
Confidence intervalConfidence interval
Confidence interval
 
Discrete probability distributions
Discrete probability distributionsDiscrete probability distributions
Discrete probability distributions
 
Inferential statistics powerpoint
Inferential statistics powerpointInferential statistics powerpoint
Inferential statistics powerpoint
 
Binary Logistic Regression
Binary Logistic RegressionBinary Logistic Regression
Binary Logistic Regression
 
3 measure of dispersion
3 measure of dispersion3 measure of dispersion
3 measure of dispersion
 

Similar to Chi square(hospital admin) A

Parmetric and non parametric statistical test in clinical trails
Parmetric and non parametric statistical test in clinical trailsParmetric and non parametric statistical test in clinical trails
Parmetric and non parametric statistical test in clinical trailsVinod Pagidipalli
 
Categorical data analysis.pptx
Categorical data analysis.pptxCategorical data analysis.pptx
Categorical data analysis.pptxBegashaw3
 
slides Testing of hypothesis.pptx
slides Testing  of  hypothesis.pptxslides Testing  of  hypothesis.pptx
slides Testing of hypothesis.pptxssuser504dda
 
Running head COURSE PROJECT NCLEX Memorial Hospital .docx
Running head COURSE PROJECT NCLEX Memorial Hospital             .docxRunning head COURSE PROJECT NCLEX Memorial Hospital             .docx
Running head COURSE PROJECT NCLEX Memorial Hospital .docxsusanschei
 
36086 Topic Discussion3Number of Pages 2 (Double Spaced).docx
36086 Topic Discussion3Number of Pages 2 (Double Spaced).docx36086 Topic Discussion3Number of Pages 2 (Double Spaced).docx
36086 Topic Discussion3Number of Pages 2 (Double Spaced).docxrhetttrevannion
 
Chi-square IMP.ppt
Chi-square IMP.pptChi-square IMP.ppt
Chi-square IMP.pptShivraj Nile
 
Chapter 3 Confidence Interval Revby Rao
Chapter 3 Confidence Interval Revby RaoChapter 3 Confidence Interval Revby Rao
Chapter 3 Confidence Interval Revby RaoSumit Prajapati
 
Hypothesis testing: A single sample test
Hypothesis testing: A single sample testHypothesis testing: A single sample test
Hypothesis testing: A single sample testUmme Salma Tuli
 
Parametric & non parametric
Parametric & non parametricParametric & non parametric
Parametric & non parametricANCYBS
 
inferentialstatistics-210411214248.pdf
inferentialstatistics-210411214248.pdfinferentialstatistics-210411214248.pdf
inferentialstatistics-210411214248.pdfChenPalaruan
 
Chapter 2 Probabilty And Distribution
Chapter 2 Probabilty And DistributionChapter 2 Probabilty And Distribution
Chapter 2 Probabilty And Distributionghalan
 
Chapter 3 Confidence Interval
Chapter 3 Confidence IntervalChapter 3 Confidence Interval
Chapter 3 Confidence Intervalghalan
 
Chi squared test
Chi squared testChi squared test
Chi squared testDhruv Patel
 

Similar to Chi square(hospital admin) A (20)

Parmetric and non parametric statistical test in clinical trails
Parmetric and non parametric statistical test in clinical trailsParmetric and non parametric statistical test in clinical trails
Parmetric and non parametric statistical test in clinical trails
 
Categorical data analysis.pptx
Categorical data analysis.pptxCategorical data analysis.pptx
Categorical data analysis.pptx
 
slides Testing of hypothesis.pptx
slides Testing  of  hypothesis.pptxslides Testing  of  hypothesis.pptx
slides Testing of hypothesis.pptx
 
Running head COURSE PROJECT NCLEX Memorial Hospital .docx
Running head COURSE PROJECT NCLEX Memorial Hospital             .docxRunning head COURSE PROJECT NCLEX Memorial Hospital             .docx
Running head COURSE PROJECT NCLEX Memorial Hospital .docx
 
Data analysis
Data analysis Data analysis
Data analysis
 
36086 Topic Discussion3Number of Pages 2 (Double Spaced).docx
36086 Topic Discussion3Number of Pages 2 (Double Spaced).docx36086 Topic Discussion3Number of Pages 2 (Double Spaced).docx
36086 Topic Discussion3Number of Pages 2 (Double Spaced).docx
 
Chi square test
Chi square testChi square test
Chi square test
 
Chi square
Chi square Chi square
Chi square
 
Chi-square IMP.ppt
Chi-square IMP.pptChi-square IMP.ppt
Chi-square IMP.ppt
 
Chapter 3 Confidence Interval Revby Rao
Chapter 3 Confidence Interval Revby RaoChapter 3 Confidence Interval Revby Rao
Chapter 3 Confidence Interval Revby Rao
 
Hypothesis testing: A single sample test
Hypothesis testing: A single sample testHypothesis testing: A single sample test
Hypothesis testing: A single sample test
 
Parametric & non parametric
Parametric & non parametricParametric & non parametric
Parametric & non parametric
 
Test of significance
Test of significanceTest of significance
Test of significance
 
inferentialstatistics-210411214248.pdf
inferentialstatistics-210411214248.pdfinferentialstatistics-210411214248.pdf
inferentialstatistics-210411214248.pdf
 
Inferential statistics
Inferential statisticsInferential statistics
Inferential statistics
 
Chapter 2 Probabilty And Distribution
Chapter 2 Probabilty And DistributionChapter 2 Probabilty And Distribution
Chapter 2 Probabilty And Distribution
 
Chapter 3 Confidence Interval
Chapter 3 Confidence IntervalChapter 3 Confidence Interval
Chapter 3 Confidence Interval
 
20170215 daniel gartner session 3_a1_kss_2016_paper_31
20170215 daniel gartner session 3_a1_kss_2016_paper_3120170215 daniel gartner session 3_a1_kss_2016_paper_31
20170215 daniel gartner session 3_a1_kss_2016_paper_31
 
Chi squared test
Chi squared testChi squared test
Chi squared test
 
Chi
ChiChi
Chi
 

More from Mmedsc Hahm

Solid waste-management-2858710
Solid waste-management-2858710Solid waste-management-2858710
Solid waste-management-2858710Mmedsc Hahm
 
Situation analysis
Situation analysisSituation analysis
Situation analysisMmedsc Hahm
 
Quantification of medicines need
Quantification of medicines needQuantification of medicines need
Quantification of medicines needMmedsc Hahm
 
Quality in hospital
Quality in hospitalQuality in hospital
Quality in hospitalMmedsc Hahm
 
Patient satisfaction &amp; quality in health care (16.3.2016) dr.nyunt nyunt wai
Patient satisfaction &amp; quality in health care (16.3.2016) dr.nyunt nyunt waiPatient satisfaction &amp; quality in health care (16.3.2016) dr.nyunt nyunt wai
Patient satisfaction &amp; quality in health care (16.3.2016) dr.nyunt nyunt waiMmedsc Hahm
 
Introduction to hahm 2017
Introduction to hahm 2017Introduction to hahm 2017
Introduction to hahm 2017Mmedsc Hahm
 
Hss lecture 2016 jan
Hss lecture 2016 janHss lecture 2016 jan
Hss lecture 2016 janMmedsc Hahm
 
Hospital management17
Hospital management17Hospital management17
Hospital management17Mmedsc Hahm
 
Health planning approaches hahm 17
Health planning approaches hahm 17Health planning approaches hahm 17
Health planning approaches hahm 17Mmedsc Hahm
 
Directing and leading 2017
Directing and leading 2017Directing and leading 2017
Directing and leading 2017Mmedsc Hahm
 
Access to medicines p pt 17 10-2015
Access to medicines p pt 17 10-2015Access to medicines p pt 17 10-2015
Access to medicines p pt 17 10-2015Mmedsc Hahm
 
The dynamics of disease transmission
The dynamics of disease transmissionThe dynamics of disease transmission
The dynamics of disease transmissionMmedsc Hahm
 
Study designs dr.wah
Study designs dr.wahStudy designs dr.wah
Study designs dr.wahMmedsc Hahm
 
Standardization dr.wah
Standardization dr.wahStandardization dr.wah
Standardization dr.wahMmedsc Hahm
 

More from Mmedsc Hahm (20)

Solid waste-management-2858710
Solid waste-management-2858710Solid waste-management-2858710
Solid waste-management-2858710
 
Situation analysis
Situation analysisSituation analysis
Situation analysis
 
Quantification of medicines need
Quantification of medicines needQuantification of medicines need
Quantification of medicines need
 
Quality in hospital
Quality in hospitalQuality in hospital
Quality in hospital
 
Patient satisfaction &amp; quality in health care (16.3.2016) dr.nyunt nyunt wai
Patient satisfaction &amp; quality in health care (16.3.2016) dr.nyunt nyunt waiPatient satisfaction &amp; quality in health care (16.3.2016) dr.nyunt nyunt wai
Patient satisfaction &amp; quality in health care (16.3.2016) dr.nyunt nyunt wai
 
Organising
OrganisingOrganising
Organising
 
Nscbl slide
Nscbl slideNscbl slide
Nscbl slide
 
Introduction to hahm 2017
Introduction to hahm 2017Introduction to hahm 2017
Introduction to hahm 2017
 
Hss lecture 2016 jan
Hss lecture 2016 janHss lecture 2016 jan
Hss lecture 2016 jan
 
Hospital management17
Hospital management17Hospital management17
Hospital management17
 
Hopital stat
Hopital statHopital stat
Hopital stat
 
Health planning approaches hahm 17
Health planning approaches hahm 17Health planning approaches hahm 17
Health planning approaches hahm 17
 
Ephs and nhp
Ephs and nhpEphs and nhp
Ephs and nhp
 
Directing and leading 2017
Directing and leading 2017Directing and leading 2017
Directing and leading 2017
 
Concepts of em
Concepts of emConcepts of em
Concepts of em
 
Access to medicines p pt 17 10-2015
Access to medicines p pt 17 10-2015Access to medicines p pt 17 10-2015
Access to medicines p pt 17 10-2015
 
The dynamics of disease transmission
The dynamics of disease transmissionThe dynamics of disease transmission
The dynamics of disease transmission
 
Study designs dr.wah
Study designs dr.wahStudy designs dr.wah
Study designs dr.wah
 
Standardization dr.wah
Standardization dr.wahStandardization dr.wah
Standardization dr.wah
 
Sdg
SdgSdg
Sdg
 

Recently uploaded

Russian Escorts Aishbagh Road * 9548273370 Naughty Call Girls Service in Lucknow
Russian Escorts Aishbagh Road * 9548273370 Naughty Call Girls Service in LucknowRussian Escorts Aishbagh Road * 9548273370 Naughty Call Girls Service in Lucknow
Russian Escorts Aishbagh Road * 9548273370 Naughty Call Girls Service in Lucknowgragteena
 
Low Rate Call Girls In Bommanahalli Just Call 7001305949
Low Rate Call Girls In Bommanahalli Just Call 7001305949Low Rate Call Girls In Bommanahalli Just Call 7001305949
Low Rate Call Girls In Bommanahalli Just Call 7001305949ps5894268
 
Call Girls in Mohali Surbhi ❤️🍑 9907093804 👄🫦 Independent Escort Service Mohali
Call Girls in Mohali Surbhi ❤️🍑 9907093804 👄🫦 Independent Escort Service MohaliCall Girls in Mohali Surbhi ❤️🍑 9907093804 👄🫦 Independent Escort Service Mohali
Call Girls in Mohali Surbhi ❤️🍑 9907093804 👄🫦 Independent Escort Service MohaliHigh Profile Call Girls Chandigarh Aarushi
 
Gurgaon iffco chowk 🔝 Call Girls Service 🔝 ( 8264348440 ) unlimited hard sex ...
Gurgaon iffco chowk 🔝 Call Girls Service 🔝 ( 8264348440 ) unlimited hard sex ...Gurgaon iffco chowk 🔝 Call Girls Service 🔝 ( 8264348440 ) unlimited hard sex ...
Gurgaon iffco chowk 🔝 Call Girls Service 🔝 ( 8264348440 ) unlimited hard sex ...soniya singh
 
Dehradun Call Girls Service ❤️🍑 9675010100 👄🫦Independent Escort Service Dehradun
Dehradun Call Girls Service ❤️🍑 9675010100 👄🫦Independent Escort Service DehradunDehradun Call Girls Service ❤️🍑 9675010100 👄🫦Independent Escort Service Dehradun
Dehradun Call Girls Service ❤️🍑 9675010100 👄🫦Independent Escort Service DehradunNiamh verma
 
VIP Call Girls Hyderabad Megha 9907093804 Independent Escort Service Hyderabad
VIP Call Girls Hyderabad Megha 9907093804 Independent Escort Service HyderabadVIP Call Girls Hyderabad Megha 9907093804 Independent Escort Service Hyderabad
VIP Call Girls Hyderabad Megha 9907093804 Independent Escort Service Hyderabaddelhimodelshub1
 
Call Girls Hyderabad Kirti 9907093804 Independent Escort Service Hyderabad
Call Girls Hyderabad Kirti 9907093804 Independent Escort Service HyderabadCall Girls Hyderabad Kirti 9907093804 Independent Escort Service Hyderabad
Call Girls Hyderabad Kirti 9907093804 Independent Escort Service Hyderabaddelhimodelshub1
 
VIP Call Girl Sector 25 Gurgaon Just Call Me 9899900591
VIP Call Girl Sector 25 Gurgaon Just Call Me 9899900591VIP Call Girl Sector 25 Gurgaon Just Call Me 9899900591
VIP Call Girl Sector 25 Gurgaon Just Call Me 9899900591adityaroy0215
 
Call Girl Hyderabad Madhuri 9907093804 Independent Escort Service Hyderabad
Call Girl Hyderabad Madhuri 9907093804 Independent Escort Service HyderabadCall Girl Hyderabad Madhuri 9907093804 Independent Escort Service Hyderabad
Call Girl Hyderabad Madhuri 9907093804 Independent Escort Service Hyderabaddelhimodelshub1
 
Russian Call Girls in Chandigarh Ojaswi ❤️🍑 9907093804 👄🫦 Independent Escort ...
Russian Call Girls in Chandigarh Ojaswi ❤️🍑 9907093804 👄🫦 Independent Escort ...Russian Call Girls in Chandigarh Ojaswi ❤️🍑 9907093804 👄🫦 Independent Escort ...
Russian Call Girls in Chandigarh Ojaswi ❤️🍑 9907093804 👄🫦 Independent Escort ...High Profile Call Girls Chandigarh Aarushi
 
Kukatpally Call Girls Services 9907093804 High Class Babes Here Call Now
Kukatpally Call Girls Services 9907093804 High Class Babes Here Call NowKukatpally Call Girls Services 9907093804 High Class Babes Here Call Now
Kukatpally Call Girls Services 9907093804 High Class Babes Here Call NowHyderabad Call Girls Services
 
College Call Girls Hyderabad Sakshi 9907093804 Independent Escort Service Hyd...
College Call Girls Hyderabad Sakshi 9907093804 Independent Escort Service Hyd...College Call Girls Hyderabad Sakshi 9907093804 Independent Escort Service Hyd...
College Call Girls Hyderabad Sakshi 9907093804 Independent Escort Service Hyd...delhimodelshub1
 
Basics of Anatomy- Language of Anatomy.pptx
Basics of Anatomy- Language of Anatomy.pptxBasics of Anatomy- Language of Anatomy.pptx
Basics of Anatomy- Language of Anatomy.pptxAyush Gupta
 
Call Girls Madhapur 7001305949 all area service COD available Any Time
Call Girls Madhapur 7001305949 all area service COD available Any TimeCall Girls Madhapur 7001305949 all area service COD available Any Time
Call Girls Madhapur 7001305949 all area service COD available Any Timedelhimodelshub1
 
Call Girls Hyderabad Krisha 9907093804 Independent Escort Service Hyderabad
Call Girls Hyderabad Krisha 9907093804 Independent Escort Service HyderabadCall Girls Hyderabad Krisha 9907093804 Independent Escort Service Hyderabad
Call Girls Hyderabad Krisha 9907093804 Independent Escort Service Hyderabaddelhimodelshub1
 
Call Girls Service Chandigarh Grishma ❤️🍑 9907093804 👄🫦 Independent Escort Se...
Call Girls Service Chandigarh Grishma ❤️🍑 9907093804 👄🫦 Independent Escort Se...Call Girls Service Chandigarh Grishma ❤️🍑 9907093804 👄🫦 Independent Escort Se...
Call Girls Service Chandigarh Grishma ❤️🍑 9907093804 👄🫦 Independent Escort Se...High Profile Call Girls Chandigarh Aarushi
 
Russian Call Girls in Goa Samaira 7001305949 Independent Escort Service Goa
Russian Call Girls in Goa Samaira 7001305949 Independent Escort Service GoaRussian Call Girls in Goa Samaira 7001305949 Independent Escort Service Goa
Russian Call Girls in Goa Samaira 7001305949 Independent Escort Service Goanarwatsonia7
 

Recently uploaded (20)

Russian Escorts Aishbagh Road * 9548273370 Naughty Call Girls Service in Lucknow
Russian Escorts Aishbagh Road * 9548273370 Naughty Call Girls Service in LucknowRussian Escorts Aishbagh Road * 9548273370 Naughty Call Girls Service in Lucknow
Russian Escorts Aishbagh Road * 9548273370 Naughty Call Girls Service in Lucknow
 
Low Rate Call Girls In Bommanahalli Just Call 7001305949
Low Rate Call Girls In Bommanahalli Just Call 7001305949Low Rate Call Girls In Bommanahalli Just Call 7001305949
Low Rate Call Girls In Bommanahalli Just Call 7001305949
 
Call Girls in Mohali Surbhi ❤️🍑 9907093804 👄🫦 Independent Escort Service Mohali
Call Girls in Mohali Surbhi ❤️🍑 9907093804 👄🫦 Independent Escort Service MohaliCall Girls in Mohali Surbhi ❤️🍑 9907093804 👄🫦 Independent Escort Service Mohali
Call Girls in Mohali Surbhi ❤️🍑 9907093804 👄🫦 Independent Escort Service Mohali
 
Gurgaon iffco chowk 🔝 Call Girls Service 🔝 ( 8264348440 ) unlimited hard sex ...
Gurgaon iffco chowk 🔝 Call Girls Service 🔝 ( 8264348440 ) unlimited hard sex ...Gurgaon iffco chowk 🔝 Call Girls Service 🔝 ( 8264348440 ) unlimited hard sex ...
Gurgaon iffco chowk 🔝 Call Girls Service 🔝 ( 8264348440 ) unlimited hard sex ...
 
College Call Girls Dehradun Kavya 🔝 7001305949 🔝 📍 Independent Escort Service...
College Call Girls Dehradun Kavya 🔝 7001305949 🔝 📍 Independent Escort Service...College Call Girls Dehradun Kavya 🔝 7001305949 🔝 📍 Independent Escort Service...
College Call Girls Dehradun Kavya 🔝 7001305949 🔝 📍 Independent Escort Service...
 
Dehradun Call Girls Service ❤️🍑 9675010100 👄🫦Independent Escort Service Dehradun
Dehradun Call Girls Service ❤️🍑 9675010100 👄🫦Independent Escort Service DehradunDehradun Call Girls Service ❤️🍑 9675010100 👄🫦Independent Escort Service Dehradun
Dehradun Call Girls Service ❤️🍑 9675010100 👄🫦Independent Escort Service Dehradun
 
VIP Call Girls Hyderabad Megha 9907093804 Independent Escort Service Hyderabad
VIP Call Girls Hyderabad Megha 9907093804 Independent Escort Service HyderabadVIP Call Girls Hyderabad Megha 9907093804 Independent Escort Service Hyderabad
VIP Call Girls Hyderabad Megha 9907093804 Independent Escort Service Hyderabad
 
Call Girls Hyderabad Kirti 9907093804 Independent Escort Service Hyderabad
Call Girls Hyderabad Kirti 9907093804 Independent Escort Service HyderabadCall Girls Hyderabad Kirti 9907093804 Independent Escort Service Hyderabad
Call Girls Hyderabad Kirti 9907093804 Independent Escort Service Hyderabad
 
VIP Call Girl Sector 25 Gurgaon Just Call Me 9899900591
VIP Call Girl Sector 25 Gurgaon Just Call Me 9899900591VIP Call Girl Sector 25 Gurgaon Just Call Me 9899900591
VIP Call Girl Sector 25 Gurgaon Just Call Me 9899900591
 
Call Girl Hyderabad Madhuri 9907093804 Independent Escort Service Hyderabad
Call Girl Hyderabad Madhuri 9907093804 Independent Escort Service HyderabadCall Girl Hyderabad Madhuri 9907093804 Independent Escort Service Hyderabad
Call Girl Hyderabad Madhuri 9907093804 Independent Escort Service Hyderabad
 
Russian Call Girls in Chandigarh Ojaswi ❤️🍑 9907093804 👄🫦 Independent Escort ...
Russian Call Girls in Chandigarh Ojaswi ❤️🍑 9907093804 👄🫦 Independent Escort ...Russian Call Girls in Chandigarh Ojaswi ❤️🍑 9907093804 👄🫦 Independent Escort ...
Russian Call Girls in Chandigarh Ojaswi ❤️🍑 9907093804 👄🫦 Independent Escort ...
 
Kukatpally Call Girls Services 9907093804 High Class Babes Here Call Now
Kukatpally Call Girls Services 9907093804 High Class Babes Here Call NowKukatpally Call Girls Services 9907093804 High Class Babes Here Call Now
Kukatpally Call Girls Services 9907093804 High Class Babes Here Call Now
 
College Call Girls Hyderabad Sakshi 9907093804 Independent Escort Service Hyd...
College Call Girls Hyderabad Sakshi 9907093804 Independent Escort Service Hyd...College Call Girls Hyderabad Sakshi 9907093804 Independent Escort Service Hyd...
College Call Girls Hyderabad Sakshi 9907093804 Independent Escort Service Hyd...
 
Basics of Anatomy- Language of Anatomy.pptx
Basics of Anatomy- Language of Anatomy.pptxBasics of Anatomy- Language of Anatomy.pptx
Basics of Anatomy- Language of Anatomy.pptx
 
Call Girls Madhapur 7001305949 all area service COD available Any Time
Call Girls Madhapur 7001305949 all area service COD available Any TimeCall Girls Madhapur 7001305949 all area service COD available Any Time
Call Girls Madhapur 7001305949 all area service COD available Any Time
 
Call Girls Hyderabad Krisha 9907093804 Independent Escort Service Hyderabad
Call Girls Hyderabad Krisha 9907093804 Independent Escort Service HyderabadCall Girls Hyderabad Krisha 9907093804 Independent Escort Service Hyderabad
Call Girls Hyderabad Krisha 9907093804 Independent Escort Service Hyderabad
 
Call Girls Service Chandigarh Grishma ❤️🍑 9907093804 👄🫦 Independent Escort Se...
Call Girls Service Chandigarh Grishma ❤️🍑 9907093804 👄🫦 Independent Escort Se...Call Girls Service Chandigarh Grishma ❤️🍑 9907093804 👄🫦 Independent Escort Se...
Call Girls Service Chandigarh Grishma ❤️🍑 9907093804 👄🫦 Independent Escort Se...
 
Russian Call Girls in Dehradun Komal 🔝 7001305949 🔝 📍 Independent Escort Serv...
Russian Call Girls in Dehradun Komal 🔝 7001305949 🔝 📍 Independent Escort Serv...Russian Call Girls in Dehradun Komal 🔝 7001305949 🔝 📍 Independent Escort Serv...
Russian Call Girls in Dehradun Komal 🔝 7001305949 🔝 📍 Independent Escort Serv...
 
Russian Call Girls in Goa Samaira 7001305949 Independent Escort Service Goa
Russian Call Girls in Goa Samaira 7001305949 Independent Escort Service GoaRussian Call Girls in Goa Samaira 7001305949 Independent Escort Service Goa
Russian Call Girls in Goa Samaira 7001305949 Independent Escort Service Goa
 
Call Girl Dehradun Aashi 🔝 7001305949 🔝 💃 Independent Escort Service Dehradun
Call Girl Dehradun Aashi 🔝 7001305949 🔝 💃 Independent Escort Service DehradunCall Girl Dehradun Aashi 🔝 7001305949 🔝 💃 Independent Escort Service Dehradun
Call Girl Dehradun Aashi 🔝 7001305949 🔝 💃 Independent Escort Service Dehradun
 

Chi square(hospital admin) A

  • 1. Chi Square distribution and Analysis of frequency Dr Inn Kynn Khaing Lecturer, Department of Biostatistics M.B.,B.S, MPH, MSc (Healthcare Administration) (Japan)
  • 2. Outline  Introduction  Mathematical properties of the chi-square distribution  Tests of Goodness-of-fit  Tests of Independence  Tests of Homogeneity  Fisher exact test  McNemar’s test 2
  • 3.  Most frequently employed statistical technique for the analysis of count or frequency data  E.g. a sample of hospitalized patients how many are male and how many are female  E.g. the type of insurance coverage differs according to gender.  E.g. There is a relationship between area of residence and diagnosis. 3
  • 4.  The chi-square distribution may be derived from normal distributions Chi-square distribution with 1 degree of freedom Mathematical form of the chi-square distribution 4
  • 5. 5
  • 6.  Chi-square distribution : in testing hypotheses where the data available for analysis are in the form of frequencies.  tests of goodness-of-fit, tests of independence, and tests of homogeneity…  Goodness-of-fit tests: they test the goodness-of-fit of observed frequencies to frequencies that one would expect if the data were generated under some particular theory or hypothesis.  It refers to a comparison of a sample distribution to some theoretical distribution that it is assumed describes the population from which the sample came.  Karl Pearson, who showed that the chi-square distribution may be used as a test of the agreement between observation and hypothesis whenever the data are in the form of frequencies. 6
  • 7.  Observed frequencies - the number of subjects or objects in our sample that fall into the various categories of the variable of interest. For example, if we have a sample of 100 hospital patients, we may observe that 50 are married, 30 are single, 15 are widowed, and 5 are divorced.  Expected frequencies - the number of subjects or objects in our sample that we would expect to observe if some null hypothesis about the variable is true. For example, our null hypothesis might be that the four categories of marital status are equally represented in the population from which we drew our sample. In that case we would expect our sample to contain 25 married, 25 single, 25 widowed, and 25 divorced patients. 7
  • 8.  When the null hypothesis is true, X2 is distributed approximately as x2 with k- r degrees of freedom.  k = the number of groups for which observed and expected frequencies are available  r = the number of restrictions or constraints imposed on the given comparison.  The nature of X2 is such that when there is close agreement between observed and expected frequencies, it is small,  When the agreement is poor, it is large.  There is perfect agreement between the observed frequencies and the frequencies that one would expect, given that H0 is true, Oi - Ei = 0 8
  • 9.  The quantity ∑ [ (Oi - Ei )2/ Ei ] will be small if the observed and expected frequencies are close together and will be large if the differences are large.  The computed value of X2 is compared with the tabulated value of x2 with k – r degrees of freedom.  The decision rule, then, is: Reject H0 if X2 is greater than or equal to the tabulated x2 for the chosen value of a. 9
  • 10.  A goodness-of-fit test is appropriate when one wishes to decide if an observed distribution of frequencies is incompatible with some preconceived or hypothesized distribution.  In other words, one determines if actual numbers are similar to the expected or theoretical numbers i.e. goodness of fit to a theory. 10
  • 11. Cranor and Christensen (A-1) conducted a study to assess short-term clinical, economic, and humanistic outcomes of pharmaceutical care services for patients with diabetes in community pharmacies. For 47 of the subjects in the study, cholesterol levels are summarized in Table. Cholesterol Level (mg/dl) Number of Subjects 100.0–124.9 1 125.0–149.9 3 150.0–174.9 8 175.0–199.9 18 200.0–224.9 6 225.0–249.9 4 250.0–274.9 4 275.0–299.9 3 11
  • 12. Solution: 1. Data. See Table 2. Assumptions. We assume that the sample available for analysis is a simple random sample. 3. Hypotheses. H0: In the population from which the sample was drawn, cholesterol levels are normally distributed. HA: The sampled population is not normally distributed. 4. Test statistic. The test statistic is 5. Distribution of test statistic. If H0 is true, the test statistic is distributed approximately as chi-square with k-r degrees of freedom. The appropriate degrees of freedom are 8 (the number of groups or class intervals) - 3 (for the three restrictions: making ∑Ei = ∑Oi, and estimating μ and σ from the sample data) = 5. 12
  • 13. Solution: 6. Decision rule. We will reject H0 if the computed value of X2 is equal to or greater than the critical value of chi-square. 7. Calculation of test statistic. 13
  • 14. 14
  • 17. 17
  • 18. Solution: 8. Statistical decision. When we compare X2 = 10.566 with values of X2 in Appendix Table F, we see that it is less than x2 .95 = 11.070, so that we cannot reject the null hypothesis that the sample came from a normally distributed population. 9. Conclusion. We conclude that in the sampled population, cholesterol levels may follow a normal distribution. 10. p value. Since 11.070 > 10.566 > 9.236, .05 < p < .10. In other words, the probability of obtaining a value of X2 as large as 10.566, when the null hypothesis is true, is between .05 and .10. We conclude that such an event is not sufficiently rare to reject the null hypothesis that the data come from a normal distribution. 18
  • 19. In a study designed to determine patient acceptance of a new pain reliever, 100 physicians each selected a sample of 25 patients to participate in the study. Each patient, after trying the new pain reliever for a specified period of time, was asked whether it was preferable to the pain reliever used regularly in the past. We are interested in determining whether or not these data are compatible with the hypothesis that they were drawn from a population that follows a binomial distribution. 19
  • 20. 20
  • 21. Solution: A total of 500 patients out of the 2500 patients participating in the study said they preferred the new pain reliever, so that our point estimate of p is ṗ = 500/2500 = 0.20. The expected relative frequencies can be obtained by evaluating the binomial function for x = 0, 1, . . ., 25. 21
  • 22. 22 The first expected frequency is less than 1, so that we follow Cochran’s suggestion and combine this group with the second group. When we do this, all the expected frequencies are greater than 1.
  • 23. 23
  • 24. Solution: From the data, we compute The appropriate degrees of freedom are 10 less 2, or 8. One degree of freedom is lost because we force the total of the expected frequencies to equal the total observed frequencies, and one degree of freedom is sacrificed because we estimated p from the sample data. We compare our computed X2 with the tabulated x2 with 8 degrees of freedom and find that it is significant at the .005 level of significance; p < .005. We reject the null hypothesis that the data came from a binomial distribution. 24
  • 25. A hospital administrator wishes to test the null hypothesis that emergency admissions follow Poisson distribution with λ= 3. Suppose that over a period of 90 days the numbers of emergency admissions were as shown in Table. 25
  • 26. 26
  • 27. Solution: First expected relative frequency is obtained by evaluating Each of the expected relative frequencies is multiplied by 90 to obtain the corresponding expected frequencies. Since the parameter, λ, was specified in the null hypothesis, we do not lose a degree of freedom for reasons of estimation, so that the appropriate degrees of freedom are 9 – 1 = 8. 27
  • 28. 28 We also note that the last three expected frequencies are less than 1, so that they must be combined to avoid having any expected frequencies less than 1.
  • 29. Solution: we find that the critical value of x2 for 8 degrees of freedom α= .05 is 15.507, so that we cannot reject the null hypothesis at the .05 level, or for that matter any reasonable level, of significance (p > .10). Therefore, we conclude that emergency admissions at this hospital may follow a Poisson distribution with λ= 3. 29
  • 31.  Another the most frequent use of the chi-square distribution  To test the null hypothesis that two criteria of classification, when applied to the same set of entities, are independent.  Two criteria of classification are independent if the distribution of one criterion is the same no matter what the distribution of the other criterion  For example, if socioeconomic status and area of residence of the inhabitants of a certain city are independent, we would expect to find the same proportion of families in the low, medium, and high socioeconomic groups in all areas of the city. 31
  • 32. Contingency Table  The classification, according to two criteria, of a set of entities, say, people, can be shown by a table in which the r rows represent the various levels of one criterion of classification and the c columns represent the various levels of the second criterion.  Such a table is generally called a contingency table, with dimension r x c. the two criteria of classification are not independent 32
  • 33. In 1992, the U.S. Public Health Service and the Centers for Disease Control and Prevention recommended that all women of childbearing age consume 400mg of folic acid daily to reduce the risk of having a pregnancy that is affected by a neural tube defect such as spina bifida or anencephaly. 693 pregnant women called a teratology information service about their use of folic acid supplementation. The researchers wished to determine if preconceptional use of folic acid and race are independent. 33
  • 34. Solution: 1. Data. See Table 2. Assumptions. We assume that the sample available for analysis is equivalent to a simple random sample drawn from the population of interest. 3. Hypotheses. H0: Race and preconceptional use of folic acid are independent. HA: The two variables are not independent. Let α=.05. 4. Test statistic. The test statistic is 5. Distribution of test statistic. When H0 is true, X2 is distributed approximately as x2 with (r-1) (c-1) = (3-1) (2-1) = 2 x 1= 2 degrees of freedom. 6. Decision rule. Reject H0 if the computed value of X2 is equal to or greater than 5.991. 34
  • 35. Solution: 7. Calculation of test statistic. The expected frequency for the first cell is (559 x 282)/636 = 247.86. 35
  • 36. Solution: 8. Statistical decision. We reject H0 since 9.08960 > 5.991. 9. Conclusion. We conclude that H0 is false, and that there is a relationship between race and preconceptional use of folic acid. 10. p value. Since 7.378 < 9.08960 < 9.210, .01 < p < .025. 36
  • 37. Small Expected Frequencies  The problem of small expected frequencies may be encountered when analyzing the data of contingency tables.  Although there is a lack of consensus on how to handle this problem, many authors currently follow the rule given by Cochran. He suggests that for contingency tables with more than 1 degree of freedom a minimum expectation of 1 is allowable if no more than 20 percent of the cells have expected frequencies of less than 5.  To meet this rule, adjacent rows and/or adjacent columns may be combined when to do so is logical in light of other considerations. 37
  • 38.  The 2 x 2 Contingency Table Sometimes each of two criteria of classification may be broken down into only two categories, or levels.  When data are cross classified in this manner, the result is a contingency table consisting of two rows and two columns. Such a table is commonly referred to as a 2x2 table.  The value of X2 may be computed by first calculating the expected cell frequencies. In the case of a 2x2 contingency table, however, X2 may be calculated by the following shortcut formula: 38
  • 39. According to Silver and Aiello (A-4), falls are of major concern among polio survivors. Researchers wanted to determine the impact of a fall on lifestyle changes. Table shows the results of a study of 233 polio survivors on whether fear of falling resulted in lifestyle changes. 39
  • 40. Solution: 1. Data. From the information given we may construct the 2x2 contingency table 2. Assumptions. We assume that the sample is equivalent to a simple random sample. 3. Hypotheses. H0: Fall status and lifestyle change because of fear of falling are independent. H1: The two variables are not independent. Let α=.05. 4. Test statistic. The test statistic is 40
  • 41. Solution: 5. Distribution of test statistic. When H0 is true, X2 is distributed approximately as x2 with (r-1) (c-1) = (2-1) (2-1) = 1 x 1 =1 degree of freedom. 6. Decision rule. Reject H0 if the computed value of X2 is equal to or > than 3.841. 7. Calculation of test statistic. By Equation 8. Statistical decision. We reject H0 since 31.7391 > 3.841. 9. Conclusion. We conclude that H0 is false, and that there is a relationship between experiencing a fall and changing one’s lifestyle because of fear of falling. 10. p value. Since 31.7391 > 7.879, p < .005. 41
  • 42. Tests of Independence - Characteristics 1. A single sample is selected from a population of interest, and the subjects or objects are cross-classified on the basis of the two variables of interest. 2. The rationale for calculating expected cell frequencies is based on the probability law, which states that if two events (here the two criteria of classification) are independent, the probability of their joint occurrence is equal to the product of their individual probabilities. 3. The hypotheses and conclusions are stated in terms of the independence (or lack of independence) of two variables. 44
  • 43.  one set of marginal totals is said to be fixed, while the other set, corresponding to the criterion of classification applied to the samples, is random  The situation leads to a chi-square test of homogeneity  The two situations not only involve different sampling procedures; they lead to different questions and null hypotheses  The homogeneity test is concerned with the question “Are the samples drawn from populations that are homogeneous with respect to some criterion of classification?” 45
  • 44. Narcolepsy is a disease involving disturbances of the sleep–wake cycle. Members of the German Migraine and Headache Society studied the relationship between migraine headaches in 96 subjects diagnosed with narcolepsy and 96 healthy controls. The results are shown in Table. We wish to know if we may conclude, on the basis of these data, that the narcolepsy population and healthy populations represented by the samples are not homogeneous with respect to migraine frequency. 46
  • 45. Solution: 1. Data. See Table 2. Assumptions. We assume that we have a simple random sample from each of the two populations of interest. 3. Hypotheses. H0: The two populations are homogeneous with respect to migraine frequency. HA: The two populations are not homogeneous with respect to migraine frequency. Let α = .05. 4. Test statistic. The test statistic is 5. Distribution of test statistic. If H0 is true, X2 is distributed approximately as x2 with (2-1) (2-1) = 1 x 1 = 1 degree of freedom. 47
  • 46. Solution: 6. Decision rule. Reject H0 if the computed value of X2 is equal to or greater than 3.841. 7. Calculation of test statistic. Subject Reported migraine headache Observed Frequency (Oi) Expected Frequency (Ei) Oi - Ei (Oi – Ei )2 (Oi – Ei )2/Ei Narcoleptic subjects Yes 21 No 75 Healthy controls Yes 19 No 77 Total 192 48
  • 47. Reported Migraine Headaches Yes No Total Narcoleptic subjects 20 76 96 Healthy controls 20 76 96 Total 40 152 192 49
  • 48. Solution: 6. Decision rule. Reject H0 if the computed value of X2 is equal to or greater than 3.841. 7. Calculation of test statistic. Subject Reported migraine headache Observed Frequency (Oi) Expected Frequency (Ei) Oi - Ei (Oi – Ei )2 (Oi – Ei )2/Ei Narcoleptic subjects Yes 21 20 No 75 76 Healthy controls Yes 19 20 No 77 76 Total 192 192 50
  • 49. Solution: 6. Decision rule. Reject H0 if the computed value of X2 is equal to or greater than 3.841. 7. Calculation of test statistic. Subject Reported migraine headache Observed Frequency (Oi) Expected Frequency (Ei) Oi - Ei (Oi – Ei )2 (Oi – Ei )2/Ei Narcoleptic subjects Yes 21 20 -1 No 75 76 1 Healthy controls Yes 19 20 -1 No 77 76 1 Total 192 192 51
  • 50. Solution: 6. Decision rule. Reject H0 if the computed value of X2 is equal to or greater than 3.841. 7. Calculation of test statistic. Subject Reported migraine headache Observed Frequency (Oi) Expected Frequency (Ei) Oi - Ei (Oi – Ei )2 (Oi – Ei )2/Ei Narcoleptic subjects Yes 21 20 -1 1 No 75 76 1 1 Healthy controls Yes 19 20 -1 1 No 77 76 1 1 Total 192 192 52
  • 51. Solution: 6. Decision rule. Reject H0 if the computed value of X2 is equal to or greater than 3.841. 7. Calculation of test statistic. Subject Reported migraine headache Observed Frequency (Oi) Expected Frequency (Ei) Oi - Ei (Oi – Ei )2 (Oi – Ei )2/Ei Narcoleptic subjects Yes 21 20 -1 1 0.05 No 75 76 1 1 0.0132 Healthy controls Yes 19 20 -1 1 0.05 No 77 76 1 1 0.0132 Total 192 192 0.1263 53
  • 52. Solution: 8. Statistical decision. Since .126 is less than the critical value of 3.841, we are unable to reject the null hypothesis. 9. Conclusion. We conclude that the two populations may be homogeneous with respect to migraine frequency. 10. p value. p = .722. 54
  • 53. Tests of Homogeneity - Characteristics 1. Two or more populations are identified in advance, and an independent sample is drawn from each. 2. Sample subjects or objects are placed in appropriate categories of the variable of interest. 3. The calculation of expected cell frequencies is based on the rationale that if the populations are homogeneous as stated in the null hypothesis, the best estimate of the probability that a subject or object will fall into a particular category of the variable of interest can be obtained by pooling the sample data. 4. The hypotheses and conclusions are stated in terms of homogeneity (with respect to the variable of interest) of populations. 55
  • 54. Tests of Homogeneity  It assesses the significance of difference in two or more than two proportions. i.e, testing the homogenous distribution of particular characteristics among groups.  It is concerned with the question “Are the sample drawn from population that are homogenous with respect to some criteria of classification?” Tests of Independence  It tests the association between two events in binomial or multinomial samples. For example, if socio-economic status and area of residence of inhabitants of a certain city are independent, we would expect to find the same proportions of families in the low, medium and high socio-economic groups in all areas of the city.  It is concerned with the question “Are the two criteria of classification independent?” 56
  • 55.  Although difference in concept and sampling procedures, the two tests are mathematically identical. 57
  • 56. 58