sample size phd-finalpresentation111.ppt

SAMPLE SIZE CALCULATIONS
Presented By:
Dr. Nivedita Yadav
Dr. Parul Singhal
Dr. Kanishka Tyagi
Dr. Akanksha Sirohi
Dr. Aarushi
Dr. Aanchal Singh
Guided By:
Dr. Kaynat Nasser

NEED FOR SAMPLE SIZE
CALCULATION
• Sample-size determination is often an important step in
planning an epidemiological study
• An adequate sample size helps ensure that the study will
yield reliable information.
• Conducting a study with an inadequate sample size is not
only futile, it is also un ethical.
• Different study design need different method of sample
size calculation and one formula cannot be used in all
designs.
• Determining sample size is a very important issue
because samples that are too large may waste time,
resources and money, while samples that are too small
may lead to inaccurate results.

• Sampling frame: It is a complete
enumeration of the sampling units in the study
population, which may be a list, directory,
map, arial configuration.
• Sampling unit: It may be an individual, a
household or a school.
Non-representativeness
of the study population
results in a lowered
accuracy
Small sample size
leads to low precision

KNOWLEDGE OF THE POPULATION PARAMETERS
 By pilot surveys
 By use of results of previous surveys
 By intelligent guess

BASIS FOR DETERMINING THE SIZE OF
SAMPLE
 Specification of a precision level.
 Specification of level of confidence.
 Power: The likelihood of rejecting the null hypothesis
when the null hypothesis is false.

MARGIN OF ERROR/SAMPLING ERROR
 The margin of error is a statistic expressing the amount of random sampling error in a survey's
results
 Larger the margin of error, the less confidence.
 The difference between the sample statistic and the related population parameter is called the
sampling error.
Margin of error Sample size

https://www.surveymonkey.com/mp/margin-of-error-calculator/

SAMPLE SIZE
 The choosing of sample size depends on non-
statistical and statistical considerations.
 Nonstatistical: availability of manpower and
sampling frames.
 Statistical considerations : Precision of the estimate
of prevalence and the expected prevalence of the
disease.

SAMPLE SIZE REQUIRED FOR ESTIMATING
POPULATION MEAN
• Suppose we want an interval that extends d units on either side of the estimator
d = (reliability coefficient) x (Standard error)
• If sampling is from a population sufficiently large size, the equation is:
d = z s
n
• When solved for n gives:
n = z2 s2
d2
width of the confidence interval (d)
level of confidence (z)
population variance (s2)

SAMPLE SIZE FOR POPULATION MEAN
 A farm has 1000 young pigs with an initial weight of about 50 kgs.
They put them on a new diet for 3 weeks and want to know how
many pigs to sample so that they can estimate the average weight
gain. We want the results to be within 2 Kgs with 90% confidence
level.
 We have no idea of σ or SD
90% confidence level =1.645

SAMPLE SIZE REQUIRED FOR ESTIMATING
PROPORTIONS
• Same as for population mean.
• Assuming random sampling and approximate normality
in the distribution of p, brings us to the formula for n if
sampling is with replacement, from a population
sufficiently large to warrant ignoring the finite
population correction :
Where q = 1 – p
n
z pq
=
2
2
d

WHAT SAMPLE SIZE FOR PROPORTION
• A researcher wants to estimate the true FMD immunization coverage in a village of cattle
population
• As per literature review , the immunization coverage should be somewhere around 80%
• Precision (absolute): we’d like the result to be within 4% of the true value
• Confidence level: conventional = 95% = 1 - α; therefore, α = 0.05 and z(1-a/2) = 1.96 =
value of the standard normal distribution corresponding to a significance level of 0.05
(1.96 for a 2-sided test at the 0.05 level)
• d = absolute precision = 0.04
• p = expected proportion in the population = 0.80
• z(1-a/2) = 1.96 = value of the standard normal distribution corresponding to a significance
level of a (1.96 for a 2-sided test at the 0.05 level)
z2 . p . (1-p)
n = -------------------------
d2
(1.96)2 (.80) (.20)
= ------------------------------
(0.04)2
= 384

DESCRIPTIVE STUDIES
• In general, these studies can only identify patterns or trends in
disease occurrence over time or in different geographical
locations, but cannot ascertain the causal agent or degree of
exposure.
• To calculate the required sample size in a descriptive study, we
need to know the level of precision, level of confidence or risk
and degree of variability.

FINITE POPULATION CORRECTION FACTOR
 When population sizes are less than 10 times the
estimated sample size, it is possible to use a
finite population correction factor.
 The finite population correction factor measures
how much extra precision we achieve when the
sample size becomes close to the population
size.
N is the size of the population and n is the size of
the sample.
If fpc is close to 1, then there is almost no effect.
When fpc is much smaller than 1, then sampling a
large fraction of the population is indeed having an effect
on precision.

INDEPENDENT CASE-CONTROL STUDIES
α = alpha, β = 1 – power, ψ = odds ratio
m– number of
control subjects per case subject, p1 – probability
of exposure in controls. p0 can be estimated as the
population prevalence
of exposure, nc is the continuity corrected sample
size and Zp is the standard normal deviate for
probability p

SAMPLE SIZE FOR MATCHED CASE-CONTROL
STUDIES

SAMPLE SIZE FOR INDEPENDENT COHORT
STUDIES

SAMPLE SIZE FOR PAIRED COHORT STUDIES

SAMPLE SIZE CALCULATION FOR
CROSS SECTIONAL STUDIES/SURVEYS
For qualitative variable

SAMPLE SIZE CALCULATION FOR CROSS
SECTIONAL STUDIES/SURVEYS
For quantitative variable

CASE – CONTROL STUDY
Qualitative variable

SAMPLE SIZE CALCULATION FOR TESTING A
HYPOTHESIS (CLINICAL TRIALS OR CLINICAL
INTERVENTIONAL STUDIES)

RESOURCE EQUATION METHOD
 It depends on the size of the whole experiment and the
number of treatment groups, not the individual group
sizes.
 If a value of E is less than 10 then more animal should
be included and if it is more than 20 then sample size
should be decreased.
 The resource equation method is useful when there is
no previous estimate of the standard deviation.

RESOURCE EQUATION METHOD EXAMPLE
 For example, if a factorial experiment is planned with both sexes
and three dose levels then there will be six treatment groups. If it
is proposed that there should be eight animals in each treatment
group (as is common), there will be 48 animals in total and E = 48
– 6 = 42. This experiment is unnecessarily large.
 Redesigning it with four animals per group, E = 24 – 6 = 18,
which is within the suggested limits of 10 – 20.
 A power analysis should be used in preference to the resource
equation method wherever possible.
 Unfortunately, power analysis is not so easy to use when there are
more than two groups because it is more difficult (but not
impossible) to specify the effect size of interest.

WHAT FACTORS AFFECT THE POWER OF A
TEST?
To increase the power of your test, you may do any of the
following:
1. Increase the effect size (the difference between the null
and alternative values) to be detected
2. Increase the sample size(s)
3. Decrease the variability in the sample(s)
4. Increase the significance level (alpha) of the test

sample size phd-finalpresentation111.ppt

sample size phd-finalpresentation111.ppt

Recommended

Recommended

More Related Content

Similar to sample size phd-finalpresentation111.ppt

Similar to sample size phd-finalpresentation111.ppt (20)

Recently uploaded

Recently uploaded (20)

sample size phd-finalpresentation111.ppt

Editor's Notes