SlideShare a Scribd company logo
1 of 35
Sample size calculations
Dr Vinodh Kumar O.R
Division of Epidemiology
ICAR-Indian Veterinary Research Institute
Izatnagar, Bareilly-243 122
NEED FOR SAMPLE SIZE CALCULATION
• Sample-size determination is often an important step in
planning an epidemiological study
• An adequate sample size helps ensure that the study will
yield reliable information.
• Conducting a study with an inadequate sample size is not
only futile, it is also un ethical.
• Different study design need different method of sample size
calculation and one formula cannot be used in all designs.
• Determining sample size is a very important issue because
samples that are too large may waste time, resources and
money, while samples that are too small may lead to
inaccurate results.
• Sampling frame: It is a complete
enumeration of the sampling units in
the study population, which may be
a list, directory, map, arial
configuration.
• Sampling unit: It may be an
individual, a household or a school.
Non-representativeness
of the study population
results in a lowered
accuracy
Small sample size
leads to low precision
Knowledge of the population
parameters
• By pilot surveys
• By use of results of previous surveys
• By intelligent guess
a and confidence level
• Alpha (a ): The
significance level of a
test: the probability of
rejecting the null
hypothesis when it is true
(or the probability of
making a Type I error).
• Confidence level: The
probability that an
estimate of a population
parameter is within
certain specified limits of
the true value; commonly
denoted by “1- a”.
• Beta( ) : The probability of
failing to reject the null
hypothesis when it is false (or
the probability of making a Type
II error).
• Power: The probability of
correctly rejecting the null
hypothesis when it is false;
commonly denoted by “1- ”
• Precision: A measure of how
close an estimate is to the true
value of a population parameter.
It may be expressed in absolute
terms or relative to the estimate.
• Degree of precision is the
margin of permissible error
between the estimated value and
the population value.
Basis for determining the size of sample
• Specification of a precision level.
• Specification of level of confidence.
• Power: The likelihood of rejecting the null
hypothesis when the null hypothesis is false.
Margin of error/sampling error
• The margin of error is a statistic expressing the amount of
random sampling error in a survey's results
• Larger the margin of error, the less confidence.
• The difference between the sample statistic and the related
population parameter is called the sampling error.
Margin of error Sample size
https://www.surveymonkey.com/mp/margin-of-error-calculator/
Sample size
• The choosing of sample size depends on non-
statistical and statistical considerations.
• Nonstatistical: availability of manpower and
sampling frames.
• Statistical considerations : Precision of the
estimate of prevalence and the expected
prevalence of the disease.
Sample size required for estimating
population mean
• Suppose we want an interval that extends d units on either side of the
estimator
d = (reliability coefficient) x (Standard error)
• If sampling is from a population sufficiently large size, the equation is:
d = z s
n
• When solved for n gives:
n = z2 s2
d2
width of the confidence interval (d)
level of confidence (z)
population variance (s2)
• A farm has 1000 young pigs with an initial weight of about 50 kgs. They
put them on a new diet for 3 weeks and want to know how many pigs to
sample so that they can estimate the average weight gain. We want the
results to be within 2 Kgs with 90% confidence level.
• We have no idea of σ or SD
Sample size for population mean
90% confidence level =1.645
Sample size required for estimating
proportions
n
z
• Same as for population mean.
• Assuming random sampling and approximate
normality in the distribution of p, brings us to the
formula for n if sampling is with replacement, from
a population sufficiently large to warrant ignoring
the finite population correction :
Where q = 1 – p
pq
=
2
2
d
What Sample Size for proportion
• A researcher wants to estimate the true FMD immunization coverage in a village of
cattle population
• As per literature review , the immunization coverage should be somewhere around 80%
• Precision (absolute): we’d like the result to be within 4% of the true value
• Confidence level: conventional = 95% = 1 - α; therefore, α = 0.05 and z(1-a/2) = 1.96 =
value of the standard normal distribution corresponding to a significance level of 0.05
(1.96 for a 2-sided test at the 0.05 level)
• d = absolute precision = 0.04
• p = expected proportion in the population = 0.80
• z(1-a/2) = 1.96 = value of the standard normal distribution corresponding to a significance
level of a (1.96 for a 2-sided test at the 0.05 level)
z2 . p . (1-p)
n = -------------------------
d2
(1.96)2 (.80) (.20)
= ------------------------------
(0.04)2
= 384
Descriptive studies
• In general, these studies can only identify
patterns or trends in disease occurrence over
time or in different geographical locations, but
cannot ascertain the causal agent or degree of
exposure.
• To calculate the required sample size in a
descriptive study, we need to know the level of
precision, level of confidence or risk and
degree of variability.
Finite population correction factor
• When population sizes are less than 10 times the
estimated sample size, it is possible to use a finite
population correction factor.
• The finite population correction factor measures how
much extra precision we achieve when the sample
size becomes close to the population size.
N is the size of the population and n is the size of
the sample.
If fpc is close to 1, then there is almost no effect.
When fpc is much smaller than 1, then sampling a
large fraction of the population is indeed having an effect
on precision.
Independent case-control studies
α = alpha, β = 1 – power, ψ = odds ratio
m– number of
control subjects per case subject, p1 – probability
of exposure in controls. p0 can be estimated as the
population prevalence
of exposure, nc is the continuity corrected sample
size and Zp is the standard normal deviate for
probability p
Sample size for matched case-control
studies
Sample size for independent cohort
studies
Sample size for paired cohort studies
Sample size calculation for cross sectional
studies/surveys
For qualitative variable
Sample size calculation for cross
sectional studies/surveys
For quantitative variable
Case – control study
Qualitative variable
Sample size calculation for testing a hypothesis
(Clinical trials or clinical interventional studies)
Resource equation method
• It depends on the size of the whole experiment and
the number of treatment groups, not the individual
group sizes.
• If a value of E is less than 10 then more animal
should be included and if it is more than 20 then
sample size should be decreased.
• The resource equation method is useful when there is
no previous estimate of the standard deviation.
• For example, if a factorial experiment is planned
with both sexes and three dose levels then there
will be six treatment groups. If it is proposed
that there should be eight animals in each
treatment group (as is common), there will be 48
animals in total and E = 48 – 6 = 42. This
experiment is unnecessarily large.
• Redesigning it with four animals per group, E =
24 – 6 = 18, which is within the suggested limits
of 10 – 20.
• A power analysis should be used in preference to
the resource equation method wherever possible.
• Unfortunately, power analysis is not so easy to
use when there are more than two groups
because it is more difficult (but not impossible)
to specify the effect size of interest.
Resource equation method example
What factors affect the power of a test?
To increase the power of your test, you may do
any of the following:
1. Increase the effect size (the difference
between the null and alternative values) to be
detected
2. Increase the sample size(s)
3. Decrease the variability in the sample(s)
4. Increase the significance level (alpha) of the
test
Sample size calculation tools
Websites
http://statpages.info/
http://www.openepi.com/Menu/OE_Menu.htm
NCSS PASS
Sample size calculation for ANOVA
ANOVA
ANOVA- Sample size
ANOVA- Sample size
Sample size for ANOVA

More Related Content

Similar to samplesizecalculations-1801190731542.ppt

SAMPLE SIZE DETERMINATION.ppt
SAMPLE SIZE DETERMINATION.pptSAMPLE SIZE DETERMINATION.ppt
SAMPLE SIZE DETERMINATION.pptabdulwehab2
 
samplesizedetermination-221008120007-0081a5b4.ppt
samplesizedetermination-221008120007-0081a5b4.pptsamplesizedetermination-221008120007-0081a5b4.ppt
samplesizedetermination-221008120007-0081a5b4.pptmekuriatadesse
 
Sample Size Determination
Sample Size DeterminationSample Size Determination
Sample Size DeterminationTina Sepehrifar
 
Advanced Biostatistics and Data Analysis abdul ghafoor sajjad
Advanced Biostatistics and Data Analysis abdul ghafoor sajjadAdvanced Biostatistics and Data Analysis abdul ghafoor sajjad
Advanced Biostatistics and Data Analysis abdul ghafoor sajjadHeadDPT
 
Sample size determination
Sample size determinationSample size determination
Sample size determinationGopal Kumar
 
presentation on calculation of sample size
presentation on calculation of sample sizepresentation on calculation of sample size
presentation on calculation of sample sizeRichaMishra186341
 
Sample determinants and size
Sample determinants and sizeSample determinants and size
Sample determinants and sizeTarek Tawfik Amin
 
Sample size
Sample sizeSample size
Sample sizezubis
 
Chapter_2_Sampling.pptx
Chapter_2_Sampling.pptxChapter_2_Sampling.pptx
Chapter_2_Sampling.pptxSubodhPaudel6
 
Bio-Statistics in Bio-Medical research
Bio-Statistics in Bio-Medical researchBio-Statistics in Bio-Medical research
Bio-Statistics in Bio-Medical researchShinjan Patra
 
BASIC STATISTICS AND THEIR INTERPRETATION AND USE IN EPIDEMIOLOGY 050822.pdf
BASIC STATISTICS AND THEIR INTERPRETATION AND USE IN EPIDEMIOLOGY 050822.pdfBASIC STATISTICS AND THEIR INTERPRETATION AND USE IN EPIDEMIOLOGY 050822.pdf
BASIC STATISTICS AND THEIR INTERPRETATION AND USE IN EPIDEMIOLOGY 050822.pdfAdamu Mohammad
 
T test^jsample size^j ethics
T test^jsample size^j ethicsT test^jsample size^j ethics
T test^jsample size^j ethicsAbhishek Thakur
 
Sample size estimation
Sample size estimationSample size estimation
Sample size estimationHanaaBayomy
 
Statistics basics for oncologist kiran
Statistics basics for oncologist kiranStatistics basics for oncologist kiran
Statistics basics for oncologist kiranKiran Ramakrishna
 
PPT on Sample Size, Importance of Sample Size,
PPT on Sample Size, Importance of Sample Size,PPT on Sample Size, Importance of Sample Size,
PPT on Sample Size, Importance of Sample Size,Naveen K L
 

Similar to samplesizecalculations-1801190731542.ppt (20)

SAMPLE SIZE DETERMINATION.ppt
SAMPLE SIZE DETERMINATION.pptSAMPLE SIZE DETERMINATION.ppt
SAMPLE SIZE DETERMINATION.ppt
 
samplesizedetermination-221008120007-0081a5b4.ppt
samplesizedetermination-221008120007-0081a5b4.pptsamplesizedetermination-221008120007-0081a5b4.ppt
samplesizedetermination-221008120007-0081a5b4.ppt
 
Sample Size Determination
Sample Size DeterminationSample Size Determination
Sample Size Determination
 
Advanced Biostatistics and Data Analysis abdul ghafoor sajjad
Advanced Biostatistics and Data Analysis abdul ghafoor sajjadAdvanced Biostatistics and Data Analysis abdul ghafoor sajjad
Advanced Biostatistics and Data Analysis abdul ghafoor sajjad
 
Sample size determination
Sample size determinationSample size determination
Sample size determination
 
Sample size calculation
Sample size calculationSample size calculation
Sample size calculation
 
presentation on calculation of sample size
presentation on calculation of sample sizepresentation on calculation of sample size
presentation on calculation of sample size
 
Sample determinants and size
Sample determinants and sizeSample determinants and size
Sample determinants and size
 
Sample and effect size
Sample and effect sizeSample and effect size
Sample and effect size
 
Sample size
Sample sizeSample size
Sample size
 
Chapter_2_Sampling.pptx
Chapter_2_Sampling.pptxChapter_2_Sampling.pptx
Chapter_2_Sampling.pptx
 
Bio-Statistics in Bio-Medical research
Bio-Statistics in Bio-Medical researchBio-Statistics in Bio-Medical research
Bio-Statistics in Bio-Medical research
 
How to do the maths
How to do the mathsHow to do the maths
How to do the maths
 
BASIC STATISTICS AND THEIR INTERPRETATION AND USE IN EPIDEMIOLOGY 050822.pdf
BASIC STATISTICS AND THEIR INTERPRETATION AND USE IN EPIDEMIOLOGY 050822.pdfBASIC STATISTICS AND THEIR INTERPRETATION AND USE IN EPIDEMIOLOGY 050822.pdf
BASIC STATISTICS AND THEIR INTERPRETATION AND USE IN EPIDEMIOLOGY 050822.pdf
 
T test^jsample size^j ethics
T test^jsample size^j ethicsT test^jsample size^j ethics
T test^jsample size^j ethics
 
Sample size estimation
Sample size estimationSample size estimation
Sample size estimation
 
To p or not to p
To p or not to pTo p or not to p
To p or not to p
 
Statistics basics for oncologist kiran
Statistics basics for oncologist kiranStatistics basics for oncologist kiran
Statistics basics for oncologist kiran
 
PPT on Sample Size, Importance of Sample Size,
PPT on Sample Size, Importance of Sample Size,PPT on Sample Size, Importance of Sample Size,
PPT on Sample Size, Importance of Sample Size,
 
Sample size
Sample sizeSample size
Sample size
 

Recently uploaded

The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfakmcokerachita
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 

Recently uploaded (20)

The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdf
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 

samplesizecalculations-1801190731542.ppt

  • 1. Sample size calculations Dr Vinodh Kumar O.R Division of Epidemiology ICAR-Indian Veterinary Research Institute Izatnagar, Bareilly-243 122
  • 2. NEED FOR SAMPLE SIZE CALCULATION • Sample-size determination is often an important step in planning an epidemiological study • An adequate sample size helps ensure that the study will yield reliable information. • Conducting a study with an inadequate sample size is not only futile, it is also un ethical. • Different study design need different method of sample size calculation and one formula cannot be used in all designs. • Determining sample size is a very important issue because samples that are too large may waste time, resources and money, while samples that are too small may lead to inaccurate results.
  • 3. • Sampling frame: It is a complete enumeration of the sampling units in the study population, which may be a list, directory, map, arial configuration. • Sampling unit: It may be an individual, a household or a school. Non-representativeness of the study population results in a lowered accuracy Small sample size leads to low precision
  • 4.
  • 5.
  • 6. Knowledge of the population parameters • By pilot surveys • By use of results of previous surveys • By intelligent guess
  • 7. a and confidence level • Alpha (a ): The significance level of a test: the probability of rejecting the null hypothesis when it is true (or the probability of making a Type I error). • Confidence level: The probability that an estimate of a population parameter is within certain specified limits of the true value; commonly denoted by “1- a”.
  • 8. • Beta( ) : The probability of failing to reject the null hypothesis when it is false (or the probability of making a Type II error). • Power: The probability of correctly rejecting the null hypothesis when it is false; commonly denoted by “1- ” • Precision: A measure of how close an estimate is to the true value of a population parameter. It may be expressed in absolute terms or relative to the estimate. • Degree of precision is the margin of permissible error between the estimated value and the population value.
  • 9. Basis for determining the size of sample • Specification of a precision level. • Specification of level of confidence. • Power: The likelihood of rejecting the null hypothesis when the null hypothesis is false.
  • 10. Margin of error/sampling error • The margin of error is a statistic expressing the amount of random sampling error in a survey's results • Larger the margin of error, the less confidence. • The difference between the sample statistic and the related population parameter is called the sampling error. Margin of error Sample size
  • 12. Sample size • The choosing of sample size depends on non- statistical and statistical considerations. • Nonstatistical: availability of manpower and sampling frames. • Statistical considerations : Precision of the estimate of prevalence and the expected prevalence of the disease.
  • 13. Sample size required for estimating population mean • Suppose we want an interval that extends d units on either side of the estimator d = (reliability coefficient) x (Standard error) • If sampling is from a population sufficiently large size, the equation is: d = z s n • When solved for n gives: n = z2 s2 d2 width of the confidence interval (d) level of confidence (z) population variance (s2)
  • 14. • A farm has 1000 young pigs with an initial weight of about 50 kgs. They put them on a new diet for 3 weeks and want to know how many pigs to sample so that they can estimate the average weight gain. We want the results to be within 2 Kgs with 90% confidence level. • We have no idea of σ or SD Sample size for population mean 90% confidence level =1.645
  • 15. Sample size required for estimating proportions n z • Same as for population mean. • Assuming random sampling and approximate normality in the distribution of p, brings us to the formula for n if sampling is with replacement, from a population sufficiently large to warrant ignoring the finite population correction : Where q = 1 – p pq = 2 2 d
  • 16. What Sample Size for proportion • A researcher wants to estimate the true FMD immunization coverage in a village of cattle population • As per literature review , the immunization coverage should be somewhere around 80% • Precision (absolute): we’d like the result to be within 4% of the true value • Confidence level: conventional = 95% = 1 - α; therefore, α = 0.05 and z(1-a/2) = 1.96 = value of the standard normal distribution corresponding to a significance level of 0.05 (1.96 for a 2-sided test at the 0.05 level) • d = absolute precision = 0.04 • p = expected proportion in the population = 0.80 • z(1-a/2) = 1.96 = value of the standard normal distribution corresponding to a significance level of a (1.96 for a 2-sided test at the 0.05 level) z2 . p . (1-p) n = ------------------------- d2 (1.96)2 (.80) (.20) = ------------------------------ (0.04)2 = 384
  • 17. Descriptive studies • In general, these studies can only identify patterns or trends in disease occurrence over time or in different geographical locations, but cannot ascertain the causal agent or degree of exposure. • To calculate the required sample size in a descriptive study, we need to know the level of precision, level of confidence or risk and degree of variability.
  • 18. Finite population correction factor • When population sizes are less than 10 times the estimated sample size, it is possible to use a finite population correction factor. • The finite population correction factor measures how much extra precision we achieve when the sample size becomes close to the population size. N is the size of the population and n is the size of the sample. If fpc is close to 1, then there is almost no effect. When fpc is much smaller than 1, then sampling a large fraction of the population is indeed having an effect on precision.
  • 19. Independent case-control studies α = alpha, β = 1 – power, ψ = odds ratio m– number of control subjects per case subject, p1 – probability of exposure in controls. p0 can be estimated as the population prevalence of exposure, nc is the continuity corrected sample size and Zp is the standard normal deviate for probability p
  • 20. Sample size for matched case-control studies
  • 21. Sample size for independent cohort studies
  • 22. Sample size for paired cohort studies
  • 23. Sample size calculation for cross sectional studies/surveys For qualitative variable
  • 24. Sample size calculation for cross sectional studies/surveys For quantitative variable
  • 25. Case – control study Qualitative variable
  • 26. Sample size calculation for testing a hypothesis (Clinical trials or clinical interventional studies)
  • 27. Resource equation method • It depends on the size of the whole experiment and the number of treatment groups, not the individual group sizes. • If a value of E is less than 10 then more animal should be included and if it is more than 20 then sample size should be decreased. • The resource equation method is useful when there is no previous estimate of the standard deviation.
  • 28. • For example, if a factorial experiment is planned with both sexes and three dose levels then there will be six treatment groups. If it is proposed that there should be eight animals in each treatment group (as is common), there will be 48 animals in total and E = 48 – 6 = 42. This experiment is unnecessarily large. • Redesigning it with four animals per group, E = 24 – 6 = 18, which is within the suggested limits of 10 – 20. • A power analysis should be used in preference to the resource equation method wherever possible. • Unfortunately, power analysis is not so easy to use when there are more than two groups because it is more difficult (but not impossible) to specify the effect size of interest. Resource equation method example
  • 29. What factors affect the power of a test? To increase the power of your test, you may do any of the following: 1. Increase the effect size (the difference between the null and alternative values) to be detected 2. Increase the sample size(s) 3. Decrease the variability in the sample(s) 4. Increase the significance level (alpha) of the test
  • 30. Sample size calculation tools Websites http://statpages.info/ http://www.openepi.com/Menu/OE_Menu.htm
  • 31. NCSS PASS Sample size calculation for ANOVA
  • 32. ANOVA

Editor's Notes

  1. For example, if it is a study in a village (with a population of say, 500) and the objective is to determine the prevalence of some unusual events or factors among the villagers, the selection unit ideally should be individuals residing in the village. In this case, the list of the names of all inhabitants will be the reference sampling frame. But there are situations where the sampling frame could not be worked out so easily. Taking example of a similar study covering a state, it is almost impossible to draw a list of all inhabitants residing in the state. So here, simple random sampling could not be appropriate; one has to make use of a more simple approach
  2. 1.Specification of a precision level: A decision on the tolerable limits of errors is made, i.e. the researcher makes a statement that it does not matter if his sample estimate does not differ from true population value by a certain amount. For example, suppose a Paediatrician plans a study to estimate the population of malnourished children in a village and suppose that the true proportion of malnourished children is 10%. He is satisfied if his estimate does not differ from true value of 10% by 5% i.e. he is okay with the result of his study if his estimate is within 9.5% to 10.5% (i.e. 10±0.5%). 2. Specification of level of confidence: This is the degree of uncertainty or probability that a sample value lies outside a stated limits (i.e. 10 ± 0.5) %. Suppose this measure is 5%, the investigator has to accept the unlikely situation of 1 in 20 cases that the sample result falls aside the desired limit; and if it is 1%, then the chance that the sample result falls outside the desired limits in 1 in 400. However, by convention, the mostly used confidence levels are 5% and 1%; but nothing stops the investigator from tolerating 10%, 2.5% etc.ond level
  3. When the sample size is 50, it does not matter much whether the population is 10 thousand or 10 million. When the sample size is four thousand, then we have about 23% more precision with a population of ten thousand than we would for a population of ten million.