SlideShare a Scribd company logo
1 of 46
Sampling
Sources:
-EPIET Introductory course,
Thomas Grein, Denis Coulombier, Philippe Sudre, Mike Catchpole
-IDEA
Brigitte Helynck, Philippe Malfait, Institut de veille sanitaire
Modified: Denise Antona, EPIET 2003
Objectives of presentation
• Definition of sampling
• Why do we use samples?
• Concept of representativeness
• Main methods of sampling
• Sampling error
• Sample size calculation
Definition of sampling
Procedure by which some members
of a given population are selected as
representatives of the entire population
Definition of sampling terms
• Sampling unit
– Subject under observation on which information is
collected
• Sampling fraction
– Ratio between the sample size and the population
size
• Sampling frame
– Any list of all the sampling units in the population
• Sampling scheme
– Method of selecting sampling units from sampling
frame
Why do we use samples ?
Get information from large populations
– At minimal cost
– At maximum speed
– At increased accuracy
– Using enhanced tools
Sampling
Precision
Cost
What we need to know
• Concepts
– Representativeness
– Sampling methods
– Choice of the right design
• Calculations
– Sampling error
– Design effect
– Sample size
Sampling and representativeness
Sample
Target Population
Sampling
Population
Target Population  Sampling Population  Sample
Representativeness
• Person
• Demographic characteristics (age, sex…)
• Exposure/susceptibility
• Place (Example : urban vs. rural)
• Time
• Seasonality
• Day of the week
• Time of the day
Ensure representativeness before starting,
confirm once completed !!!!!!
Types of samples
• Non-probability samples
• Probability samples
Non probability samples
• Quotas
• Sample reflects population structure
• Time/resources constraints
• Convenience samples (purposive units)
• Biased
• Best or worst scenario
Probability of being chosen : unknown
Probability samples
• Random sampling
• Each subject has a known probability of
being chosen
• Reduces possibility of selection bias
• Allows application of statistical theory to
results
Sampling error
• No sample is the exact mirror image of
the population
• Magnitude of error can be measured in
probability samples
• Expressed by standard error
– of mean, proportion, differences, etc
• Function of
– amount of variability in measuring factor of
interest
– sample size
Methods used in probability samples
• Simple random sampling
• Systematic sampling
• Stratified sampling
• Multistage sampling
• Cluster sampling
Quality of an estimate
Precision
& validity
No precision
Random
error !
Precision but
no validity
Systematic
error (Bias) !
Simple random sampling
• Principle
–Equal chance of drawing each unit
• Procedure
–Number all units
–Randomly draw units
Simple random sampling
• Advantages
–Simple
–Sampling error easily measured
• Disadvantages
–Need complete list of units
–Does not always achieve best
representativeness
–Units may be scattered
Example: evaluate the prevalence of tooth
decay among the 1200 children attending a
school
• List of children attending the school
• Children numerated from 1 to 1200
• Sample size = 100 children
• Random sampling of 100 numbers between 1
and 1200
How to randomly select?
Simple random sampling
Simple random sampling
57172 42088 70098 11333 26902 29959 43909 49607
33883 87680 28923 15659 09839 45817 89405 70743
77950 67344 10609 87119 15859 74577 42791 75889
11607 11596 01796 24498 17009 67119 00614 49529
56149 55678 38169 47228 49931 94303 67448 31286
80719 65101 77729 83949 83358 75230 56624 27549
93809 19505 82000 79068 45552 86776 48980 56684
40950 86216 48161 17646 24164 35513 94057 51834
12182 59744 65695 83710 41125 14291 74773 66391
13382 48076 73151 48724 35670 38453 63154 58116
38629 94576 48859 75654 17152 66516 78796 73099
60728 32063 12431 23898 23683 10853 04038 75246
01881 99056 46747 08846 01331 88163 74462 14551
23094 29831 95387 23917 07421 97869 88092 72201
15243 21100 48125 05243 16181 39641 36970 99522
53501 58431 68149 25405 23463 49168 02048 31522
07698 24181 01161 01527 17046 31460 91507 16050
22921 25930 79579 43488 13211 71120 91715 49881
68127 00501 37484 99278 28751 80855 02035 10910
55309 10713 36439 65660 72554 77021 46279 22705
92034 90892 69853 06175 61221 76825 18239 47687
50612 84077 41387 54107 09190 74305 68196 75634
81415 98504 32168 17822 49946 37545 47201 85224
38461 44528 30953 08633 08049 68698 08759 45611
07556 24587 88753 71626 64864 54986 38964 83534
60557 50031 75829 05622 30237 77795 41870 26300
Table of random numbers
EPITABLE: random number listing
EPITABLE: random number listing
Systematic sampling
• N = 1200, and n = 60
 sampling fraction = 1200/60 = 20
• List persons from 1 to 1200
• Randomly select a number between 1 and
20 (ex : 8)
 1st person selected = the 8th on the
list
 2nd person = 8 + 20 = the 28th
etc .....
Systematic sampling
1 2 3 4 5 6 7 8 9 10 11 12 13 14
15
31 32 33 34 35 36 37 38 39 40 41 42 43 44 45
16 17 18 19 20 21 22 23 24 25 26 27 28 29 30
46 47 48 49 50 51 52 53 54 55 ……..
Systematic sampling
Stratified sampling
• Principle :
–Classify population into internally
homogeneous subgroups (strata)
–Draw sample in each strata
–Combine results of all strata
Stratified sampling
• Advantages
– More precise if variable associated with
strata
– All subgroups represented, allowing
separate conclusions about each of
them
• Disadvantages
– Sampling error difficult to measure
– Loss of precision if very small numbers
sampled in individual strata
Example: Stratified sampling
• Determine vaccination coverage in a
country
• One sample drawn in each region
• Estimates calculated for each stratum
• Each stratum weighted to obtain
estimate for country (average)
Multiple stage sampling
Principle
• = consecutive samplings
• example :
sampling unit = household
– 1rst stage : drawing areas or blocks
– 2nd stage : drawing buildings, houses
– 3rd stage : drawing households
Cluster sampling
• Principle
–Random sample of groups (“clusters”)
of units
–In selected clusters, all units or
proportion (sample) of units included
Example: Cluster sampling
Section 4
Section 5
Section 3
Section 2
Section 1
Cluster sampling
• Advantages
– Simple as complete list of sampling units
within population not required
– Less travel/resources required
• Disadvantages
– Imprecise if clusters homogeneous and
therefore sample variation greater than
population variation (large design effect)
– Sampling error difficult to measure
EPI cluster sampling
To evaluate vaccination coverage:
• Without list of persons
• Total population of villages
• Randomly choose 30 clusters
• 30 cluster of 7 children each= 210 children
Drawing the clusters
You need :
– Map of the region
– Distribution of population (by villages or area)
– Age distribution (population 12-23 m :3%)
1600
220
3200
400
800
200
1200
200
1600
400
53000
7300
106000
13000
26500
6600
40000
6600
53000
13200
A
B
C
D
E
F
G
H
I
J
12-23
Pop.
Village
Distribution of the clusters
A
B
C
D
E
F
G
H
I
J
1600
220
3200
400
800
200
1200
200
1600
400
1600
1820
5020
5420
6220
6420
7620
7820
9420
9820
Total population = 9820
Compute cumulated population
Distribution of the clusters
Then compute sampling fraction :
K= = 327
Draw a random number (between 1
and 327)
Example: 62
Start from the village including “62”
and draw the clusters adding the
sampling fraction
9820
30
A
B
C
D
E
F
G
H
I
J
1600
1820
5020
5420
6220
6420
7620
7820
9420
9820
I I I I
I
I I I I I I I I I I
I
I I
I
I I I I
I
I I I I I
I
Drawing households and children
On the spot
Go to the center of the village , choose direction
(random)
Number the houses in this direction
 Ex: 21
Draw random number (between 1 and 21) to
identify the first house to visit
From this house progress until finding the 7
children ( itinerary rules fixed beforehand)
EPITABLE: Calculating design effect
Selecting a sampling method
• Population to be studied
– Size/geographical distribution
– Heterogeneity with respect to variable
• Level of precision required
• Resources available
• Importance of having a precise estimate
of the sampling error
Steps in estimating sample size
• Identify major study variable
• Determine type of estimate (%, mean, ratio,...)
• Indicate expected frequency of factor of interest
• Decide on desired precision of the estimate
• Decide on acceptable risk that estimate will fall outside
its real population value
• Adjust for estimated design effect
• Adjust for expected response rate
• (Adjust for population size? In case of small size
population only)
Sample size formula in
descriptive survey
z: alpha risk express in z-score
p: expected prevalence
q: 1 - p
d: absolute precision
g: design effect
z² * p * q 1.96²*0.15*0.85
n = -------------- ---------------------- = 544
d² 0.03²
Cluster sampling
z² * p * q 2*1.96²*0.15*0.85
n = g* -------------- ------------------------ = 1088
d² 0.03²
Simple random / systematic sampling
EPITABLE: cluster sample size
calculation
Place of sampling
in descriptive surveys
• Define objectives
• Define resources available
• Identify study population
• Identify variables to study
• Define precision required
• Establish plan of analysis (questionnaire)
• Create sampling frame
• Select sample
• Pilot data collection
• Collect data
• Analyse data
• Communicate results
• Use results
Conclusions
• Probability samples are the best
• Beware of …
– refusals
– absentees
– “do not know”
Conclusions
• If in doubt…
Call a statistician !!!!

More Related Content

Similar to 12- Sampling.ppt

Chapter Seven - .pptbhhhdfhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhd
Chapter Seven - .pptbhhhdfhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhdChapter Seven - .pptbhhhdfhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhd
Chapter Seven - .pptbhhhdfhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhd
beshahashenafe20
 
Introduction to sampling
Introduction to samplingIntroduction to sampling
Introduction to sampling
Situo Liu
 
Research method ch06 sampling
Research method ch06 samplingResearch method ch06 sampling
Research method ch06 sampling
naranbatn
 
Population sampling RSS6 2014
Population sampling RSS6 2014Population sampling RSS6 2014
Population sampling RSS6 2014
RSS6
 
POPULATION AND SAMPLING.pptx
POPULATION AND SAMPLING.pptxPOPULATION AND SAMPLING.pptx
POPULATION AND SAMPLING.pptx
MartMantilla1
 
sampling and statiscal inference
sampling and statiscal inferencesampling and statiscal inference
sampling and statiscal inference
Shruti MISHRA
 
Sampling by Mr Peng Kungkea
Sampling  by Mr Peng KungkeaSampling  by Mr Peng Kungkea
Sampling by Mr Peng Kungkea
Kungkea Peng
 
5. sampling design
5. sampling design5. sampling design
5. sampling design
kbhupadhoj
 

Similar to 12- Sampling.ppt (20)

Chapter Seven - .pptbhhhdfhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhd
Chapter Seven - .pptbhhhdfhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhdChapter Seven - .pptbhhhdfhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhd
Chapter Seven - .pptbhhhdfhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhd
 
Sampling....
Sampling....Sampling....
Sampling....
 
Sampling techniques new
Sampling techniques newSampling techniques new
Sampling techniques new
 
Sampling techniques new
Sampling techniques newSampling techniques new
Sampling techniques new
 
Sampling methodologies in research mrhod
Sampling methodologies in research mrhodSampling methodologies in research mrhod
Sampling methodologies in research mrhod
 
4 Sampling Methods(1).pptx
4 Sampling Methods(1).pptx4 Sampling Methods(1).pptx
4 Sampling Methods(1).pptx
 
Sampling biostatistics.pptx
Sampling biostatistics.pptxSampling biostatistics.pptx
Sampling biostatistics.pptx
 
Introduction to sampling
Introduction to samplingIntroduction to sampling
Introduction to sampling
 
Research method ch06 sampling
Research method ch06 samplingResearch method ch06 sampling
Research method ch06 sampling
 
Population sampling RSS6 2014
Population sampling RSS6 2014Population sampling RSS6 2014
Population sampling RSS6 2014
 
Business research sampling
Business research samplingBusiness research sampling
Business research sampling
 
Sampling
SamplingSampling
Sampling
 
POPULATION AND SAMPLING.pptx
POPULATION AND SAMPLING.pptxPOPULATION AND SAMPLING.pptx
POPULATION AND SAMPLING.pptx
 
sampling and statiscal inference
sampling and statiscal inferencesampling and statiscal inference
sampling and statiscal inference
 
Methods of Sampling Techniques and Sample Size
Methods of Sampling Techniques and Sample SizeMethods of Sampling Techniques and Sample Size
Methods of Sampling Techniques and Sample Size
 
Sampling by Mr Peng Kungkea
Sampling  by Mr Peng KungkeaSampling  by Mr Peng Kungkea
Sampling by Mr Peng Kungkea
 
Sampling Theory
Sampling TheorySampling Theory
Sampling Theory
 
probability and non-probability samplings
probability and non-probability samplingsprobability and non-probability samplings
probability and non-probability samplings
 
5. sampling design
5. sampling design5. sampling design
5. sampling design
 
Sampling methods 16
Sampling methods   16Sampling methods   16
Sampling methods 16
 

More from Raj Vel

More from Raj Vel (10)

Air pollution_AAM.ppt
Air pollution_AAM.pptAir pollution_AAM.ppt
Air pollution_AAM.ppt
 
Museum 4.ppt
Museum 4.pptMuseum 4.ppt
Museum 4.ppt
 
Museum 3.ppt
Museum 3.pptMuseum 3.ppt
Museum 3.ppt
 
Museum 2.ppt
Museum 2.pptMuseum 2.ppt
Museum 2.ppt
 
Museum 1.ppt
Museum 1.pptMuseum 1.ppt
Museum 1.ppt
 
9 Solid waste and liquid waste disposal.ppt
9 Solid waste and liquid waste disposal.ppt9 Solid waste and liquid waste disposal.ppt
9 Solid waste and liquid waste disposal.ppt
 
BOC.pptx
BOC.pptxBOC.pptx
BOC.pptx
 
Anthropometry.pptx
Anthropometry.pptxAnthropometry.pptx
Anthropometry.pptx
 
Health seeking behaviour.pptx
Health seeking behaviour.pptxHealth seeking behaviour.pptx
Health seeking behaviour.pptx
 
1 Barriers of Communication.pptx
1 Barriers of Communication.pptx1 Barriers of Communication.pptx
1 Barriers of Communication.pptx
 

Recently uploaded

Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
Epec Engineered Technologies
 
Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
Neometrix_Engineering_Pvt_Ltd
 
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
AldoGarca30
 

Recently uploaded (20)

Linux Systems Programming: Inter Process Communication (IPC) using Pipes
Linux Systems Programming: Inter Process Communication (IPC) using PipesLinux Systems Programming: Inter Process Communication (IPC) using Pipes
Linux Systems Programming: Inter Process Communication (IPC) using Pipes
 
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna Municipality
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
 
Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
 
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptxOrlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
Signal Processing and Linear System Analysis
Signal Processing and Linear System AnalysisSignal Processing and Linear System Analysis
Signal Processing and Linear System Analysis
 
Computer Graphics Introduction To Curves
Computer Graphics Introduction To CurvesComputer Graphics Introduction To Curves
Computer Graphics Introduction To Curves
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS Lambda
 
DC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equationDC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equation
 
UNIT 4 PTRP final Convergence in probability.pptx
UNIT 4 PTRP final Convergence in probability.pptxUNIT 4 PTRP final Convergence in probability.pptx
UNIT 4 PTRP final Convergence in probability.pptx
 
Introduction to Data Visualization,Matplotlib.pdf
Introduction to Data Visualization,Matplotlib.pdfIntroduction to Data Visualization,Matplotlib.pdf
Introduction to Data Visualization,Matplotlib.pdf
 
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
 
Ghuma $ Russian Call Girls Ahmedabad ₹7.5k Pick Up & Drop With Cash Payment 8...
Ghuma $ Russian Call Girls Ahmedabad ₹7.5k Pick Up & Drop With Cash Payment 8...Ghuma $ Russian Call Girls Ahmedabad ₹7.5k Pick Up & Drop With Cash Payment 8...
Ghuma $ Russian Call Girls Ahmedabad ₹7.5k Pick Up & Drop With Cash Payment 8...
 
fitting shop and tools used in fitting shop .ppt
fitting shop and tools used in fitting shop .pptfitting shop and tools used in fitting shop .ppt
fitting shop and tools used in fitting shop .ppt
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 

12- Sampling.ppt

  • 1. Sampling Sources: -EPIET Introductory course, Thomas Grein, Denis Coulombier, Philippe Sudre, Mike Catchpole -IDEA Brigitte Helynck, Philippe Malfait, Institut de veille sanitaire Modified: Denise Antona, EPIET 2003
  • 2. Objectives of presentation • Definition of sampling • Why do we use samples? • Concept of representativeness • Main methods of sampling • Sampling error • Sample size calculation
  • 3. Definition of sampling Procedure by which some members of a given population are selected as representatives of the entire population
  • 4. Definition of sampling terms • Sampling unit – Subject under observation on which information is collected • Sampling fraction – Ratio between the sample size and the population size • Sampling frame – Any list of all the sampling units in the population • Sampling scheme – Method of selecting sampling units from sampling frame
  • 5. Why do we use samples ? Get information from large populations – At minimal cost – At maximum speed – At increased accuracy – Using enhanced tools
  • 7. What we need to know • Concepts – Representativeness – Sampling methods – Choice of the right design • Calculations – Sampling error – Design effect – Sample size
  • 8. Sampling and representativeness Sample Target Population Sampling Population Target Population  Sampling Population  Sample
  • 9. Representativeness • Person • Demographic characteristics (age, sex…) • Exposure/susceptibility • Place (Example : urban vs. rural) • Time • Seasonality • Day of the week • Time of the day Ensure representativeness before starting, confirm once completed !!!!!!
  • 10. Types of samples • Non-probability samples • Probability samples
  • 11. Non probability samples • Quotas • Sample reflects population structure • Time/resources constraints • Convenience samples (purposive units) • Biased • Best or worst scenario Probability of being chosen : unknown
  • 12. Probability samples • Random sampling • Each subject has a known probability of being chosen • Reduces possibility of selection bias • Allows application of statistical theory to results
  • 13. Sampling error • No sample is the exact mirror image of the population • Magnitude of error can be measured in probability samples • Expressed by standard error – of mean, proportion, differences, etc • Function of – amount of variability in measuring factor of interest – sample size
  • 14. Methods used in probability samples • Simple random sampling • Systematic sampling • Stratified sampling • Multistage sampling • Cluster sampling
  • 15. Quality of an estimate Precision & validity No precision Random error ! Precision but no validity Systematic error (Bias) !
  • 16. Simple random sampling • Principle –Equal chance of drawing each unit • Procedure –Number all units –Randomly draw units
  • 17. Simple random sampling • Advantages –Simple –Sampling error easily measured • Disadvantages –Need complete list of units –Does not always achieve best representativeness –Units may be scattered
  • 18. Example: evaluate the prevalence of tooth decay among the 1200 children attending a school • List of children attending the school • Children numerated from 1 to 1200 • Sample size = 100 children • Random sampling of 100 numbers between 1 and 1200 How to randomly select? Simple random sampling
  • 20. 57172 42088 70098 11333 26902 29959 43909 49607 33883 87680 28923 15659 09839 45817 89405 70743 77950 67344 10609 87119 15859 74577 42791 75889 11607 11596 01796 24498 17009 67119 00614 49529 56149 55678 38169 47228 49931 94303 67448 31286 80719 65101 77729 83949 83358 75230 56624 27549 93809 19505 82000 79068 45552 86776 48980 56684 40950 86216 48161 17646 24164 35513 94057 51834 12182 59744 65695 83710 41125 14291 74773 66391 13382 48076 73151 48724 35670 38453 63154 58116 38629 94576 48859 75654 17152 66516 78796 73099 60728 32063 12431 23898 23683 10853 04038 75246 01881 99056 46747 08846 01331 88163 74462 14551 23094 29831 95387 23917 07421 97869 88092 72201 15243 21100 48125 05243 16181 39641 36970 99522 53501 58431 68149 25405 23463 49168 02048 31522 07698 24181 01161 01527 17046 31460 91507 16050 22921 25930 79579 43488 13211 71120 91715 49881 68127 00501 37484 99278 28751 80855 02035 10910 55309 10713 36439 65660 72554 77021 46279 22705 92034 90892 69853 06175 61221 76825 18239 47687 50612 84077 41387 54107 09190 74305 68196 75634 81415 98504 32168 17822 49946 37545 47201 85224 38461 44528 30953 08633 08049 68698 08759 45611 07556 24587 88753 71626 64864 54986 38964 83534 60557 50031 75829 05622 30237 77795 41870 26300 Table of random numbers
  • 23. Systematic sampling • N = 1200, and n = 60  sampling fraction = 1200/60 = 20 • List persons from 1 to 1200 • Randomly select a number between 1 and 20 (ex : 8)  1st person selected = the 8th on the list  2nd person = 8 + 20 = the 28th etc .....
  • 25. 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 46 47 48 49 50 51 52 53 54 55 ……..
  • 27. Stratified sampling • Principle : –Classify population into internally homogeneous subgroups (strata) –Draw sample in each strata –Combine results of all strata
  • 28. Stratified sampling • Advantages – More precise if variable associated with strata – All subgroups represented, allowing separate conclusions about each of them • Disadvantages – Sampling error difficult to measure – Loss of precision if very small numbers sampled in individual strata
  • 29. Example: Stratified sampling • Determine vaccination coverage in a country • One sample drawn in each region • Estimates calculated for each stratum • Each stratum weighted to obtain estimate for country (average)
  • 30. Multiple stage sampling Principle • = consecutive samplings • example : sampling unit = household – 1rst stage : drawing areas or blocks – 2nd stage : drawing buildings, houses – 3rd stage : drawing households
  • 31. Cluster sampling • Principle –Random sample of groups (“clusters”) of units –In selected clusters, all units or proportion (sample) of units included
  • 32. Example: Cluster sampling Section 4 Section 5 Section 3 Section 2 Section 1
  • 33. Cluster sampling • Advantages – Simple as complete list of sampling units within population not required – Less travel/resources required • Disadvantages – Imprecise if clusters homogeneous and therefore sample variation greater than population variation (large design effect) – Sampling error difficult to measure
  • 34. EPI cluster sampling To evaluate vaccination coverage: • Without list of persons • Total population of villages • Randomly choose 30 clusters • 30 cluster of 7 children each= 210 children
  • 35. Drawing the clusters You need : – Map of the region – Distribution of population (by villages or area) – Age distribution (population 12-23 m :3%) 1600 220 3200 400 800 200 1200 200 1600 400 53000 7300 106000 13000 26500 6600 40000 6600 53000 13200 A B C D E F G H I J 12-23 Pop. Village
  • 36. Distribution of the clusters A B C D E F G H I J 1600 220 3200 400 800 200 1200 200 1600 400 1600 1820 5020 5420 6220 6420 7620 7820 9420 9820 Total population = 9820 Compute cumulated population
  • 37. Distribution of the clusters Then compute sampling fraction : K= = 327 Draw a random number (between 1 and 327) Example: 62 Start from the village including “62” and draw the clusters adding the sampling fraction 9820 30 A B C D E F G H I J 1600 1820 5020 5420 6220 6420 7620 7820 9420 9820 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
  • 38. Drawing households and children On the spot Go to the center of the village , choose direction (random) Number the houses in this direction  Ex: 21 Draw random number (between 1 and 21) to identify the first house to visit From this house progress until finding the 7 children ( itinerary rules fixed beforehand)
  • 40. Selecting a sampling method • Population to be studied – Size/geographical distribution – Heterogeneity with respect to variable • Level of precision required • Resources available • Importance of having a precise estimate of the sampling error
  • 41. Steps in estimating sample size • Identify major study variable • Determine type of estimate (%, mean, ratio,...) • Indicate expected frequency of factor of interest • Decide on desired precision of the estimate • Decide on acceptable risk that estimate will fall outside its real population value • Adjust for estimated design effect • Adjust for expected response rate • (Adjust for population size? In case of small size population only)
  • 42. Sample size formula in descriptive survey z: alpha risk express in z-score p: expected prevalence q: 1 - p d: absolute precision g: design effect z² * p * q 1.96²*0.15*0.85 n = -------------- ---------------------- = 544 d² 0.03² Cluster sampling z² * p * q 2*1.96²*0.15*0.85 n = g* -------------- ------------------------ = 1088 d² 0.03² Simple random / systematic sampling
  • 43. EPITABLE: cluster sample size calculation
  • 44. Place of sampling in descriptive surveys • Define objectives • Define resources available • Identify study population • Identify variables to study • Define precision required • Establish plan of analysis (questionnaire) • Create sampling frame • Select sample • Pilot data collection • Collect data • Analyse data • Communicate results • Use results
  • 45. Conclusions • Probability samples are the best • Beware of … – refusals – absentees – “do not know”
  • 46. Conclusions • If in doubt… Call a statistician !!!!