SlideShare a Scribd company logo
1 of 47
Definition of sampling
 Procedure by which some members of the population are
selected as representatives of the entire population
Study population
 The study population is the population to which the results
of the study will be inferred
The study population depends upon the research question:
How many injections do people received each year in India?
 Study population: Population of India
How many needle-sticks health care workers experience each
year in India?
 Study population: Health care workers of India
Sample
The sample needs to be representative of the population in
terms of
1. Time:
Seasonality
Day of the week
Time of the day
2. Place:
Urban
Rural
3. Persons
Age
Sex
Other demographic characteristics
Definition of sampling terms
 Sampling unit (Basic sampling unit, BSU)
 Elementary unit that will be sampled
 People
 Health care workers
 Hospitals
 Sampling frame
 List of all sampling units in the population
 Sampling scheme
 Method used to select sampling units from the sampling
frame
Sampling
Why do we sample populations?
• Obtain information from large populations
• Ensure the efficiency of a study
• Obtain more accurate information
Example
 The Ministry of Health of a country X wants to estimate the
proportion of children in elementary schools who have
been immunized against childhood infectious diseases
 The task must be completed in one month
 The objective is to estimate the proportion of immunized
children
Type of samples
 Probability sampling
 Random sampling
 Removes the possibility of bias in selection of subjects
 Ensures that each subject has a known probability of being chosen
 Allows to draw valid conclusions about population
 Non-probability samples
 Probability of being selected is unknown
 Convenience sampling
 Subjective sampling
 Time/resources constraints
Sampling error
 No sample is a perfect mirror image of the population
 Magnitude of error can be measured in probability samples
 Expressed by standard error of mean, proportion,
differences…
 Function of:
Sample size
Variability in measurement
Simple
Random
Systematic
Random
Stratified Random
Cluster Sampling
Probability
Probability Sampling
1. Simple random sampling
 Principle
 Equal chance for each sampling unit
 Procedure
 Number all units
 Randomly draw units
 Advantages
 Simple
 Sampling error easily measured
 Disadvantages
 Need complete list of units
 Does not always achieve best representation
2. Systematic sampling
 Principle
 A unit drawn every k units
 Equal chance of being drawn for each unit
 Procedure
 Calculate sampling interval (k = N/n)
 Draw a random number (≤ k) for starting
 Draw every k units from first unit
 Advantages
 Ensures representativity across list
 Easy to implement
 Disadvantage
 Dangerous if list has cycles
Example of systematic sampling
Every eighth house is selected
3. Stratified sampling
 Principle
 Classify population into homogeneous subgroups (strata)
 Draw sample in each strata
 Combine results of all strata
 Advantage
 More precise if variable associated with strata
 All subgroups represented, allowing separate conclusions
about each of them
 Disadvantages
 Sampling error difficult to measure
 Loss of precision if small numbers sampled in individual strata
Example of stratified sampling
 Estimate vaccination coverage in a country
 One sample drawn in each region
 Estimates calculated for each stratum
 Each strata weighted to obtain estimate for country
4. Cluster sampling
 Principle
 Random sample of groups (“clusters”) of units
 All or proportion of units included selected clusters
 Advantages
 Simple: No list of units required
 Less travel/resources required
 Disadvantages
 Imprecise if clusters homogeneous (Large design effect)
 Sampling error difficult to measure
Cluster sampling
 The sampling unit is not a subject, but a group (cluster) of
subjects.
 It is assumed that:
 The variability among clusters is minimal
 The variability within each cluster is what is observed in the
general population
The two stages of a cluster sample
1.First stage: Probability proportional to size
 Select the number of clusters to be included
 Compute a cumulative list of the populations in each unit with a
grand total
 Divide the grand total by the number of clusters and obtain the
sampling interval
 Choose a random number and identify the first cluster
 Add the sampling interval and identify the second cluster
 By repeating the same procedure, identify all the clusters
2.Second stage
 In each cluster select a random sample using a sampling frame of
subjects (e.g. residents) or households
• In cluster sampling, the clusters are the primary sampling
unit (PSU’s) .
• Units within the clusters are the secondary sampling units
(SSU’s)
CLUSTER SAMPLING
Non- probability
Convenience
Quota
Purposive/
Judgment
Snowball
/Referral
NONPROBABILITY
SAMPLING (NPS)
CONVENIENCE SAMPLING
Class of 100 students
20 Students selected
as per convenience
Merit – useful in pilot studies.
Demerit – results usually
biased and unsatisfactory.
Quota
Formation
Interview 500
people
Personal
judgement
Radio listening
survey
60%
housewives
25%
farmers
15%
children
under age 15
300
125
75
500 people
QUOTA SAMPLING
Merit- Used in public
opinion studies
Demerit – personal
prejudice and bias
JUDGMENT SAMPLING
 Judgment/Purposive/Deliberate sampling.
 Depends exclusively on the judgment of investigator.
 Sample selected which investigator thinks to be most typical of
the universe.
 Merits
 Small no. of sampling units
 Urgent public policy & business decisions
 Demerits
*Personal prejudice & bias
*No objective way of evaluating reliability of results
JUDGMENT SAMPLING - EXAMPLE
CLASS OF 20
STUDENTS
Sample size for a
study= 8
JUDGMENT
SAMPLE OF 8
STUDENTS
SNOWBALL SAMPLING
 Used when the desired sample characteristic is rare
 When locating respondents is extremely difficult or cost
prohibitive.
 Snowball sampling relies on referrals from initial subjects to
generate additional subjects
 Merit
 For difficult to reach populations (other methods may not yield
any results).
 Demerit
 not representative of the population and will result in a biased
sample as it is self-selecting.
SNOWBALL SAMPLING - STEPS
 Make contact with one or two cases in the population.
 Ask these cases to identify further cases.
 Ask these new cases to identify further new cases.
 Stop when either no new cases are given or the sample is as large
as is manageable.
Key issues
 We cannot study the whole population so we sample it
 Taking a sample leads to sampling error, which is measurable
 Good design and quality assurance ensure validity and while
appropriate sample size will ensure precision
 Probability samples are the only one that allow use of statistics as
we know them
Types of Data
 Qualitative
 Nominal
Eg. Color of Eyes
 Ordinal
Eg. Stages of disease condition
 Quantitative
 Discrete
Eg. Family size
 Continuous
Eg. Height / Weight
Describe –Central Value
 Data is not information.
 Summarize
 Average
 Mean
 Median
 Mode
Arithmetic Mean (AM)
 Most commonly used; Simply called MEAN
 Add all the observed values(Sum=∑Xi)
 Mean =Sum / n
 Sample Mean is denoted by x̅
 Population Mean is denoted by μ
Example
 Age of 10 Pregnant women
 26, 31, 25, 21, 26, 26, 27, 25, 27, and 26
 Sum= (26+31+25+21+26+26+27+25+27+26) = 260
 n = 10
 Mean = sum / n= 260/10 = 26 years
Median
 The Median describes literally the middle value of the
distribution
 Divides the distribution exactly into two halves (i.e. 50% of
the data will fall on either side)
 Useful when there are extreme values
Example
 Duration (days) of hospital stay of 11 patients
 1, 2, 3, 4, 5, 6, 7, 8, 8, 9, 77 (Arranged in ascending order)
 Median is the middle value (6thvalue) = 6
 (Mean = 11.8)
 If n is even; then take average of middle two values.
Mode
 The Mode is the value that occurs most frequently
 Mode is the only location statistics to be used –for nominal
data -not measurable characteristic
 Epidemiology –Describing an epidemic with respect to
TIME
Example
 •Colour preference of people for their car
 Mode = Yellow
Colour preference No. of persons
Green 354
Yellow 852
White 310
Red 474
Describe –Dispersion
 Is it enough to know the average?
 Example of swimming pool.
 •Measures of variability
 Range
 Inter-quartile range
 Mean deviation from mean
 Variance / Standard deviation
RANGE
Definition:
 The difference between the Minimum and the Maximum value of
the observations
Advantage:
 A quick and easy indicator of dispersion.
Disadvantage:
 Influenced by extreme values; Uses only two data points
INTER-QUARTILE RANGE
Definition:
 Defined as the interval between the value of the upper quartile
(Q3) and the lower quartile (Q1)
 Inter Quartile Range = Q3–Q1
Advantage:
 Unaffected by the extreme values
Disadvantage:
 Covers only the middle 50% observations
INTER QUARTILE RANGE
 The range of a data is divided into four (25%)
MEAN DEVIATION
 Definition: The mean deviation is the average of the absolute
(ignoring the sign) deviations of the observations from the
arithmetic mean.
 Advantage: It is based on all the observations in the group. It is
easy to grasp the meaning of the procedure.
 Disadvantage: It ignores the sign of the difference of the value of
the observation and arithmetic mean.
 It is not widely used because of the availability of a more
advantageous measure.
STANDARD DEVIATION -SD (σ)
 Definition: The SD is the square root of the average of the squared
deviations of the observations from the arithmetic mean
 The square of the SD is called variance
 Advantage: The SD is the most important measure of distribution.
While the variance is in unit squared, the SD is expressed in the same
units of measurement as the observation. It is suitable for further
analysis
 The SD together with arithmetic mean is useful for description of the
data
Normal Distribution Curve
Coefficient of Variation (CV)
 Purpose: To compare the relative variability in different groups
 Definition: The coefficient of variation is the SD expressed as a
percentage of the arithmetic mean (AM).

 CV = SD x 100
AM
Summary
 Choose appropriate central / dispersion value
 Mean / SD –if no extreme values
 Median / IQR –if there are extreme values
 Mode /Range –for qualitative variables/ time distribution
in epidemic curve
 Mean and SD are used the most.
2.7.21 sampling methods data analysis

More Related Content

What's hot

Probabilistic & Non-Probability (Purposeful) Sampling
Probabilistic & Non-Probability (Purposeful) Sampling Probabilistic & Non-Probability (Purposeful) Sampling
Probabilistic & Non-Probability (Purposeful) Sampling Norhidayah Badrul Hisham
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statisticsBurak Mızrak
 
Proportions and Confidence Intervals in Biostatistics
Proportions and Confidence Intervals in BiostatisticsProportions and Confidence Intervals in Biostatistics
Proportions and Confidence Intervals in BiostatisticsHelpWithAssignment.com
 
Qualitative vs quantitative data - infographic
Qualitative vs quantitative data - infographicQualitative vs quantitative data - infographic
Qualitative vs quantitative data - infographicIntellspot
 
2.1 frequency distributions, histograms, and related topics
2.1 frequency distributions, histograms, and related topics2.1 frequency distributions, histograms, and related topics
2.1 frequency distributions, histograms, and related topicsleblance
 
Biostatistics Workshop: Missing Data
Biostatistics Workshop: Missing DataBiostatistics Workshop: Missing Data
Biostatistics Workshop: Missing DataHopkinsCFAR
 
Sampling techniques
Sampling techniquesSampling techniques
Sampling techniqueschetan1923
 
Sampling Design
Sampling DesignSampling Design
Sampling DesignJale Nonan
 
Stratified random sampling
Stratified random samplingStratified random sampling
Stratified random samplingwaiton sherekete
 
Sampling techniques
Sampling techniquesSampling techniques
Sampling techniquesMunibaMughal
 
Data collection methods
Data collection methodsData collection methods
Data collection methodsdramitmv14
 

What's hot (20)

Sec 3.1 measures of center
Sec 3.1 measures of center  Sec 3.1 measures of center
Sec 3.1 measures of center
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Probabilistic & Non-Probability (Purposeful) Sampling
Probabilistic & Non-Probability (Purposeful) Sampling Probabilistic & Non-Probability (Purposeful) Sampling
Probabilistic & Non-Probability (Purposeful) Sampling
 
Sec 1.3 collecting sample data
Sec 1.3 collecting sample data  Sec 1.3 collecting sample data
Sec 1.3 collecting sample data
 
Descriptive statistics -review(2)
Descriptive statistics -review(2)Descriptive statistics -review(2)
Descriptive statistics -review(2)
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Proportions and Confidence Intervals in Biostatistics
Proportions and Confidence Intervals in BiostatisticsProportions and Confidence Intervals in Biostatistics
Proportions and Confidence Intervals in Biostatistics
 
I. lecture sampling
I. lecture samplingI. lecture sampling
I. lecture sampling
 
Qualitative vs quantitative data - infographic
Qualitative vs quantitative data - infographicQualitative vs quantitative data - infographic
Qualitative vs quantitative data - infographic
 
2.1 frequency distributions, histograms, and related topics
2.1 frequency distributions, histograms, and related topics2.1 frequency distributions, histograms, and related topics
2.1 frequency distributions, histograms, and related topics
 
Sampling techniques
Sampling techniquesSampling techniques
Sampling techniques
 
Biostatistics Workshop: Missing Data
Biostatistics Workshop: Missing DataBiostatistics Workshop: Missing Data
Biostatistics Workshop: Missing Data
 
Sampling techniques
Sampling techniquesSampling techniques
Sampling techniques
 
Sampling Design
Sampling DesignSampling Design
Sampling Design
 
Stratified random sampling
Stratified random samplingStratified random sampling
Stratified random sampling
 
Histograms
HistogramsHistograms
Histograms
 
lfstat3e_ppt_01_rev.ppt
lfstat3e_ppt_01_rev.pptlfstat3e_ppt_01_rev.ppt
lfstat3e_ppt_01_rev.ppt
 
Sampling techniques
Sampling techniquesSampling techniques
Sampling techniques
 
Sampling Technique
Sampling TechniqueSampling Technique
Sampling Technique
 
Data collection methods
Data collection methodsData collection methods
Data collection methods
 

Similar to 2.7.21 sampling methods data analysis

Sampling and its variability
Sampling  and its variabilitySampling  and its variability
Sampling and its variabilityDrBhushan Kamble
 
Sampling methods 16
Sampling methods   16Sampling methods   16
Sampling methods 16Raj Selvam
 
Sampling techniques new
Sampling techniques newSampling techniques new
Sampling techniques newGeeta80373
 
Sampling techniques new
Sampling techniques newSampling techniques new
Sampling techniques newbabita jangra
 
Sampling
Sampling Sampling
Sampling ANCYBS
 
Session 5_Sampling strategy_Intake Dr Emmanuel.pdf
Session 5_Sampling strategy_Intake Dr Emmanuel.pdfSession 5_Sampling strategy_Intake Dr Emmanuel.pdf
Session 5_Sampling strategy_Intake Dr Emmanuel.pdfmuhirwaSamuel
 
probability and non-probability samplings
probability and non-probability samplingsprobability and non-probability samplings
probability and non-probability samplingsn1a2g3a4j5a6i7
 
sampling and statiscal inference
sampling and statiscal inferencesampling and statiscal inference
sampling and statiscal inferenceShruti MISHRA
 
Sampling techniques.pptx
Sampling techniques.pptxSampling techniques.pptx
Sampling techniques.pptxprajjwalgahlot
 
POPULATION AND SAMPLING.pptx
POPULATION AND SAMPLING.pptxPOPULATION AND SAMPLING.pptx
POPULATION AND SAMPLING.pptxMartMantilla1
 

Similar to 2.7.21 sampling methods data analysis (20)

Sampling techniques
Sampling techniquesSampling techniques
Sampling techniques
 
Sampling and its variability
Sampling  and its variabilitySampling  and its variability
Sampling and its variability
 
Sampling methods 16
Sampling methods   16Sampling methods   16
Sampling methods 16
 
Sampling techniques new
Sampling techniques newSampling techniques new
Sampling techniques new
 
Sampling techniques new
Sampling techniques newSampling techniques new
Sampling techniques new
 
Sampling design
Sampling designSampling design
Sampling design
 
Biostats in ortho
Biostats in orthoBiostats in ortho
Biostats in ortho
 
sampling.pptx
sampling.pptxsampling.pptx
sampling.pptx
 
Sampling
Sampling Sampling
Sampling
 
Session 5_Sampling strategy_Intake Dr Emmanuel.pdf
Session 5_Sampling strategy_Intake Dr Emmanuel.pdfSession 5_Sampling strategy_Intake Dr Emmanuel.pdf
Session 5_Sampling strategy_Intake Dr Emmanuel.pdf
 
sampling methods
sampling methodssampling methods
sampling methods
 
probability and non-probability samplings
probability and non-probability samplingsprobability and non-probability samplings
probability and non-probability samplings
 
Sample and sample size
Sample and sample sizeSample and sample size
Sample and sample size
 
sampling and statiscal inference
sampling and statiscal inferencesampling and statiscal inference
sampling and statiscal inference
 
Sampling techniques.pptx
Sampling techniques.pptxSampling techniques.pptx
Sampling techniques.pptx
 
Sampling techniques
Sampling techniquesSampling techniques
Sampling techniques
 
Sampling Techniques
Sampling TechniquesSampling Techniques
Sampling Techniques
 
Unit 3 Sampling
Unit 3 SamplingUnit 3 Sampling
Unit 3 Sampling
 
Basic statistics
Basic statisticsBasic statistics
Basic statistics
 
POPULATION AND SAMPLING.pptx
POPULATION AND SAMPLING.pptxPOPULATION AND SAMPLING.pptx
POPULATION AND SAMPLING.pptx
 

More from Ashish965416

vaginal hysterectomy gyane .pdf
vaginal hysterectomy gyane .pdfvaginal hysterectomy gyane .pdf
vaginal hysterectomy gyane .pdfAshish965416
 
proptosis case opthalmology.pdf
proptosis case opthalmology.pdfproptosis case opthalmology.pdf
proptosis case opthalmology.pdfAshish965416
 
rheumatology 7.pdf
rheumatology 7.pdfrheumatology 7.pdf
rheumatology 7.pdfAshish965416
 
rhematology 6- rheumatoid arthritis .pdf
rhematology 6- rheumatoid arthritis
.pdfrhematology 6- rheumatoid arthritis
.pdf
rhematology 6- rheumatoid arthritis .pdfAshish965416
 
fracture complications 2 .pdf
fracture complications 2 .pdffracture complications 2 .pdf
fracture complications 2 .pdfAshish965416
 
rheumatology 5 -inflammatory myopathies.pdf
 rheumatology 5 -inflammatory myopathies.pdf rheumatology 5 -inflammatory myopathies.pdf
rheumatology 5 -inflammatory myopathies.pdfAshish965416
 
Scleroderma rheumatology 4.pdf
Scleroderma rheumatology 4.pdfScleroderma rheumatology 4.pdf
Scleroderma rheumatology 4.pdfAshish965416
 
complications of fracture.pdf
complications of fracture.pdfcomplications of fracture.pdf
complications of fracture.pdfAshish965416
 
Development paediatric mbbs
Development paediatric mbbsDevelopment paediatric mbbs
Development paediatric mbbsAshish965416
 
Preoperative assessment
Preoperative assessmentPreoperative assessment
Preoperative assessmentAshish965416
 
Oral cancer pathology
Oral cancer pathologyOral cancer pathology
Oral cancer pathologyAshish965416
 
Lymphoma and splenomegaly pathology
Lymphoma and splenomegaly pathologyLymphoma and splenomegaly pathology
Lymphoma and splenomegaly pathologyAshish965416
 
Dic and vitamin k deficiency pathology
Dic and vitamin k deficiency pathologyDic and vitamin k deficiency pathology
Dic and vitamin k deficiency pathologyAshish965416
 
Beta blockers pharmacology
Beta blockers pharmacologyBeta blockers pharmacology
Beta blockers pharmacologyAshish965416
 

More from Ashish965416 (20)

vaginal hysterectomy gyane .pdf
vaginal hysterectomy gyane .pdfvaginal hysterectomy gyane .pdf
vaginal hysterectomy gyane .pdf
 
DnC.pptx
DnC.pptxDnC.pptx
DnC.pptx
 
proptosis case opthalmology.pdf
proptosis case opthalmology.pdfproptosis case opthalmology.pdf
proptosis case opthalmology.pdf
 
rheumatology 7.pdf
rheumatology 7.pdfrheumatology 7.pdf
rheumatology 7.pdf
 
rhematology 6- rheumatoid arthritis .pdf
rhematology 6- rheumatoid arthritis
.pdfrhematology 6- rheumatoid arthritis
.pdf
rhematology 6- rheumatoid arthritis .pdf
 
fracture complications 2 .pdf
fracture complications 2 .pdffracture complications 2 .pdf
fracture complications 2 .pdf
 
rheumatology 5 -inflammatory myopathies.pdf
 rheumatology 5 -inflammatory myopathies.pdf rheumatology 5 -inflammatory myopathies.pdf
rheumatology 5 -inflammatory myopathies.pdf
 
Scleroderma rheumatology 4.pdf
Scleroderma rheumatology 4.pdfScleroderma rheumatology 4.pdf
Scleroderma rheumatology 4.pdf
 
complications of fracture.pdf
complications of fracture.pdfcomplications of fracture.pdf
complications of fracture.pdf
 
SLE rheum 3.pdf
SLE rheum 3.pdfSLE rheum 3.pdf
SLE rheum 3.pdf
 
Rheumatology 2
Rheumatology 2Rheumatology 2
Rheumatology 2
 
Rhematology 1
Rhematology 1Rhematology 1
Rhematology 1
 
Development paediatric mbbs
Development paediatric mbbsDevelopment paediatric mbbs
Development paediatric mbbs
 
Preoperative assessment
Preoperative assessmentPreoperative assessment
Preoperative assessment
 
Git 2 pathology
Git 2 pathologyGit 2 pathology
Git 2 pathology
 
Oral cancer pathology
Oral cancer pathologyOral cancer pathology
Oral cancer pathology
 
Lymphoma and splenomegaly pathology
Lymphoma and splenomegaly pathologyLymphoma and splenomegaly pathology
Lymphoma and splenomegaly pathology
 
Dic and vitamin k deficiency pathology
Dic and vitamin k deficiency pathologyDic and vitamin k deficiency pathology
Dic and vitamin k deficiency pathology
 
Beta blockers pharmacology
Beta blockers pharmacologyBeta blockers pharmacology
Beta blockers pharmacology
 
Culture media
Culture mediaCulture media
Culture media
 

Recently uploaded

AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.arsicmarija21
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfUjwalaBharambe
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxEyham Joco
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 

Recently uploaded (20)

AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdf
 
Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptx
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 

2.7.21 sampling methods data analysis

  • 1.
  • 2.
  • 3. Definition of sampling  Procedure by which some members of the population are selected as representatives of the entire population
  • 4. Study population  The study population is the population to which the results of the study will be inferred The study population depends upon the research question: How many injections do people received each year in India?  Study population: Population of India How many needle-sticks health care workers experience each year in India?  Study population: Health care workers of India
  • 5. Sample The sample needs to be representative of the population in terms of 1. Time: Seasonality Day of the week Time of the day 2. Place: Urban Rural 3. Persons Age Sex Other demographic characteristics
  • 6. Definition of sampling terms  Sampling unit (Basic sampling unit, BSU)  Elementary unit that will be sampled  People  Health care workers  Hospitals  Sampling frame  List of all sampling units in the population  Sampling scheme  Method used to select sampling units from the sampling frame
  • 7. Sampling Why do we sample populations? • Obtain information from large populations • Ensure the efficiency of a study • Obtain more accurate information
  • 8. Example  The Ministry of Health of a country X wants to estimate the proportion of children in elementary schools who have been immunized against childhood infectious diseases  The task must be completed in one month  The objective is to estimate the proportion of immunized children
  • 9. Type of samples  Probability sampling  Random sampling  Removes the possibility of bias in selection of subjects  Ensures that each subject has a known probability of being chosen  Allows to draw valid conclusions about population  Non-probability samples  Probability of being selected is unknown  Convenience sampling  Subjective sampling  Time/resources constraints
  • 10. Sampling error  No sample is a perfect mirror image of the population  Magnitude of error can be measured in probability samples  Expressed by standard error of mean, proportion, differences…  Function of: Sample size Variability in measurement
  • 12. 1. Simple random sampling  Principle  Equal chance for each sampling unit  Procedure  Number all units  Randomly draw units  Advantages  Simple  Sampling error easily measured  Disadvantages  Need complete list of units  Does not always achieve best representation
  • 13. 2. Systematic sampling  Principle  A unit drawn every k units  Equal chance of being drawn for each unit  Procedure  Calculate sampling interval (k = N/n)  Draw a random number (≤ k) for starting  Draw every k units from first unit  Advantages  Ensures representativity across list  Easy to implement  Disadvantage  Dangerous if list has cycles
  • 14. Example of systematic sampling Every eighth house is selected
  • 15. 3. Stratified sampling  Principle  Classify population into homogeneous subgroups (strata)  Draw sample in each strata  Combine results of all strata  Advantage  More precise if variable associated with strata  All subgroups represented, allowing separate conclusions about each of them  Disadvantages  Sampling error difficult to measure  Loss of precision if small numbers sampled in individual strata
  • 16. Example of stratified sampling  Estimate vaccination coverage in a country  One sample drawn in each region  Estimates calculated for each stratum  Each strata weighted to obtain estimate for country
  • 17. 4. Cluster sampling  Principle  Random sample of groups (“clusters”) of units  All or proportion of units included selected clusters  Advantages  Simple: No list of units required  Less travel/resources required  Disadvantages  Imprecise if clusters homogeneous (Large design effect)  Sampling error difficult to measure
  • 18. Cluster sampling  The sampling unit is not a subject, but a group (cluster) of subjects.  It is assumed that:  The variability among clusters is minimal  The variability within each cluster is what is observed in the general population
  • 19. The two stages of a cluster sample 1.First stage: Probability proportional to size  Select the number of clusters to be included  Compute a cumulative list of the populations in each unit with a grand total  Divide the grand total by the number of clusters and obtain the sampling interval  Choose a random number and identify the first cluster  Add the sampling interval and identify the second cluster  By repeating the same procedure, identify all the clusters 2.Second stage  In each cluster select a random sample using a sampling frame of subjects (e.g. residents) or households
  • 20. • In cluster sampling, the clusters are the primary sampling unit (PSU’s) . • Units within the clusters are the secondary sampling units (SSU’s) CLUSTER SAMPLING
  • 22. CONVENIENCE SAMPLING Class of 100 students 20 Students selected as per convenience Merit – useful in pilot studies. Demerit – results usually biased and unsatisfactory.
  • 23. Quota Formation Interview 500 people Personal judgement Radio listening survey 60% housewives 25% farmers 15% children under age 15 300 125 75 500 people QUOTA SAMPLING Merit- Used in public opinion studies Demerit – personal prejudice and bias
  • 24. JUDGMENT SAMPLING  Judgment/Purposive/Deliberate sampling.  Depends exclusively on the judgment of investigator.  Sample selected which investigator thinks to be most typical of the universe.  Merits  Small no. of sampling units  Urgent public policy & business decisions  Demerits *Personal prejudice & bias *No objective way of evaluating reliability of results
  • 25. JUDGMENT SAMPLING - EXAMPLE CLASS OF 20 STUDENTS Sample size for a study= 8 JUDGMENT SAMPLE OF 8 STUDENTS
  • 26. SNOWBALL SAMPLING  Used when the desired sample characteristic is rare  When locating respondents is extremely difficult or cost prohibitive.  Snowball sampling relies on referrals from initial subjects to generate additional subjects  Merit  For difficult to reach populations (other methods may not yield any results).  Demerit  not representative of the population and will result in a biased sample as it is self-selecting.
  • 27. SNOWBALL SAMPLING - STEPS  Make contact with one or two cases in the population.  Ask these cases to identify further cases.  Ask these new cases to identify further new cases.  Stop when either no new cases are given or the sample is as large as is manageable.
  • 28. Key issues  We cannot study the whole population so we sample it  Taking a sample leads to sampling error, which is measurable  Good design and quality assurance ensure validity and while appropriate sample size will ensure precision  Probability samples are the only one that allow use of statistics as we know them
  • 29.
  • 30. Types of Data  Qualitative  Nominal Eg. Color of Eyes  Ordinal Eg. Stages of disease condition  Quantitative  Discrete Eg. Family size  Continuous Eg. Height / Weight
  • 31. Describe –Central Value  Data is not information.  Summarize  Average  Mean  Median  Mode
  • 32. Arithmetic Mean (AM)  Most commonly used; Simply called MEAN  Add all the observed values(Sum=∑Xi)  Mean =Sum / n  Sample Mean is denoted by x̅  Population Mean is denoted by μ
  • 33. Example  Age of 10 Pregnant women  26, 31, 25, 21, 26, 26, 27, 25, 27, and 26  Sum= (26+31+25+21+26+26+27+25+27+26) = 260  n = 10  Mean = sum / n= 260/10 = 26 years
  • 34. Median  The Median describes literally the middle value of the distribution  Divides the distribution exactly into two halves (i.e. 50% of the data will fall on either side)  Useful when there are extreme values
  • 35. Example  Duration (days) of hospital stay of 11 patients  1, 2, 3, 4, 5, 6, 7, 8, 8, 9, 77 (Arranged in ascending order)  Median is the middle value (6thvalue) = 6  (Mean = 11.8)  If n is even; then take average of middle two values.
  • 36. Mode  The Mode is the value that occurs most frequently  Mode is the only location statistics to be used –for nominal data -not measurable characteristic  Epidemiology –Describing an epidemic with respect to TIME
  • 37. Example  •Colour preference of people for their car  Mode = Yellow Colour preference No. of persons Green 354 Yellow 852 White 310 Red 474
  • 38. Describe –Dispersion  Is it enough to know the average?  Example of swimming pool.  •Measures of variability  Range  Inter-quartile range  Mean deviation from mean  Variance / Standard deviation
  • 39. RANGE Definition:  The difference between the Minimum and the Maximum value of the observations Advantage:  A quick and easy indicator of dispersion. Disadvantage:  Influenced by extreme values; Uses only two data points
  • 40. INTER-QUARTILE RANGE Definition:  Defined as the interval between the value of the upper quartile (Q3) and the lower quartile (Q1)  Inter Quartile Range = Q3–Q1 Advantage:  Unaffected by the extreme values Disadvantage:  Covers only the middle 50% observations
  • 41. INTER QUARTILE RANGE  The range of a data is divided into four (25%)
  • 42. MEAN DEVIATION  Definition: The mean deviation is the average of the absolute (ignoring the sign) deviations of the observations from the arithmetic mean.  Advantage: It is based on all the observations in the group. It is easy to grasp the meaning of the procedure.  Disadvantage: It ignores the sign of the difference of the value of the observation and arithmetic mean.  It is not widely used because of the availability of a more advantageous measure.
  • 43. STANDARD DEVIATION -SD (σ)  Definition: The SD is the square root of the average of the squared deviations of the observations from the arithmetic mean  The square of the SD is called variance  Advantage: The SD is the most important measure of distribution. While the variance is in unit squared, the SD is expressed in the same units of measurement as the observation. It is suitable for further analysis  The SD together with arithmetic mean is useful for description of the data
  • 45. Coefficient of Variation (CV)  Purpose: To compare the relative variability in different groups  Definition: The coefficient of variation is the SD expressed as a percentage of the arithmetic mean (AM).   CV = SD x 100 AM
  • 46. Summary  Choose appropriate central / dispersion value  Mean / SD –if no extreme values  Median / IQR –if there are extreme values  Mode /Range –for qualitative variables/ time distribution in epidemic curve  Mean and SD are used the most.