SlideShare a Scribd company logo
1 of 61
Instructor
Morteza Maleki PhD
2
 It may be possible to collect and analyse data from every possible case or
group member; this is termed a census.
 Yet, it will be impossible for you either to collect or to analyse all the data available
to you owing to restrictions of time, money and often access.
 Sampling techniques provide a range of methods that enable you to reduce the
amount of data you need to collect by considering only data from a subgroup
rather than all possible cases or elements.
 Some research questions will require sample data to generalise about all the cases
from which your sample has been selected.
 Other research questions may not involve such generalisations.
3
4
 The full set of cases from which a sample is taken is called the population.
 In sampling, the term ‘population’ is not used in its normal sense, as the full set of
cases need not necessarily be people.
 For research to discover relative levels of service at burger bars throughout a
country, the population from which you would select your sample would be all
burger bars in that country.
5
The need to sample
 Sampling provides a valid alternative to a census when:
1. It would be impracticable for you to survey the entire population;
2. Your budget constraints prevent you from surveying the entire population;
3. Your time constraints prevent you from surveying the entire population;
4. You have collected all the data but need the results quickly.
 For all research questions where it would be impracticable for you to collect
data from the entire population, you need to select a sample.
 Moreover, testing an entire population of products to destruction, such as to
establish the crash protection provided by cars, would be impractical for any
manufacturer.
6
 Using sampling makes possible a higher accuracy than a census (Henry,
1990).
1. The smaller number of cases for which you need to collect data means that more
time can be spent designing and piloting the means of collecting these data.
2. Collecting data from fewer cases also means that you can collect information that is
more detailed.
3. If you are employing people to collect the data (perhaps as interviewers) you can
use higher-quality staff.
4. You also can devote more time to trying to obtain data from the more difficult
cases.
5. Proportionally more time can be devoted to checking and testing the data for
accuracy prior to analysis, after your data have been collected.
7
An overview of sampling techniques
 The sampling techniques available to you can be divided into two types:
1. Probability or representative sampling;
2. Non-probability or judgmental sampling
8
 With probability samples the chance, or probability, of each case being selected
from the population is known and is either equal or unequal for all cases.
 This means that it is possible to answer research questions and to achieve objectives
that require you to estimate statistically the characteristics of the population from
the sample.
 Hence, probability sampling is often associated with survey and experimental
research strategies.
9
 For non-probability samples, the probability of each case being selected from
the total population is not known
 It is impossible to answer research questions or to address objectives that require
you to make statistical inferences about the characteristics of the population.
 You may still be able to generalise from non-probability samples about the
population, but not on statistical grounds.
 For this reason, non-probability sampling (other than quota sampling) is more
frequently used when adopting a case study strategy
10
11
 Probability sampling (or representative sampling) is most commonly associated
with survey-based research strategies where you need to make inferences from
your sample about a population.
 The process of probability sampling can be divided into four stages:
1. Identify a suitable sampling frame based on your research question(s) or objectives.
2. Decide on a suitable sample size.
3. Select the most appropriate sampling technique and select the sample.
4. Check that the sample is representative of the population.
 For populations of less than 50 cases, Henry (1990) advises against probability
sampling.
 You should collect data on the entire population as the influence of a single extreme
case on subsequent statistical analyses is more pronounced than for larger samples.
12
I. Identifying a suitable sampling frame and the implications for generalisability
 The sampling frame for any probability sample is a complete list of all the cases in the
population from which your sample will be drawn.
 Researchers then select their sample from this list.
 The completeness of the sampling frame is very important.
 An incomplete or inaccurate list means that some cases will have been excluded and so it will
be impossible for every case in the population to have a chance of selection.
 Then the sample may not be representative of the total population.
13
 Where no suitable list exists you will have to compile your own sampling frame,
perhaps drawing upon existing lists.
 It is important to ensure that the sampling frame is unbiased, current and accurate.
 The way you define your sampling frame also has implications regarding the
extent to which you can generalise from your sample.
 Within probability sampling, by defining the sampling frame you are defining the
population about which you want to generalise.
 Organisations that specialise in selling lists of names, addresses and email
addresses.
14
15
II. Deciding on a suitable sample size
 The larger your sample’s size, the lower the likely error in generalising to the
population.
 Probability sampling is therefore a compromise between the accuracy of your
findings and the amount of time and money you invest in collecting, checking and
analysing the data.
16
 The choice of sample size is governed by:
1. The confidence you need to have in your data
 The level of certainty that the characteristics of the data collected will represent the
characteristics of the total population;
2. Margin of error that you can tolerate
 The accuracy you require for any estimates made from your sample;
3. The types of analyses you are going to undertake
 In particular the number of categories into which you wish to subdivide your data, as many
statistical techniques have a minimum threshold of data cases for each cell
 and to a lesser extent:
 the size of the total population from which your sample is being drawn.
 Note: the final sample size is almost always a matter of judgment as well as of
calculation.
17
 The larger the absolute size of a sample, the more closely its distribution will be to
the normal distribution and thus the more robust it will be.
 This relationship, known as the central limit theorem, occurs even if the
population from which the sample is drawn is not normally distributed.
 A sample size of 30 or more will usually result in a sampling distribution for the
mean that is very close to a normal distribution.
 Stutely’s (2003) advice of a minimum number of 30 for statistical analyses provides a
useful rule of thumb for the smallest number in each category within your overall
sample.
18
 The process of coming up with conclusions about a population on the basis of
data describing the sample is called statistical inference.
 It allows you to calculate how probable it is that your result, given your sample size,
could have been obtained by chance.
 Providing they are not biased, samples of larger absolute size are more likely to
be representative of the population from which they are drawn than smaller
samples and
 The mean (average) calculated for the sample is more likely to equal the mean for
the population.
 This is known as the law of large numbers.
19
 Researchers normally work to a 95 per cent level of certainty.
 This means that if your sample was selected 100 times, at least 95 of these samples
would be certain to represent the characteristics of the population.
 The margin of error describes the precision of your estimates of the population.
20
21
 The smaller the absolute size of the sample and, to a far lesser extent, the smaller
the relative proportion of the total population sampled, the greater the margin of
error.
 Within this, the impact of absolute sample size on the margin of error decreases for
larger sample sizes.
 deVaus (2002) argues that it is for this reason that many market research
companies limit their samples’ sizes to approximately 2000.
22
The Importance of a High Response Rate
 A perfect representative sample is one that exactly represents the population
from which it is taken.
 You therefore need to obtain as high a response rate as possible to ensure that
your sample is representative.
 In reality, you are likely to have non-responses.
 They have refused to be involved in your research for whatever reason.
 Your respondents will not be representative of the total population, and the data
you collect may be biased.
 Any non-responses will necessitate extra respondents being found to reach the
required sample size, thereby increasing the cost of your data collection.
23
 Nonresponse is due to four interrelated problems:
1. Refusal to respond;
2. Ineligibility to respond;
3. Inability to locate respondent;
4. Respondent located but unable to make contact.
 As part of your research report, you will need to include your response rate.
 Neumann (2000) suggests that when you calculate this you should include all
eligible respondents:
 This he calls the total response rate.
24
 A more common way of doing this excludes ineligible respondents and those who,
despite repeated attempts were unreachable.
 This is known as the active response rate:
25
III. Selecting the most appropriate sampling technique and the sample
 Five main techniques can be used to select a probability sample:
1. Simple random;
2. Systematic;
3. Stratified Random;
4. Cluster;
5. Multi-stage.
26
27
28
1. Simple random sampling
 It involves selecting the sample at random from the sampling frame using
either random number tables or a computer.
1. Number each of the cases in your sampling frame with a unique number. The
first case is numbered 0, the second 1 and so on.
2. Select cases using random numbers until your actual sample size is reached.
 If the same number is read off a second time it must be disregarded as you
need different cases.
 Not putting each case’s number back into the sampling frame after it has been
selected - termed sampling without replacement.
 Use a computer program such as a spreadsheet to generate random numbers.
29
 Random numbers allow you to select your sample without bias.
 The sample selected can be said to be representative of the whole population.
 Simple random sampling is best used when you have an accurate and easily
accessible sampling frame that lists the entire population, preferably stored on a
computer.
30
2. Systematic sampling
 Systematic sampling involves you selecting the sample at regular intervals
from the sampling frame; to do this you:
1. Number each of the cases in your sampling frame with a unique number.
 The first case is numbered 0, the second 1 and so on.
2. Select the first case using a random number.
3. Calculate the sampling fraction.
4. Select subsequent cases systematically using the sampling fraction to determine
the frequency of selection.
31
 To calculate the sampling fraction – the proportion of the total population that
you need to select – you use the formula
 If your sampling fraction is 1⁄3 you need to select one in every three cases – that
is, every third case from the sampling frame.
 Selecting one in every three would not be random as every third case would be bound to
be selected, whereas those between would have no chance of selection.
 A random number is used to decide where to start on the sampling frame.
32
3. Stratified random sampling
 Stratified random sampling is a modification of random sampling
 You divide the population into two or more relevant and significant strata based
on one or a number of attributes.
 The sampling frame is divided into a number of subsets.
 A random sample (simple or systematic) is then drawn from each of the strata.
33
 Stratified sampling shares many of the advantages and disadvantages of simple
random or systematic sampling.
 Dividing the population into a series of relevant strata means that the sample is
more likely to be representative.
 You can ensure that each of the strata is represented proportionally within your sample.
 Yet, it is only possible to do this if you are aware of, and can easily distinguish,
significant strata in your sampling frame.
 Moreover, the extra stage in the sampling procedure means that it is likely to take
longer, to be more expensive, and to be more difficult to explain than simple
random or systematic sampling.
34
 Some sampling frame will already be divided into strata (deVaus, 2002).
 For example, membership lists that are ordered by date of joining will automatically
result in stratification by length of membership if systematic sampling is used.
 Or, a sampling frame of employee names that is in alphabetical order will
automatically ensure that employees will be sampled in the correct proportion to
the letter with which their name begins if systematic sampling is used.
 If you are using simple random sampling or your sampling frame contains
periodic patterns, you will need to stratify it.
35
 Steps to Stratify a Sampling Frame:
1) Choose the stratification variable or variables.
2) Divide the sampling frame into the discrete strata.
3) Number each of the cases within each stratum with a unique number.
4) Select your sample using either simple random or systematic sampling.
 The stratification variable (or variables) chosen should represent the discrete
characteristic (or characteristics) for which you want to ensure correct
representation within the sample.
36
 Samples can be stratified using more than one characteristic.
 You may wish to stratify a sample of an organization's employees by both department
and salary grade:
1) Divide the sampling frame into the discrete departments.
2) Within each department divide the sampling frame into discrete salary grades.
3) Number each of the cases within each salary grade within each department with a
unique number.
4) Select your sample using either simple random or systematic sampling.
NOTE:
 The relative sizes of different strata mean that, in order to have sufficient data for
analysis, you need to select larger samples from the strata with smaller populations.
37
4. Cluster sampling
 Cluster sampling is similar to stratified sampling as you need to divide the
population into discrete groups prior to sampling (Henry, 1990).
 The groups are termed clusters in this form of sampling and can be based on any
naturally occurring grouping.
 For example, you could group your data by type of manufacturing firm or
geographical area.
 For cluster sampling your sampling frame is the complete list of clusters rather than
a complete list of individual cases within the population.
 You then select a few clusters, normally using simple random sampling.
 Data are then collected from every case within the selected clusters.
38
 The cluster sampling technique has three main stages:
1) Choose the cluster grouping for your sampling frame.
2) Number each of the clusters with a unique number.
 The first cluster is numbered 0, the second 1 and so on.
3) Select your sample using some form of random sampling.
 Selecting clusters randomly makes cluster sampling a probability sampling
technique.
39
5. Multi-stage sampling
 Multi-stage sampling - multi-stage cluster sampling - is a development of
cluster sampling.
 It is normally used to overcome problems associated with a geographically
dispersed population when face-to-face contact is needed or where it is
expensive and time consuming to construct a sampling frame for a large
geographical area.
 The technique involves taking a series of cluster samples, each involving
some form of random sampling.
 It can be divided into four phases.
 Because multi-stage sampling relies on a series of different sampling frames, you
need to ensure that they are all appropriate and available.
40
41
 Probability techniques all are based on the assumption that your sample will be
chosen statistically at random.
 It is possible to specify the probability that any case will be included in the sample.
 Within business research, such as market surveys and case study research, this
may either not be possible (as you do not have a sampling frame) or appropriate
to answering your research question.
 Non-probability sampling (or non-random sampling) techniques are based on
the researcher’s subjective judgment.
42
 To answer your research questions(s) and to meet your objectives, you may need
to undertake an in-depth study that focuses on a small, perhaps one, case selected
for a particular purpose.
 This sample would provide you with an information-rich case study in which
you explore your research question.
 Moreover, Limited resources or the inability to specify a sampling frame may
dictate the use of one or a number of non-probability sampling techniques.
43
44
45
1) Quota sampling
 Quota sampling is entirely non-random and is normally used for interview
surveys.
 It is based on the premise that your sample will represent the population as the
variability in your sample for various quota variables is the same as that in the
population.
 Quota sampling is a type of stratified sample in which selection of cases
within strata is entirely non-random (Barnett, 1991).
46
 To select a quota sample you:
1. Divide the population into specific groups.
2. Calculate a quota for each group based on relevant and available data.
3. Give each interviewer an assignment, which states the number of cases in each
quota from which they must collect data.
4. Combine the data collected by interviewers to provide the full sample.
47
Advantages of Quota Sampling over Random Sampling
1. It is less costly and can be set up very quickly.
2. It does not require a sampling frame and may be the only technique you can
use if one is not available.
 Quota sampling is normally used for large populations.
 For small populations it is usually possible to obtain a sampling frame.
 Decisions on sample size are governed by the need to have sufficient responses in
each quota to enable subsequent statistical analyses to be undertaken.
48
 Calculations of quotas are based on relevant and available data and are usually
relative to the proportions in which they occur in the population.
 Without sensible and relevant quotas, data collected may be biased.
 For many market research projects, quotas are derived from census data.
 Your choice of quota is dependent on two main factors:
1. Usefulness as a means of stratifying the data;
2. Ability to overcome likely variations between groups in their availability for
interview.
49
2) Purposive / Judgmental sampling
 Use your judgment to select cases.
 Used when working with very small samples such as in case study research.
 To select cases that are particularly informative (Neuman, 2000).
 Used by researchers adopting the grounded theory strategy.
 Cannot be considered to be statistically representative of the total population.
50
51
1) Extreme Case or Deviant Sampling
 It focuses on unusual or special cases
 The data collected about these unusual or extreme outcomes will enable you
to learn the most and to answer your research question(s) and to meet your
objectives most effectively.
 Findings from extreme cases will be relevant in understanding or explaining
more typical cases (Patton, 2002).
52
2) Heterogeneous or Maximum Variation Sampling
 It enables you to collect data to describe and explain the key themes that can
be observed.
 Any patterns that do emerge are likely to be of particular interest and value
and represent the key themes.
 The data collected should enable you to document uniqueness.
 To ensure maximum variation within a sample Patton (2002) suggests you
identify your diverse characteristics (sample selection criteria) prior to
selecting your sample.
53
3) Homogeneous or Minimum Variation Sampling
 It focuses on one particular subgroup in which all the sample members are
similar.
 This enables you to study the group in great depth.
54
4) Critical Case Sampling
 It selects critical cases on the basis that they can make a point dramatically or
because they are important.
 The focus of data collection is to understand what is happening in each
critical case so that logical generalisations can be made.
 Patton (2002) outlines a number of clues that suggest critical cases:
1) If it happens there, will it happen everywhere?
2) If they are having problems, can you be sure that everyone will have problems?
3) If they cannot understand the process, is it likely that no one will be able to
understand the process?
55
5) Typical Case Sampling
 It is usually used as part of a research project to provide an illustrative profile
using a representative case.
 Such a sample enables you to provide an illustration of what is ‘typical’ to
those who will be reading your research report and may be unfamiliar with
the subject matter.
 It is not intended to be definitive.
56
3) Snowball sampling
 Snowball sampling is commonly used when it is difficult to identify members
of the desired population, for example people who are working while claiming
unemployment benefit.
 Researchers need to:
1) Make contact with one or two cases in the population.
2) Ask these cases to identify further cases.
3) Ask these new cases to identify further new cases (and so on).
4) Stop when either no new cases are given or the sample is as large as is
manageable.
 The main problem is making initial contact.
57
 For such samples the problems of bias are huge, as respondents are most likely
to identify other potential respondents who are similar to themselves, resulting in a
homogeneous sample (Lee, 1993).
 The next problem is to find these new cases.
58
4) Self-selection sampling
 It occurs when you allow each case, usually individuals, to identify their
desire to take part in the research.
 You therefore:
1. Publicize your need for cases, either by advertising through appropriate
media or by asking them to take part.
 Articles and advertisements in magazines that the population are likely to read,
postings on appropriate Internet newsgroups and discussion groups, hyperlinks
from other websites as well as letters or emails of invitation to colleagues and
friends.
2. Collect data from those who respond.
59
5) Convenience sampling or haphazard sampling
 It involves selecting haphazardly those cases that are easiest to obtain for
your sample, such as the person interviewed at random in a shopping centre
for a television programme.
 The sample selection process is continued until your required sample size
has been reached.
 It is prone to bias and influences that are beyond your control, as the cases
appear in the sample only because of the ease of obtaining them.
60
The End
61

More Related Content

Similar to How to Select a Probability Sample

Sampling method son research methodology
Sampling method son research methodologySampling method son research methodology
Sampling method son research methodologyLevisMithamo
 
sampling
samplingsampling
samplingkamalu4
 
audit sampling notes
audit sampling notesaudit sampling notes
audit sampling notesstudent
 
sampling method techniques of engineers.pptx
sampling method techniques of engineers.pptxsampling method techniques of engineers.pptx
sampling method techniques of engineers.pptxMustafa580017
 
Sampling design 1216114348242957-8
Sampling design 1216114348242957-8Sampling design 1216114348242957-8
Sampling design 1216114348242957-8rgwax
 
Unit 6 sampling techniques
Unit 6 sampling techniquesUnit 6 sampling techniques
Unit 6 sampling techniquesAsima shahzadi
 
Surveys & Questionnaires
Surveys & QuestionnairesSurveys & Questionnaires
Surveys & Questionnairesnncygarcia
 
Types of research design, sampling methods & data collection
Types of research design, sampling methods & data collectionTypes of research design, sampling methods & data collection
Types of research design, sampling methods & data collectionBipin Koirala
 
Sampling Design in Applied Marketing Research
Sampling Design in Applied Marketing ResearchSampling Design in Applied Marketing Research
Sampling Design in Applied Marketing ResearchKelly Page
 
Non probability sampling
Non  probability samplingNon  probability sampling
Non probability samplingcorayu13
 

Similar to How to Select a Probability Sample (20)

Sampling method son research methodology
Sampling method son research methodologySampling method son research methodology
Sampling method son research methodology
 
Chapter5.ppt
Chapter5.pptChapter5.ppt
Chapter5.ppt
 
sampling
samplingsampling
sampling
 
Chapter5
Chapter5Chapter5
Chapter5
 
5. Selecting samples.pdf
5. Selecting samples.pdf5. Selecting samples.pdf
5. Selecting samples.pdf
 
Fundamental of sampling
Fundamental of samplingFundamental of sampling
Fundamental of sampling
 
Sampling Methods.pptx
Sampling Methods.pptxSampling Methods.pptx
Sampling Methods.pptx
 
SAMPLING.pptx
SAMPLING.pptxSAMPLING.pptx
SAMPLING.pptx
 
audit sampling notes
audit sampling notesaudit sampling notes
audit sampling notes
 
sampling method techniques of engineers.pptx
sampling method techniques of engineers.pptxsampling method techniques of engineers.pptx
sampling method techniques of engineers.pptx
 
Sampling methods
Sampling methodsSampling methods
Sampling methods
 
Sampling design 1216114348242957-8
Sampling design 1216114348242957-8Sampling design 1216114348242957-8
Sampling design 1216114348242957-8
 
Unit 6 sampling techniques
Unit 6 sampling techniquesUnit 6 sampling techniques
Unit 6 sampling techniques
 
Surveys & Questionnaires
Surveys & QuestionnairesSurveys & Questionnaires
Surveys & Questionnaires
 
Types of research design, sampling methods & data collection
Types of research design, sampling methods & data collectionTypes of research design, sampling methods & data collection
Types of research design, sampling methods & data collection
 
Sampling Design in Applied Marketing Research
Sampling Design in Applied Marketing ResearchSampling Design in Applied Marketing Research
Sampling Design in Applied Marketing Research
 
Sampling techniques
Sampling techniquesSampling techniques
Sampling techniques
 
Sampling
Sampling Sampling
Sampling
 
Mr1480.ch2
Mr1480.ch2Mr1480.ch2
Mr1480.ch2
 
Non probability sampling
Non  probability samplingNon  probability sampling
Non probability sampling
 

Recently uploaded

Day 0- Bootcamp Roadmap for PLC Bootcamp
Day 0- Bootcamp Roadmap for PLC BootcampDay 0- Bootcamp Roadmap for PLC Bootcamp
Day 0- Bootcamp Roadmap for PLC BootcampPLCLeadershipDevelop
 
Reflecting, turning experience into insight
Reflecting, turning experience into insightReflecting, turning experience into insight
Reflecting, turning experience into insightWayne Abrahams
 
crisiscommunication-presentation in crisis management.pptx
crisiscommunication-presentation in crisis management.pptxcrisiscommunication-presentation in crisis management.pptx
crisiscommunication-presentation in crisis management.pptxSamahhassan30
 
LPC Warehouse Management System For Clients In The Business Sector
LPC Warehouse Management System For Clients In The Business SectorLPC Warehouse Management System For Clients In The Business Sector
LPC Warehouse Management System For Clients In The Business Sectorthomas851723
 
CALL ON ➥8923113531 🔝Call Girls Charbagh Lucknow best sexual service
CALL ON ➥8923113531 🔝Call Girls Charbagh Lucknow best sexual serviceCALL ON ➥8923113531 🔝Call Girls Charbagh Lucknow best sexual service
CALL ON ➥8923113531 🔝Call Girls Charbagh Lucknow best sexual serviceanilsa9823
 
Fifteenth Finance Commission Presentation
Fifteenth Finance Commission PresentationFifteenth Finance Commission Presentation
Fifteenth Finance Commission Presentationmintusiprd
 
CEO of Google, Sunder Pichai's biography
CEO of Google, Sunder Pichai's biographyCEO of Google, Sunder Pichai's biography
CEO of Google, Sunder Pichai's biographyHafizMuhammadAbdulla5
 
Introduction to LPC - Facility Design And Re-Engineering
Introduction to LPC - Facility Design And Re-EngineeringIntroduction to LPC - Facility Design And Re-Engineering
Introduction to LPC - Facility Design And Re-Engineeringthomas851723
 
Board Diversity Initiaive Launch Presentation
Board Diversity Initiaive Launch PresentationBoard Diversity Initiaive Launch Presentation
Board Diversity Initiaive Launch Presentationcraig524401
 
LPC Operations Review PowerPoint | Operations Review
LPC Operations Review PowerPoint | Operations ReviewLPC Operations Review PowerPoint | Operations Review
LPC Operations Review PowerPoint | Operations Reviewthomas851723
 
VIP Kolkata Call Girl Rajarhat 👉 8250192130 Available With Room
VIP Kolkata Call Girl Rajarhat 👉 8250192130  Available With RoomVIP Kolkata Call Girl Rajarhat 👉 8250192130  Available With Room
VIP Kolkata Call Girl Rajarhat 👉 8250192130 Available With Roomdivyansh0kumar0
 
Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...
Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...
Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...Pooja Nehwal
 
ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...
ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...
ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...AgileNetwork
 

Recently uploaded (17)

Day 0- Bootcamp Roadmap for PLC Bootcamp
Day 0- Bootcamp Roadmap for PLC BootcampDay 0- Bootcamp Roadmap for PLC Bootcamp
Day 0- Bootcamp Roadmap for PLC Bootcamp
 
Reflecting, turning experience into insight
Reflecting, turning experience into insightReflecting, turning experience into insight
Reflecting, turning experience into insight
 
crisiscommunication-presentation in crisis management.pptx
crisiscommunication-presentation in crisis management.pptxcrisiscommunication-presentation in crisis management.pptx
crisiscommunication-presentation in crisis management.pptx
 
LPC Warehouse Management System For Clients In The Business Sector
LPC Warehouse Management System For Clients In The Business SectorLPC Warehouse Management System For Clients In The Business Sector
LPC Warehouse Management System For Clients In The Business Sector
 
CALL ON ➥8923113531 🔝Call Girls Charbagh Lucknow best sexual service
CALL ON ➥8923113531 🔝Call Girls Charbagh Lucknow best sexual serviceCALL ON ➥8923113531 🔝Call Girls Charbagh Lucknow best sexual service
CALL ON ➥8923113531 🔝Call Girls Charbagh Lucknow best sexual service
 
Fifteenth Finance Commission Presentation
Fifteenth Finance Commission PresentationFifteenth Finance Commission Presentation
Fifteenth Finance Commission Presentation
 
CEO of Google, Sunder Pichai's biography
CEO of Google, Sunder Pichai's biographyCEO of Google, Sunder Pichai's biography
CEO of Google, Sunder Pichai's biography
 
Introduction to LPC - Facility Design And Re-Engineering
Introduction to LPC - Facility Design And Re-EngineeringIntroduction to LPC - Facility Design And Re-Engineering
Introduction to LPC - Facility Design And Re-Engineering
 
Board Diversity Initiaive Launch Presentation
Board Diversity Initiaive Launch PresentationBoard Diversity Initiaive Launch Presentation
Board Diversity Initiaive Launch Presentation
 
LPC Operations Review PowerPoint | Operations Review
LPC Operations Review PowerPoint | Operations ReviewLPC Operations Review PowerPoint | Operations Review
LPC Operations Review PowerPoint | Operations Review
 
sauth delhi call girls in Defence Colony🔝 9953056974 🔝 escort Service
sauth delhi call girls in Defence Colony🔝 9953056974 🔝 escort Servicesauth delhi call girls in Defence Colony🔝 9953056974 🔝 escort Service
sauth delhi call girls in Defence Colony🔝 9953056974 🔝 escort Service
 
Becoming an Inclusive Leader - Bernadette Thompson
Becoming an Inclusive Leader - Bernadette ThompsonBecoming an Inclusive Leader - Bernadette Thompson
Becoming an Inclusive Leader - Bernadette Thompson
 
VIP Kolkata Call Girl Rajarhat 👉 8250192130 Available With Room
VIP Kolkata Call Girl Rajarhat 👉 8250192130  Available With RoomVIP Kolkata Call Girl Rajarhat 👉 8250192130  Available With Room
VIP Kolkata Call Girl Rajarhat 👉 8250192130 Available With Room
 
Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...
Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...
Pooja Mehta 9167673311, Trusted Call Girls In NAVI MUMBAI Cash On Payment , V...
 
Call Girls Service Tilak Nagar @9999965857 Delhi 🫦 No Advance VVIP 🍎 SERVICE
Call Girls Service Tilak Nagar @9999965857 Delhi 🫦 No Advance  VVIP 🍎 SERVICECall Girls Service Tilak Nagar @9999965857 Delhi 🫦 No Advance  VVIP 🍎 SERVICE
Call Girls Service Tilak Nagar @9999965857 Delhi 🫦 No Advance VVIP 🍎 SERVICE
 
Rohini Sector 16 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 16 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 16 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 16 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
 
ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...
ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...
ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...
 

How to Select a Probability Sample

  • 2. 2
  • 3.  It may be possible to collect and analyse data from every possible case or group member; this is termed a census.  Yet, it will be impossible for you either to collect or to analyse all the data available to you owing to restrictions of time, money and often access.  Sampling techniques provide a range of methods that enable you to reduce the amount of data you need to collect by considering only data from a subgroup rather than all possible cases or elements.  Some research questions will require sample data to generalise about all the cases from which your sample has been selected.  Other research questions may not involve such generalisations. 3
  • 4. 4
  • 5.  The full set of cases from which a sample is taken is called the population.  In sampling, the term ‘population’ is not used in its normal sense, as the full set of cases need not necessarily be people.  For research to discover relative levels of service at burger bars throughout a country, the population from which you would select your sample would be all burger bars in that country. 5
  • 6. The need to sample  Sampling provides a valid alternative to a census when: 1. It would be impracticable for you to survey the entire population; 2. Your budget constraints prevent you from surveying the entire population; 3. Your time constraints prevent you from surveying the entire population; 4. You have collected all the data but need the results quickly.  For all research questions where it would be impracticable for you to collect data from the entire population, you need to select a sample.  Moreover, testing an entire population of products to destruction, such as to establish the crash protection provided by cars, would be impractical for any manufacturer. 6
  • 7.  Using sampling makes possible a higher accuracy than a census (Henry, 1990). 1. The smaller number of cases for which you need to collect data means that more time can be spent designing and piloting the means of collecting these data. 2. Collecting data from fewer cases also means that you can collect information that is more detailed. 3. If you are employing people to collect the data (perhaps as interviewers) you can use higher-quality staff. 4. You also can devote more time to trying to obtain data from the more difficult cases. 5. Proportionally more time can be devoted to checking and testing the data for accuracy prior to analysis, after your data have been collected. 7
  • 8. An overview of sampling techniques  The sampling techniques available to you can be divided into two types: 1. Probability or representative sampling; 2. Non-probability or judgmental sampling 8
  • 9.  With probability samples the chance, or probability, of each case being selected from the population is known and is either equal or unequal for all cases.  This means that it is possible to answer research questions and to achieve objectives that require you to estimate statistically the characteristics of the population from the sample.  Hence, probability sampling is often associated with survey and experimental research strategies. 9
  • 10.  For non-probability samples, the probability of each case being selected from the total population is not known  It is impossible to answer research questions or to address objectives that require you to make statistical inferences about the characteristics of the population.  You may still be able to generalise from non-probability samples about the population, but not on statistical grounds.  For this reason, non-probability sampling (other than quota sampling) is more frequently used when adopting a case study strategy 10
  • 11. 11
  • 12.  Probability sampling (or representative sampling) is most commonly associated with survey-based research strategies where you need to make inferences from your sample about a population.  The process of probability sampling can be divided into four stages: 1. Identify a suitable sampling frame based on your research question(s) or objectives. 2. Decide on a suitable sample size. 3. Select the most appropriate sampling technique and select the sample. 4. Check that the sample is representative of the population.  For populations of less than 50 cases, Henry (1990) advises against probability sampling.  You should collect data on the entire population as the influence of a single extreme case on subsequent statistical analyses is more pronounced than for larger samples. 12
  • 13. I. Identifying a suitable sampling frame and the implications for generalisability  The sampling frame for any probability sample is a complete list of all the cases in the population from which your sample will be drawn.  Researchers then select their sample from this list.  The completeness of the sampling frame is very important.  An incomplete or inaccurate list means that some cases will have been excluded and so it will be impossible for every case in the population to have a chance of selection.  Then the sample may not be representative of the total population. 13
  • 14.  Where no suitable list exists you will have to compile your own sampling frame, perhaps drawing upon existing lists.  It is important to ensure that the sampling frame is unbiased, current and accurate.  The way you define your sampling frame also has implications regarding the extent to which you can generalise from your sample.  Within probability sampling, by defining the sampling frame you are defining the population about which you want to generalise.  Organisations that specialise in selling lists of names, addresses and email addresses. 14
  • 15. 15
  • 16. II. Deciding on a suitable sample size  The larger your sample’s size, the lower the likely error in generalising to the population.  Probability sampling is therefore a compromise between the accuracy of your findings and the amount of time and money you invest in collecting, checking and analysing the data. 16
  • 17.  The choice of sample size is governed by: 1. The confidence you need to have in your data  The level of certainty that the characteristics of the data collected will represent the characteristics of the total population; 2. Margin of error that you can tolerate  The accuracy you require for any estimates made from your sample; 3. The types of analyses you are going to undertake  In particular the number of categories into which you wish to subdivide your data, as many statistical techniques have a minimum threshold of data cases for each cell  and to a lesser extent:  the size of the total population from which your sample is being drawn.  Note: the final sample size is almost always a matter of judgment as well as of calculation. 17
  • 18.  The larger the absolute size of a sample, the more closely its distribution will be to the normal distribution and thus the more robust it will be.  This relationship, known as the central limit theorem, occurs even if the population from which the sample is drawn is not normally distributed.  A sample size of 30 or more will usually result in a sampling distribution for the mean that is very close to a normal distribution.  Stutely’s (2003) advice of a minimum number of 30 for statistical analyses provides a useful rule of thumb for the smallest number in each category within your overall sample. 18
  • 19.  The process of coming up with conclusions about a population on the basis of data describing the sample is called statistical inference.  It allows you to calculate how probable it is that your result, given your sample size, could have been obtained by chance.  Providing they are not biased, samples of larger absolute size are more likely to be representative of the population from which they are drawn than smaller samples and  The mean (average) calculated for the sample is more likely to equal the mean for the population.  This is known as the law of large numbers. 19
  • 20.  Researchers normally work to a 95 per cent level of certainty.  This means that if your sample was selected 100 times, at least 95 of these samples would be certain to represent the characteristics of the population.  The margin of error describes the precision of your estimates of the population. 20
  • 21. 21
  • 22.  The smaller the absolute size of the sample and, to a far lesser extent, the smaller the relative proportion of the total population sampled, the greater the margin of error.  Within this, the impact of absolute sample size on the margin of error decreases for larger sample sizes.  deVaus (2002) argues that it is for this reason that many market research companies limit their samples’ sizes to approximately 2000. 22
  • 23. The Importance of a High Response Rate  A perfect representative sample is one that exactly represents the population from which it is taken.  You therefore need to obtain as high a response rate as possible to ensure that your sample is representative.  In reality, you are likely to have non-responses.  They have refused to be involved in your research for whatever reason.  Your respondents will not be representative of the total population, and the data you collect may be biased.  Any non-responses will necessitate extra respondents being found to reach the required sample size, thereby increasing the cost of your data collection. 23
  • 24.  Nonresponse is due to four interrelated problems: 1. Refusal to respond; 2. Ineligibility to respond; 3. Inability to locate respondent; 4. Respondent located but unable to make contact.  As part of your research report, you will need to include your response rate.  Neumann (2000) suggests that when you calculate this you should include all eligible respondents:  This he calls the total response rate. 24
  • 25.  A more common way of doing this excludes ineligible respondents and those who, despite repeated attempts were unreachable.  This is known as the active response rate: 25
  • 26. III. Selecting the most appropriate sampling technique and the sample  Five main techniques can be used to select a probability sample: 1. Simple random; 2. Systematic; 3. Stratified Random; 4. Cluster; 5. Multi-stage. 26
  • 27. 27
  • 28. 28
  • 29. 1. Simple random sampling  It involves selecting the sample at random from the sampling frame using either random number tables or a computer. 1. Number each of the cases in your sampling frame with a unique number. The first case is numbered 0, the second 1 and so on. 2. Select cases using random numbers until your actual sample size is reached.  If the same number is read off a second time it must be disregarded as you need different cases.  Not putting each case’s number back into the sampling frame after it has been selected - termed sampling without replacement.  Use a computer program such as a spreadsheet to generate random numbers. 29
  • 30.  Random numbers allow you to select your sample without bias.  The sample selected can be said to be representative of the whole population.  Simple random sampling is best used when you have an accurate and easily accessible sampling frame that lists the entire population, preferably stored on a computer. 30
  • 31. 2. Systematic sampling  Systematic sampling involves you selecting the sample at regular intervals from the sampling frame; to do this you: 1. Number each of the cases in your sampling frame with a unique number.  The first case is numbered 0, the second 1 and so on. 2. Select the first case using a random number. 3. Calculate the sampling fraction. 4. Select subsequent cases systematically using the sampling fraction to determine the frequency of selection. 31
  • 32.  To calculate the sampling fraction – the proportion of the total population that you need to select – you use the formula  If your sampling fraction is 1⁄3 you need to select one in every three cases – that is, every third case from the sampling frame.  Selecting one in every three would not be random as every third case would be bound to be selected, whereas those between would have no chance of selection.  A random number is used to decide where to start on the sampling frame. 32
  • 33. 3. Stratified random sampling  Stratified random sampling is a modification of random sampling  You divide the population into two or more relevant and significant strata based on one or a number of attributes.  The sampling frame is divided into a number of subsets.  A random sample (simple or systematic) is then drawn from each of the strata. 33
  • 34.  Stratified sampling shares many of the advantages and disadvantages of simple random or systematic sampling.  Dividing the population into a series of relevant strata means that the sample is more likely to be representative.  You can ensure that each of the strata is represented proportionally within your sample.  Yet, it is only possible to do this if you are aware of, and can easily distinguish, significant strata in your sampling frame.  Moreover, the extra stage in the sampling procedure means that it is likely to take longer, to be more expensive, and to be more difficult to explain than simple random or systematic sampling. 34
  • 35.  Some sampling frame will already be divided into strata (deVaus, 2002).  For example, membership lists that are ordered by date of joining will automatically result in stratification by length of membership if systematic sampling is used.  Or, a sampling frame of employee names that is in alphabetical order will automatically ensure that employees will be sampled in the correct proportion to the letter with which their name begins if systematic sampling is used.  If you are using simple random sampling or your sampling frame contains periodic patterns, you will need to stratify it. 35
  • 36.  Steps to Stratify a Sampling Frame: 1) Choose the stratification variable or variables. 2) Divide the sampling frame into the discrete strata. 3) Number each of the cases within each stratum with a unique number. 4) Select your sample using either simple random or systematic sampling.  The stratification variable (or variables) chosen should represent the discrete characteristic (or characteristics) for which you want to ensure correct representation within the sample. 36
  • 37.  Samples can be stratified using more than one characteristic.  You may wish to stratify a sample of an organization's employees by both department and salary grade: 1) Divide the sampling frame into the discrete departments. 2) Within each department divide the sampling frame into discrete salary grades. 3) Number each of the cases within each salary grade within each department with a unique number. 4) Select your sample using either simple random or systematic sampling. NOTE:  The relative sizes of different strata mean that, in order to have sufficient data for analysis, you need to select larger samples from the strata with smaller populations. 37
  • 38. 4. Cluster sampling  Cluster sampling is similar to stratified sampling as you need to divide the population into discrete groups prior to sampling (Henry, 1990).  The groups are termed clusters in this form of sampling and can be based on any naturally occurring grouping.  For example, you could group your data by type of manufacturing firm or geographical area.  For cluster sampling your sampling frame is the complete list of clusters rather than a complete list of individual cases within the population.  You then select a few clusters, normally using simple random sampling.  Data are then collected from every case within the selected clusters. 38
  • 39.  The cluster sampling technique has three main stages: 1) Choose the cluster grouping for your sampling frame. 2) Number each of the clusters with a unique number.  The first cluster is numbered 0, the second 1 and so on. 3) Select your sample using some form of random sampling.  Selecting clusters randomly makes cluster sampling a probability sampling technique. 39
  • 40. 5. Multi-stage sampling  Multi-stage sampling - multi-stage cluster sampling - is a development of cluster sampling.  It is normally used to overcome problems associated with a geographically dispersed population when face-to-face contact is needed or where it is expensive and time consuming to construct a sampling frame for a large geographical area.  The technique involves taking a series of cluster samples, each involving some form of random sampling.  It can be divided into four phases.  Because multi-stage sampling relies on a series of different sampling frames, you need to ensure that they are all appropriate and available. 40
  • 41. 41
  • 42.  Probability techniques all are based on the assumption that your sample will be chosen statistically at random.  It is possible to specify the probability that any case will be included in the sample.  Within business research, such as market surveys and case study research, this may either not be possible (as you do not have a sampling frame) or appropriate to answering your research question.  Non-probability sampling (or non-random sampling) techniques are based on the researcher’s subjective judgment. 42
  • 43.  To answer your research questions(s) and to meet your objectives, you may need to undertake an in-depth study that focuses on a small, perhaps one, case selected for a particular purpose.  This sample would provide you with an information-rich case study in which you explore your research question.  Moreover, Limited resources or the inability to specify a sampling frame may dictate the use of one or a number of non-probability sampling techniques. 43
  • 44. 44
  • 45. 45
  • 46. 1) Quota sampling  Quota sampling is entirely non-random and is normally used for interview surveys.  It is based on the premise that your sample will represent the population as the variability in your sample for various quota variables is the same as that in the population.  Quota sampling is a type of stratified sample in which selection of cases within strata is entirely non-random (Barnett, 1991). 46
  • 47.  To select a quota sample you: 1. Divide the population into specific groups. 2. Calculate a quota for each group based on relevant and available data. 3. Give each interviewer an assignment, which states the number of cases in each quota from which they must collect data. 4. Combine the data collected by interviewers to provide the full sample. 47
  • 48. Advantages of Quota Sampling over Random Sampling 1. It is less costly and can be set up very quickly. 2. It does not require a sampling frame and may be the only technique you can use if one is not available.  Quota sampling is normally used for large populations.  For small populations it is usually possible to obtain a sampling frame.  Decisions on sample size are governed by the need to have sufficient responses in each quota to enable subsequent statistical analyses to be undertaken. 48
  • 49.  Calculations of quotas are based on relevant and available data and are usually relative to the proportions in which they occur in the population.  Without sensible and relevant quotas, data collected may be biased.  For many market research projects, quotas are derived from census data.  Your choice of quota is dependent on two main factors: 1. Usefulness as a means of stratifying the data; 2. Ability to overcome likely variations between groups in their availability for interview. 49
  • 50. 2) Purposive / Judgmental sampling  Use your judgment to select cases.  Used when working with very small samples such as in case study research.  To select cases that are particularly informative (Neuman, 2000).  Used by researchers adopting the grounded theory strategy.  Cannot be considered to be statistically representative of the total population. 50
  • 51. 51
  • 52. 1) Extreme Case or Deviant Sampling  It focuses on unusual or special cases  The data collected about these unusual or extreme outcomes will enable you to learn the most and to answer your research question(s) and to meet your objectives most effectively.  Findings from extreme cases will be relevant in understanding or explaining more typical cases (Patton, 2002). 52
  • 53. 2) Heterogeneous or Maximum Variation Sampling  It enables you to collect data to describe and explain the key themes that can be observed.  Any patterns that do emerge are likely to be of particular interest and value and represent the key themes.  The data collected should enable you to document uniqueness.  To ensure maximum variation within a sample Patton (2002) suggests you identify your diverse characteristics (sample selection criteria) prior to selecting your sample. 53
  • 54. 3) Homogeneous or Minimum Variation Sampling  It focuses on one particular subgroup in which all the sample members are similar.  This enables you to study the group in great depth. 54
  • 55. 4) Critical Case Sampling  It selects critical cases on the basis that they can make a point dramatically or because they are important.  The focus of data collection is to understand what is happening in each critical case so that logical generalisations can be made.  Patton (2002) outlines a number of clues that suggest critical cases: 1) If it happens there, will it happen everywhere? 2) If they are having problems, can you be sure that everyone will have problems? 3) If they cannot understand the process, is it likely that no one will be able to understand the process? 55
  • 56. 5) Typical Case Sampling  It is usually used as part of a research project to provide an illustrative profile using a representative case.  Such a sample enables you to provide an illustration of what is ‘typical’ to those who will be reading your research report and may be unfamiliar with the subject matter.  It is not intended to be definitive. 56
  • 57. 3) Snowball sampling  Snowball sampling is commonly used when it is difficult to identify members of the desired population, for example people who are working while claiming unemployment benefit.  Researchers need to: 1) Make contact with one or two cases in the population. 2) Ask these cases to identify further cases. 3) Ask these new cases to identify further new cases (and so on). 4) Stop when either no new cases are given or the sample is as large as is manageable.  The main problem is making initial contact. 57
  • 58.  For such samples the problems of bias are huge, as respondents are most likely to identify other potential respondents who are similar to themselves, resulting in a homogeneous sample (Lee, 1993).  The next problem is to find these new cases. 58
  • 59. 4) Self-selection sampling  It occurs when you allow each case, usually individuals, to identify their desire to take part in the research.  You therefore: 1. Publicize your need for cases, either by advertising through appropriate media or by asking them to take part.  Articles and advertisements in magazines that the population are likely to read, postings on appropriate Internet newsgroups and discussion groups, hyperlinks from other websites as well as letters or emails of invitation to colleagues and friends. 2. Collect data from those who respond. 59
  • 60. 5) Convenience sampling or haphazard sampling  It involves selecting haphazardly those cases that are easiest to obtain for your sample, such as the person interviewed at random in a shopping centre for a television programme.  The sample selection process is continued until your required sample size has been reached.  It is prone to bias and influences that are beyond your control, as the cases appear in the sample only because of the ease of obtaining them. 60