CHAPTER TWO
SAMPLE, SAMPLING METHODS AND
PROPABILITIES CONCEPTS
CONTENTS
1. Sampling
2. Advantages of samples
3. Methods of Sampling
3.1. Non-probability Sampling Methods
3.1.1. Convenience sampling
3.1.2. Quota sampling
3.2. Probability Sampling Methods
3.2.1. Simple random sampling (SRS)
3.2.2. Systematic Sampling
3.2.3. Stratified Sampling
3.2.4. Cluster sampling
4. Errors in sampling
4.1. Sampling error (random error)
4.2. Non-sampling error (bias)
5. Sample Size Calculation
6. What is Probability?
7. Properties of Probability
8. Rules of Probability
10. Calculating Probability
Sampling
 Sampling involves the selection of a number of a study units from
a defined population.
 The population is too large for us to consider collecting
information from all its members.
 Usually, a representative subgroup of the population (sample) is
included in the investigation.
 A representative sample has all the important characteristics of
the population from which it is drawn
Advantages of samples
 cost - sampling saves time, labor and money
 quality of data - more time and effort can be spent on getting
reliable data on each individual included in the sample.
 Due to the use of better trained personnel, more careful supervision
and processing a sample can actually produce precise results.
Common terms used in sampling
 Reference population (also called source population or target
population) –
 the population of interest, to which the investigators would like to
generalize the results of the study, and from which a representative
sample is to be drawn.
 Study or sample population –
 the population included in the sample
 Sampling unit –
 the unit of selection in the sampling process
 Study unit –
 the unit on which information is collected.
 Sampling frame –
 the list of all the units in the reference population, from which a
sample is to be picked.
Sampling methods
1. Non-probability Sampling Methods
2. Probability Sampling Methods
Non-probability Sampling Methods
 Used when a sampling frame does not exist
 No random selection (unrepresentative of the given population)
 Inappropriate if the aim is to measure variables and generalize
findings obtained from a sample to the population.
Non-probability sampling methods are:
 A) Convenience sampling:
 is a method in which for convenience sake the study
units that happen to be available at the time of data
collection are selected
B) Quota sampling:
 is a method that ensures that a certain number of sample units from
different categories with specific characteristics are represented.
 In this method the investigator interviews as many people in each
category of study unit as he can find until he has filled his quota.
 Both the above methods do not claim to be representative of
the entire population.
Probability Sampling methods
 A sampling frame exists.
 Involve random selection procedures. All units of the population
should have an equal or at least a known chance of being included
in the sample.
A) Simple random sampling (SRS)
 This is the most basic scheme of random sampling.
 Each unit in the sampling frame has an equal chance of being
selected
 representativeness of the sample is ensured.
B) Systematic Sampling
 Individuals are chosen at regular intervals ( for example, every kth
) from the sampling frame.
 The first unit to be selected is taken at random from among the
first k units.
 For example, a systematic sample is to be selected from 1200
students of a school. The sample size is decided to be 100.
 The sampling fraction is: 100 /1200 = 1/12. Hence, the sample interval
is 12.
 The number of the first student to be included in the sample is
chosen randomly, for example by blindly picking one out of twelve
pieces of paper, numbered 1 to 12.
 If number 6 is picked, every twelfth student will be included in the
sample, starting with student number 6, until 100 students are
selected. The numbers selected would be 6,18,30,42,etc.
C) Stratified Sampling
 The population is first divided into groups (strata) according to a
characteristic of interest (eg., sex, geographic area, prevalence of
disease, etc.).
 A separate sample is then taken independently from each
stratum, by simple random or systematic sampling.
D) Cluster sampling
 In this sampling scheme, selection of the required sample is
done on groups of study units (clusters) instead of each study
unit individually.
 The sampling unit is a cluster, and the sampling frame is a list
of these clusters.
 procedure
 The reference population (homogeneous) is divided into clusters.
These clusters are often geographic units (eg districts, villages, etc.)
 A sample of such clusters is selected
Errors in sampling
 When we take a sample, our results will not exactly
equal the correct results for the whole population.
 That is, our results will be subject to errors.
Sampling error (random error)
 A sample is a subset of a population.
 Because of this property of samples, results obtained from them
cannot reflect the full range of variation found in the larger group
(population).
 This type of error, arising from the sampling process itself, is
called sampling error which is a form of random error.
 Sampling error can be minimized by increasing the size of the
sample.
 When n = N sampling error = 0
⇒
Non-sampling error (bias)
 It is a type of systematic error in the design or conduct of a
sampling procedure which results in distortion of the sample, so
that it is no longer representative of the reference population.
 We can eliminate or reduce the non-sampling error (bias) by
careful design of the sampling procedure and not by increasing
the sample size.
Sample Size Calculation
 N: stands for the target
population.
 n: will be the sample size.
 While e also stands for the
marginal error that will be
0.05, this means the researcher
is going to work confidence
level of 95%
 N = 70, e = 0.05, n =?
Probability Concepts
What is Probability?
 Probability is the branch of mathematics
concerning numerical descriptions of how
likely an event is to occur, or how likely it is
that a proposition is true.
 The probability of an event is a number
between 0 and 1. where, 0 indicates
impossibility of the event and 1 indicates
certainty.
Properties of Probabilities
 Property 1. Probabilities are always between
0 and 1
 Property 2. A sample space is all possible
outcomes. The probabilities in the sample
space sum to 1 (exactly).
 Property 3. The complement of an event is
“the event not happening”. The probability of
a complement is 1 minus the probability of
the event.
 Property 4. Probabilities of disjoint events
can be added.
Properties of Probabilities
In symbols
 Property 1. 0 Pr(
≤ A) 1
≤
 Property 2. Pr(S) = 1, where S represent the
sample space (all possible outcomes)
 Property 3. Pr(Ā) = 1 – Pr(A), Ā represent the
complement of A (not A)
 Property 4. If A and B are disjoint, then Pr(A or
B) = Pr(A) + Pr(B)
Properties 1 & 2 Illustrated
Property 1. 0 Pr(
≤ A) 1
≤
Note that all individual
probabilities are between 0
and 1.
Property 2. Pr(S) = 1
Note that the sum of all
probabilities = .0039 + .0469
+ .2109 + .4219 + .3164 = 1
“Four patients” pmf
Property 3 Illustrated
Property 3. Pr(Ā) = 1 –
Pr(A),
As an example, let A
represent 4 successes.
Pr(A) = .3164
Let Ā represent the
complement of A.
Pr(Ā) = 1 – Pr(A) = 1 – 0.3164
= 0.6836
“Four patients”
Ā
A
Property 4 Illustrated
Property 4. Pr(A or B) =
Pr(A) + Pr(B) for disjoint
events
Let A represent 4 successes
Let B represent 3 successes
Since A and B are disjoint,
Pr(A or B) = Pr(A) + Pr(B) =
0.3164 + 0.4129 = 0.7293.
The probability of
observing 3 or 4 successes is
0.7293 (about 73%).
“Four patients” pmf
B A
Venn and Tree Diagram
Venn and Tree Diagram
EXERCISE
Calculate the probability of getting an odd number if a dice is rolled.
?
Example
Calculate the probability of getting an odd number if a dice is rolled.
Solution:
Sample space (S) = {1, 2, 3, 4, 5, 6}
n(S) = 6
Let “E” be the event of getting an odd number, E = {1, 3, 5}
n(E) = 3
So, the Probability of getting an odd number is:
P(E) = (Number of outcomes favorable)/(Total number of outcomes)
= n(E)/n(S)
= 3/6
= ½
EXERCISE
There are 5 red balls, 7 green balls and 13 blue balls in the bag.
A. Calculate the probability of getting red ball.
B. Calculate the probability of getting green or blue balls.
?
Complementary events
 If two events are complementary events then to find the probability of
one just subtract the probability of the other from one.
Example
a.Suppose you know that the probability of it raining today is 0.45. What is
the probability of it not raining?
b.Suppose you know the probability of not getting the flu is 0.24. What is
the probability of getting the flu?
c.In an experiment of picking a card from a deck, what is the probability of
not getting a card that is a Queen?
Solution
a. Since not raining is the complement of raining, then
P( not raining )=1−P( raining )=1−0.45=0.55P( not
raining )=1−P( raining )=1−0.45=0.55
b. Since getting the flu is the complement of not getting the flu,
then
P( getting the flu )=1−P( not getting the
flu )=1−0.24=0.76P( getting the flu )=1−P( not getting the
flu )=1−0.24=0.76
EXERCISE
Contains the number of T-shirt of each color that were found in a
shop.
Blue Brown Green Orange Red Yellow
48 37 48 54 32 36
a. Find the probability of choosing a green T-shirt.
b. Find the probability of choosing a blue or yellow T-shirt.
c. Find the probability of not choosing a brown T-shirt.
EXERCISE
What is the likelihood of choosing a day that falls on the weekend
when randomly picking a day of the week?
The end
Thank you so much for listening
Any question !!!!!

Chapter three and - Economics-introduction

  • 1.
    CHAPTER TWO SAMPLE, SAMPLINGMETHODS AND PROPABILITIES CONCEPTS
  • 2.
    CONTENTS 1. Sampling 2. Advantagesof samples 3. Methods of Sampling 3.1. Non-probability Sampling Methods 3.1.1. Convenience sampling 3.1.2. Quota sampling
  • 3.
    3.2. Probability SamplingMethods 3.2.1. Simple random sampling (SRS) 3.2.2. Systematic Sampling 3.2.3. Stratified Sampling 3.2.4. Cluster sampling 4. Errors in sampling 4.1. Sampling error (random error) 4.2. Non-sampling error (bias) 5. Sample Size Calculation
  • 4.
    6. What isProbability? 7. Properties of Probability 8. Rules of Probability 10. Calculating Probability
  • 5.
    Sampling  Sampling involvesthe selection of a number of a study units from a defined population.  The population is too large for us to consider collecting information from all its members.  Usually, a representative subgroup of the population (sample) is included in the investigation.  A representative sample has all the important characteristics of the population from which it is drawn
  • 6.
    Advantages of samples cost - sampling saves time, labor and money  quality of data - more time and effort can be spent on getting reliable data on each individual included in the sample.  Due to the use of better trained personnel, more careful supervision and processing a sample can actually produce precise results.
  • 7.
    Common terms usedin sampling  Reference population (also called source population or target population) –  the population of interest, to which the investigators would like to generalize the results of the study, and from which a representative sample is to be drawn.  Study or sample population –  the population included in the sample
  • 8.
     Sampling unit–  the unit of selection in the sampling process  Study unit –  the unit on which information is collected.  Sampling frame –  the list of all the units in the reference population, from which a sample is to be picked.
  • 9.
    Sampling methods 1. Non-probabilitySampling Methods 2. Probability Sampling Methods
  • 10.
    Non-probability Sampling Methods Used when a sampling frame does not exist  No random selection (unrepresentative of the given population)  Inappropriate if the aim is to measure variables and generalize findings obtained from a sample to the population.
  • 11.
    Non-probability sampling methodsare:  A) Convenience sampling:  is a method in which for convenience sake the study units that happen to be available at the time of data collection are selected
  • 12.
    B) Quota sampling: is a method that ensures that a certain number of sample units from different categories with specific characteristics are represented.  In this method the investigator interviews as many people in each category of study unit as he can find until he has filled his quota.  Both the above methods do not claim to be representative of the entire population.
  • 13.
    Probability Sampling methods A sampling frame exists.  Involve random selection procedures. All units of the population should have an equal or at least a known chance of being included in the sample.
  • 14.
    A) Simple randomsampling (SRS)  This is the most basic scheme of random sampling.  Each unit in the sampling frame has an equal chance of being selected  representativeness of the sample is ensured.
  • 15.
    B) Systematic Sampling Individuals are chosen at regular intervals ( for example, every kth ) from the sampling frame.  The first unit to be selected is taken at random from among the first k units.  For example, a systematic sample is to be selected from 1200 students of a school. The sample size is decided to be 100.  The sampling fraction is: 100 /1200 = 1/12. Hence, the sample interval is 12.
  • 16.
     The numberof the first student to be included in the sample is chosen randomly, for example by blindly picking one out of twelve pieces of paper, numbered 1 to 12.  If number 6 is picked, every twelfth student will be included in the sample, starting with student number 6, until 100 students are selected. The numbers selected would be 6,18,30,42,etc.
  • 17.
    C) Stratified Sampling The population is first divided into groups (strata) according to a characteristic of interest (eg., sex, geographic area, prevalence of disease, etc.).  A separate sample is then taken independently from each stratum, by simple random or systematic sampling.
  • 18.
    D) Cluster sampling In this sampling scheme, selection of the required sample is done on groups of study units (clusters) instead of each study unit individually.  The sampling unit is a cluster, and the sampling frame is a list of these clusters.
  • 19.
     procedure  Thereference population (homogeneous) is divided into clusters. These clusters are often geographic units (eg districts, villages, etc.)  A sample of such clusters is selected
  • 20.
    Errors in sampling When we take a sample, our results will not exactly equal the correct results for the whole population.  That is, our results will be subject to errors.
  • 21.
    Sampling error (randomerror)  A sample is a subset of a population.  Because of this property of samples, results obtained from them cannot reflect the full range of variation found in the larger group (population).  This type of error, arising from the sampling process itself, is called sampling error which is a form of random error.  Sampling error can be minimized by increasing the size of the sample.  When n = N sampling error = 0 ⇒
  • 22.
    Non-sampling error (bias) It is a type of systematic error in the design or conduct of a sampling procedure which results in distortion of the sample, so that it is no longer representative of the reference population.  We can eliminate or reduce the non-sampling error (bias) by careful design of the sampling procedure and not by increasing the sample size.
  • 23.
    Sample Size Calculation N: stands for the target population.  n: will be the sample size.  While e also stands for the marginal error that will be 0.05, this means the researcher is going to work confidence level of 95%  N = 70, e = 0.05, n =?
  • 24.
  • 25.
    What is Probability? Probability is the branch of mathematics concerning numerical descriptions of how likely an event is to occur, or how likely it is that a proposition is true.  The probability of an event is a number between 0 and 1. where, 0 indicates impossibility of the event and 1 indicates certainty.
  • 26.
    Properties of Probabilities Property 1. Probabilities are always between 0 and 1  Property 2. A sample space is all possible outcomes. The probabilities in the sample space sum to 1 (exactly).  Property 3. The complement of an event is “the event not happening”. The probability of a complement is 1 minus the probability of the event.  Property 4. Probabilities of disjoint events can be added.
  • 27.
    Properties of Probabilities Insymbols  Property 1. 0 Pr( ≤ A) 1 ≤  Property 2. Pr(S) = 1, where S represent the sample space (all possible outcomes)  Property 3. Pr(Ā) = 1 – Pr(A), Ā represent the complement of A (not A)  Property 4. If A and B are disjoint, then Pr(A or B) = Pr(A) + Pr(B)
  • 28.
    Properties 1 &2 Illustrated Property 1. 0 Pr( ≤ A) 1 ≤ Note that all individual probabilities are between 0 and 1. Property 2. Pr(S) = 1 Note that the sum of all probabilities = .0039 + .0469 + .2109 + .4219 + .3164 = 1 “Four patients” pmf
  • 29.
    Property 3 Illustrated Property3. Pr(Ā) = 1 – Pr(A), As an example, let A represent 4 successes. Pr(A) = .3164 Let Ā represent the complement of A. Pr(Ā) = 1 – Pr(A) = 1 – 0.3164 = 0.6836 “Four patients” Ā A
  • 30.
    Property 4 Illustrated Property4. Pr(A or B) = Pr(A) + Pr(B) for disjoint events Let A represent 4 successes Let B represent 3 successes Since A and B are disjoint, Pr(A or B) = Pr(A) + Pr(B) = 0.3164 + 0.4129 = 0.7293. The probability of observing 3 or 4 successes is 0.7293 (about 73%). “Four patients” pmf B A
  • 31.
  • 32.
  • 37.
    EXERCISE Calculate the probabilityof getting an odd number if a dice is rolled. ?
  • 38.
    Example Calculate the probabilityof getting an odd number if a dice is rolled. Solution: Sample space (S) = {1, 2, 3, 4, 5, 6} n(S) = 6 Let “E” be the event of getting an odd number, E = {1, 3, 5} n(E) = 3 So, the Probability of getting an odd number is: P(E) = (Number of outcomes favorable)/(Total number of outcomes) = n(E)/n(S) = 3/6 = ½
  • 39.
    EXERCISE There are 5red balls, 7 green balls and 13 blue balls in the bag. A. Calculate the probability of getting red ball. B. Calculate the probability of getting green or blue balls. ?
  • 40.
    Complementary events  Iftwo events are complementary events then to find the probability of one just subtract the probability of the other from one.
  • 41.
    Example a.Suppose you knowthat the probability of it raining today is 0.45. What is the probability of it not raining? b.Suppose you know the probability of not getting the flu is 0.24. What is the probability of getting the flu? c.In an experiment of picking a card from a deck, what is the probability of not getting a card that is a Queen?
  • 42.
    Solution a. Since notraining is the complement of raining, then P( not raining )=1−P( raining )=1−0.45=0.55P( not raining )=1−P( raining )=1−0.45=0.55 b. Since getting the flu is the complement of not getting the flu, then P( getting the flu )=1−P( not getting the flu )=1−0.24=0.76P( getting the flu )=1−P( not getting the flu )=1−0.24=0.76
  • 43.
    EXERCISE Contains the numberof T-shirt of each color that were found in a shop. Blue Brown Green Orange Red Yellow 48 37 48 54 32 36 a. Find the probability of choosing a green T-shirt. b. Find the probability of choosing a blue or yellow T-shirt. c. Find the probability of not choosing a brown T-shirt.
  • 44.
    EXERCISE What is thelikelihood of choosing a day that falls on the weekend when randomly picking a day of the week?
  • 45.
    The end Thank youso much for listening Any question !!!!!