2. From Course Syllabus
• Probability and Probability distributions
Basics and Rules
Binomial distribution
Normal distribution
Application in health research (OR, RR)
2
9/2/2022
3. Probability and Sampling Distributions
Probability Distributions
Basics and Rules
Binomial distribution
Poisson distribution
Normal distribution
Sampling Distribution
• Sample means
• Sample proportions
Application in Health Research (OR, RR)
3
9/2/2022
4. Probability
the ratio between number of possible event (a) to the number of all
possible events (A).
P(E)= a/A
In a community, there are 256 population of working age group( 121
male and 135 female). If 1 person is randomly selected,
The chance that a male will be selected = P(Male) = 121/356
The chance that a female will be selected= P(Female)=135/256
Numerator is a part or subset of denominator
Summation of the probability of all possible events= 1
4
9/2/2022
5. Basic Rules of probability
Probability is the measure of the relative frequency of an occurrence of event.
Let S= set of all possible outcomes=(A1, A2, A3,……..,An}
0≤ 𝑃(𝐴𝑖) ≤ 1
Sum of all 𝑃(𝐴𝑖)=1
𝐴𝑖 & 𝐴𝑗 = any two events , P(𝐴𝑖 or 𝐴𝑗) = P(𝐴𝑖) + P(𝐴𝑗) - P(𝐴𝑖 and 𝐴𝑗)
If 𝐴𝑖 and 𝐴𝑗 = mutually exclusive events
P(𝐴𝑖 and 𝐴𝑗)=0
P(𝐴𝑖 or 𝐴𝑗)= P(𝐴𝑖) + P(𝐴𝑗)
P 𝐴𝑖 𝐴𝑗 =
P(𝐴𝑖 and 𝐴𝑗)
P(𝐴𝑗)
; P(𝐴𝑗) ≠ 0
If 𝐴𝑖 and 𝐴𝑗 = independent events , P(𝐴𝑖 and 𝐴𝑗)=P(𝐴𝑖) * P(𝐴𝑗)
5
9/2/2022
6. Probability deals with the occurrence of a random event.
The four basic rules of probability are :
1. Addition rule of probability : P(A or B)=P(A)+P(B)-P(A and B)
2. Multiplication rule of probability
: P(A and B)=P(A)×P(B|A) or P(B)×P(A|B)
3. Complement rule of probability : P(not A)=1-P(A)
4. Total probability rule : The sum of the probabilities of all possible
outcomes is 1.
6
9/2/2022
7. Example 1:
Tossing an unbiased cion once, outcome={H, T}
P(H)=1/2
P(T)=1/2
H and T never occur together or P(H and T)=0
P(H or T) = P(H)+P(T)-P(H and T)
= (1/2)+(1/2)-0= 1
H and T = Mutually exclusive events
= Events that never occur together.
7
9/2/2022
8. Example 2:
Tossing an unbiased cion twice, outcome={HH,HT, TH, TT}
P(HH=2H=0T)=1/4
P(HT)=1/4=P(TH) >>> P(1T)=P(HT or TH)=P(1H)=1/4+1/4 =1/2
P(TT=2T=0H)=1/4
HH, HT, TH, TT = Mutually exclusive events
= Events that never occur together.
First tossing independent from 2nd tossing
P(HH)=P(H)*P(H), P(HT)=P(H)*P(T), P(TH)=P(T)*P(H), P(TT)=P(T)*P(T)
If A and B are 2 independent events, P(A and B)= P(A)*P(B)
Or P 𝐵 𝐴 = 𝑃(𝐵) or P 𝐴 𝐵 =P(A).
8
9/2/2022
9. Question 1. Find
P(M)= ……… P(F)=………. P(M and D)=……… P(M and 𝐷)= ……
P (𝐷)=………. P(𝐷 )= ……… P(F and D)=……… P(F and 𝐷)= ……
P(D|M)=…… P(D|F)=……
P(F|D)= ……. P(M|D)=…….
P(𝐷|M)= …… P(𝐷|F)=…….
Disease status by sex
Sex
Disease Total
𝐷 𝐷
Male(M) 40 460 500
Female 260 140 400
Total 300 600 900
9
9/2/2022
11. Binomial Distribution
= The distribution of the number of successes of n
independent and identical Bernoulli trials.
Let x = Number of successes =0,1,2,……..,n
= Chance of success of each trial
Pr 𝑋 = 𝑥 =
𝑛
𝑥
𝜋𝑥
(1 − 𝜋)𝑛−𝑥
Bernoulli trial or Binomial trial = a random experiment with exactly two
possible outcomes : success or failure in which the probability of success
is the same every time the experiment is conducted. 11
9/2/2022
12. Let X= # of T P(X) X*P(X) ((X-3)^2)*P(X)
0 (1-)3 0 (0-3)2 *(1-)3
1 3(1-)2 3(1-)2 (1-3)2 * 3(1-)2
2 3(1-)2 6(1-)2 (2-3)2 * 3(1-)2
3 3 33 (3-3)2 *3
Total 1 3 3(1-)
Mean of X Variance of X
12
9/2/2022
13. Pr 𝑋 = 𝑥 =
𝑛!
𝑥!(𝑛−𝑥)!
𝜋𝑥
(1 − 𝜋)𝑛−𝑥
; X = 0,1,2,3,…,n
Mean of X or 𝜇𝑥 = 𝑛𝜋
Variance of X or 𝜎𝑥
2
= 𝑛𝜋(1 − 𝜋)
𝜋 = parameter in Binomial distribution
13
9/2/2022
14. How to obtain Binomial probability in EXCEL?
𝑓𝑥 >> Select for a function or select for a category
select for a category …. Statistical
select a function …. BINOM.DIST
Number_s = Number of success (x)
Trials = Number of trials (n)
Probability_s = Probability of success ()
Cummulative box
True = Cumulative distribution function = Prob 𝑋 ≤ 𝑥
False = Probability mass function = Prob 𝑋 = 𝑥
14
9/2/2022
15. Question 2.
Q1:Suppose it said that 67.8% of BKK people strictly practice on
DMHTT(Distancing, Wearing Mask, Hand washing, Testing (temperature) and
Thai Cha Na)ใ
1. For a family of 10 members, what will be the chance that
a) None of them, b) Half, c) all, d) at least half of them
strictly practice on DMHTT.
2. In a community of size 200 population, what will be the
chance that
a) None of them, b) Half, c) all, d) at least half of them
strictly practice on DMHTT.
For each question, find mean, median, mode and SD of the number of strictly
practice on DMHTT.
15
9/2/2022
16. = Mean of outcome per time interval
Let X= Number of outcome in each time interval = 0, 1, 2, 3, …………….
Pr 𝑋 = 𝑥 =
𝑒−𝑥
𝑥!
Mean of X or 𝜇𝑥 = = Variance of X or 𝜎𝑥
2
= parameter in Poisson distribution
Poisson Distribution
The distribution of the number outcomes in an equal non-overlapping
and independent time interval.
𝑥1 𝑥2 𝑥3 𝑥𝑛
1
0 2 3 n-1 n
16
9/2/2022
17. 𝑓𝑥 >> Select for a function or select for a category
select for a category …. Statistical
select a function …. POISSON.DIST
X = Number of success (x)
Mean = Lamda
Cumulative box
True = Cumulative distribution function = Prob 𝑋 ≤ 𝑥
False = Probability mass function = Prob 𝑋 = 𝑥
How to obtain Binomial probability in EXCEL?
17
9/2/2022
18. QUESTION 3:
A community hospital report on the average of road traffic injured patients
admitted to the emergency department was 5.1 daily.
What is the probability that in a coming day, there will be none, 3,4 and at
least 6 of the patients admitted to this emergency department?
18
9/2/2022
22. 𝑓(𝑋) =
1
2𝜋𝜎2
𝑒
(𝑋−𝜇)2
2𝜎2
; -∞ < 𝑋 < ∞ , −∞ < 𝜇 < ∞, 𝜎2
≠ 0
X ~N(𝜇 , 𝜎2)
1. Bell shape or symmetry with respect to mean
2. Me(di)an = Mode
3. -∞ < 𝑋 < ∞
4. P(X=x)=0 (in theory but practical ????)
22
9/2/2022
23. Standard Normal Distribution
= Normal with Mean=0 and Variance=1
Z~ 𝑁 0,1
𝑓(𝑋) =
1
2𝜋
𝑒
1
2
𝑧2
; -∞ < 𝑍 < ∞
Area of Normal or Std Normal …… From EXCEL
Relationship bet Normal and Std Normal
𝑧 =
𝑋 − 𝜇
𝜎
; 𝜎 ≠ 0
23
9/2/2022
24. EXCEL for Standard Normal Distribution
>> Tool bar>>Function category>> Statistical>>Function name
NORM.S.DIST ….. P(Z<z)
NORM.S.INV ….. Z=?, if area under std normal curve is given
NORM.DIST …. Normal Dist. P(X<x) for known mean and variance
NORM.INV …. X=?, if area under normal curve is given for known
mean and variance
24
9/2/2022
25. Normal approximate to Binomial
If n is large of 𝒏 → ∞, X~ 𝑁(𝑛𝜋, 𝑛𝜋 1 − 𝜋 )
Z =
𝑋−𝑛𝜋
𝑛𝜋(1−𝜋)
~ 𝑁(0,1)
Normal approximate to Poisson
X~ Poisson()
Z =
𝑋−
~ 𝑁(0,1)
Poisson approximate to Binomial
If n is large of 𝒏 → ∞ and 𝜋 → 0 , then 𝑛𝜋 ≈ 𝑛𝜋(1 − 𝜋)
X~ Poisson (𝑛𝜋)
25
9/2/2022
26. Question 4:
We assumed that the systolic blood pressure of adult men is
approximately normal with mean and standard deviation 125 and
12 mm.Hg. respectively.
a) If one adult man is randomly selected, what will be the chance
that his systolic blood pressure will be
a.1) less than 130 mm.Hg.
a.2) between 110 and 127 mm.Hg.
a.3) more than 120 mm.Hg.
b) If 300 adult men are randomly selected, how many of them will
have their blood pressure more than 120 mm.Hg.
26
9/2/2022
27. Question 5:
In a small school of size 150 students, if carries prevalence of student in
this school is the same as the provincial level which is 30%. What is the
chance that you’ll find exactly 10, at most 10 and 10-30 students with
carries free.
Question 6:
1% of workers under insurance was injured yearly. If a company with
1000 workers, what is the chance that not more than 8, 8-15 workers will
be injured in the coming year.
27
9/2/2022
37. Mean of sample means Variance of sample mean
Sampling with replacement 𝜇𝑥 = 𝜇𝑥
𝜎𝑥
2
=
𝜎2
𝑛
Sampling without
replacement
𝜇𝑥 = 𝜇𝑥 𝜎𝑥
2
=
(𝑁−𝑛)
(𝑁−1)
𝜎2
𝑛
Sampling without replacement, for large N and small n,
(𝑁−𝑛)
(𝑁−1)
→ 1, Then 𝜎𝑥
2
=
𝜎2
𝑛
n= sample size and N=size of population
𝜎𝑥 =
𝜎
𝑛
= Standard error of Mean = S.E(𝑥),
for unknown 𝜎2 >>> estimate by 𝑠2
= 𝑖=1
2 (𝑥𝑖−𝑥)2
𝑛−1
, then 𝑠𝑥 =
𝑠
𝑛
an estimate of 𝜎𝑥 .
37
9/2/2022
38. Central Limit Theorem
If we pick up sample of size n from normally distributed population with
known both mean (𝜇 ) and variance(𝜎2
) , the distribution of sample
means (𝑥) is normally distributed with mean(𝜇) and variance (
𝜎2
𝑛
).
𝑥 ~ 𝑁(𝜇,
𝜎2
𝑛
) then z =
𝑥−𝜇
𝜎
𝑛
38
9/2/2022
39. Central Limit Theorem
If we pick up sample of size n from non normal distributed population
with known mean (𝜇 ) and known variance(𝜎2
) , for large n the
distribution of sample means (𝑥) is approximately normally distributed
with mean(𝜇) and variance (
𝜎2
𝑛
).
𝑥 ~ 𝑁(𝜇,
𝜎2
𝑛
) then z =
𝑥−𝜇
𝜎
𝑛
For large sample size and unknown variance,
𝑥 ~ 𝑁(𝜇,
𝑠2
𝑛
) then t =
𝑥−𝜇
𝑠
𝑛
; df.=n-1
39
9/2/2022
40. Sampling distribution of sample proportion
X=0; female and
X=1; male
P(X=1) = P(Male) = 𝜋=
3
5
= 𝜇𝑥
P(X=0)=P(Female)= 1- 𝜋 =
2
5
= 𝜇𝑛−𝑥
Var(X)=
6
25
=
3
5
∗
2
5
= 𝜋(1- 𝜋 )
Population N=5
M1
M2
F3
F4 M5
Sampling with replacement
n=2
Total possible
sample= 25
𝑝 = sample proportion of male
40
9/2/2022
43. Mean of sample
proportions
Variance of sample
proportions
Sampling with
replacement
𝜇𝑝 = 𝜋
𝜎𝑝
2=
𝜋(1 − 𝜋 )
𝑛
Sampling without
replacement
𝜇𝑝 = 𝜋 𝜎𝑝
2 =
(𝑁−𝑛)
(𝑁−1)
𝜋(1−𝜋 )
𝑛
Sampling without replacement, for large N and small n,
(𝑁−𝑛)
(𝑁−1)
→ 1, Then 𝜎𝑝
2=
𝜋(1−𝜋 )
𝑛
n= sample size and N=size of population
𝜎𝑝 =
𝜋(1−𝜋 )
𝑛
= Standard error of proportion= S.E(p)
For unknown 𝜋, 𝑠𝑝 =
𝑝(1−𝑝 )
𝑛
= estimate of 𝜎𝑝.
43
9/2/2022
44. 𝜇𝑝 =
𝑝
25
=
15
25
=
3
5
=𝜋 = population proportion of male
𝜎𝑝
2
=
(𝑝 − 𝜇𝑝)2
25
=
(1 −
3
5
)2+(1 −
3
5
)2+(0 −
3
5
)2+ ⋯ … + (1 −
3
5
)2
25
=
3
25
=
3
5
∗
2
5
/2 = 𝜋(1- 𝜋 )/n
If n→large, 𝑝~ 𝑁(𝜋,
𝜋(1− 𝜋 )
𝑛
) then z =
𝑝−𝜋
𝜋(1−𝜋 )
𝑛
If n→ small, X ~ Binomial(𝜋) >>>> Binomial test
44
9/2/2022
45. Suppose it is known that 90% of pregnant women entering their
3rd trimester had some prenatal care. If a sample of size 200 is
randomly recruited from this population, what will be the chance
that the proportion of having some prenatal care is not more
than 85%.
P(p<0.85)=???; z=
𝑝−𝜋
𝜋(1−𝜋)
𝑛
=
0.85−𝜋
𝜋(1−𝜋)
𝑛
=
0.85−0.90
0.90(1−0.90)
200
=-2.36
P(p<0.85)= P(z<-2,36) =0.0091
45
9/2/2022
46. Point Estimation
Quantitative variable
1. One set of sample
Sample mean (𝑥 )…………. Estimate ……. Population mean (𝜇)
Sample variance (𝑠2) ….. Estimate …… Population variance (2)
Error of estimation=SEM=SD(𝑥 )= 𝜎𝑥 =
𝜎
𝑛
For unknown 2, SD(𝑥 ) …… Estimate …… 𝑠𝑥 =
𝑠
𝑛
46
9/2/2022
47. Quantitative variable
2. Two independent sets of sample (𝑖=1,2)
Sample mean (𝑥𝑖)…………. Estimate ……. Population mean (𝜇𝑖)
Sample variance (𝑠𝑖
2
) ….. Estimate …… Population variance (𝜎𝑖
2
)
Error of estimation=SEM=SD(𝑥𝑖)= 𝜎𝑥𝑖
=
𝜎𝑖
𝑛𝑖
For unknown 2, SD(𝑥𝑖) …… Estimate …… 𝑠𝑥𝑖
=
𝑠𝑖
𝑛𝑖
Difference of sample mean = (𝑥1- 𝑥2) ….. Estimate … (𝜇1- 𝜇2)
Error of estimation …… 𝜎 𝑥1 −𝑥2
= 𝜎𝑥1
2
+ 𝜎𝑥2
2
=
𝜎1
2
𝑛1
+
𝜎2
2
𝑛2
For unknown variances of both group, SD (𝑥1- 𝑥2) …… Estimate ……
Equal variances, SD (𝑥1- 𝑥2) = 𝑠𝑝
2
(
1
𝑛1
+
1
𝑛2
) ; 𝑠𝑝
2 =
(𝑛𝑖−𝑖)𝑠𝑖
2
(𝑛𝑖−1)
=pooled variance
Unequal variances, SD (𝑥1- 𝑥2) =
𝑠1
2
𝑛1
+
𝑠2
2
𝑛2
Point Estimation
47
48. Qualitative variable
1. One set of sample
Sample proportion (p)…. Estimate ….. Population proportion ()
Error of estimation=SD(p) = 𝜎𝑝 =
𝜋(1−𝜋)
𝑛
For unknown , 𝜎𝑝 ….. Estimate …. 𝑠𝑝 =
𝑝(1−𝑝)
𝑛
Point Estimation
48
9/2/2022
49. Qualitative variable
2. Two independent sets of sample (𝑖=1,2)
Sample proportion (𝑝𝑖)…………. Estimate ……. Population proportion (𝑖)
Error of estimation=SD(𝑝𝑖) = 𝜎𝑝𝑖
=
𝜋𝑖 (1−𝜋𝑖)
𝑛𝑖
For unknown 𝜋𝑖, 𝜎𝑝𝑖
….. Estimate …. 𝑠𝑝𝑖
𝑝𝑖(1−𝑝𝑖)
𝑛𝑖
Difference of sample proportion = (𝑝1- 𝑝2) ….. Estimate … (1- 2) …. Risk difference
Known population proportions
Error of estimation …… 𝜎(𝑝1− 𝑝2) =
𝜋1 (1−𝜋1)
𝑛1
+
𝜋2 (1−𝜋2)
𝑛2
Unknown population proportions
Error of estimation …… 𝑠(𝑝1− 𝑝2) =
𝑝1 (1−𝑝1)
𝑛1
+
𝑝2 (1−𝑝2)
𝑛2
Point Estimation
49
9/2/2022
50. Interval Estimation
Pr(𝑧𝛼
2
< 𝑧 < 𝑧(1−
𝛼
2
)) = 1−
From CLT and known pop. variance: 𝑥 N(𝜇𝑥,𝜎𝑥
2
) 𝜇𝑥=𝜇
𝜎𝑥
2
=
𝜎2
𝑛
then Z=
𝑥−𝜇
𝜎
𝑛
N(0,1).
Then Pr(𝑥 + 𝑧𝛼
2
𝜎
𝑛
< 𝜇< 𝑥 + 𝑧(1−
𝛼
2
)
𝜎
𝑛
) = 1−
100(1−)% Confidence Interval of 𝜇 = 𝑥 ± 𝑧(1−
𝛼
2
)
𝜎
𝑛
/2 /2
Z/2 Z(1-/2) Z
Area = 1-
Z
50
9/2/2022
51. For unknown population variance, 𝑠2 ….. Estimate …… 2, then 𝑥 N(𝜇𝑥,𝑠𝑥
2
)
𝜇𝑥=𝜇 , 𝑠𝑥
2
=
𝑠2
𝑛
then t=
𝑥−𝜇
𝑠
𝑛
t with df=(n-1)
Then Pr(𝑥 + 𝑡𝛼
2
𝑠
𝑛
< 𝜇< 𝑥 + 𝑡(1−
𝛼
2
)
𝑠
𝑛
) = 1−
100(1−)% Confidence Interval of 𝜇 = 𝑥 ± 𝑡(1−
𝛼
2
)
𝜎
𝑛
100(1−)% CI of True Mean =
𝑥 ± 𝑧(1−
𝛼
2
)
𝜎
𝑛
,
𝑥 ± 𝑡(1−
𝛼
2
)
𝑠
𝑛
t with df=(n-1)
Interval Estimation
51
9/2/2022
52. From the same concept, we can obtain 100(1−)% CI of (𝜇1- 𝜇2) as the
following
1. Known pop. Var.
100(1−)% CI of (𝜇1- 𝜇2) = (𝑥1- 𝑥2) ± 𝑧(1−
𝛼
2
)
𝜎1
2
𝑛1
+
𝜎2
2
𝑛2
2. Unknown pop. Var.
2.1 Equal pop. var. …… estimate by 𝑠𝑝
2 =
(𝑛𝑖−𝑖)𝑠𝑖
2
(𝑛𝑖−1)
=pooled variance
100(1−)% CI of (𝜇1- 𝜇2)
= (𝑥1- 𝑥2) ± 𝑡(1−
𝛼
2
)
𝑠𝑝
2
(
1
𝑛1
+
1
𝑛2
) ; t with df=(𝑛1 + 𝑛2 − 2)
Interval Estimation (Diff bet 2 independent pop. Means)
52
9/2/2022
53. 2.2 Unequal var. , each sample var …. Estimate… pop. Var
100(1−)% CI of (𝜇1- 𝜇2)
= (𝑥1- 𝑥2) ± 𝑡(1−
𝛼
2
)
𝑠1
2
𝑛1
+
𝑠2
2
𝑛2
; df =
𝑠1
2
𝑛1
+
𝑠2
2
𝑛2
2
𝑠1
2
𝑛1
2
𝑛1−1
+
𝑠2
2
𝑛2
2
𝑛2−1
Interval Estimation (Diff bet 2 independent pop. Means)
53
9/2/2022
54. Pr(𝑧𝛼
2
< 𝑧 < 𝑧(1−
𝛼
2
)) = 1−
From CLT and for large n: p N(𝜇𝑝,𝜎𝑝
2
) where 𝜇𝑝= and 𝜎𝑝 =
𝜋(1−𝜋)
𝑛
, then
Z=
𝑝−𝜋
𝜋(1−𝜋)
𝑛
N(0,1).
Pr(𝑝 + 𝑧𝛼
2
𝜋(1−𝜋)
𝑛
<< 𝑝 + 𝑧(1−
𝛼
2
)
𝜋(1−𝜋)
𝑛
) = 1−, estimate again by p
100(1−)% Confidence Interval of = 𝑝 ± 𝑧(1−
𝛼
2
)
𝑝(1−𝑝)
𝑛
Interval Estimation
/2 /2
Z/2 Z(1-/2) Z
Area = 1-
Z
54
9/2/2022
55. Interval Estimation of diff. bet 2 independent proportions
From CLT and for large sample size 𝑛𝑖(i=1,2)
𝑝𝑖 N(𝜇𝑝𝑖
,𝜎𝑝𝑖
2
) where 𝜇𝑝𝑖
=𝜋𝑖 and 𝜎𝑝𝑖
=
𝜋𝑖 (1−𝜋𝑖)
𝑛𝑖
,
Then (𝑝1- 𝑝2) N 𝜇(𝑝1−𝑝2), 𝜎(𝑝1−𝑝2)
2
,
where 𝜇(𝑝1−𝑝2) = (1- 2) and 𝜎(𝑝1−𝑝2)
2
=
𝜋1 (1−𝜋1)
𝑛1
+
𝜋2 (1−𝜋2)
𝑛2
.
Since we do not know those 2 pop. proportions, they both will be estimated
by sample proportion of each group.
100(1−)% CI of(1- 2) = (𝑝1- 𝑝2) ± 𝑧(1−
𝛼
2
)
𝑝1 (1−𝑝1)
𝑛1
+
𝑝2 (1−𝑝2)
𝑛2
55
9/2/2022
56. RR: Relative Risk
When we wish to compare the probabilities of a specified
event in two different groups, the concept of relative risk is
often useful.
Applicationin Health Research (RR, OR)
56
9/2/2022
57. Relative Risk: RR
Cohort or Follow-up or Experimental study
Group
Outcome
Total
Cure Not cure
A A B N1
B C D N2
Total A+C B+D N
Population
Sample
Sex(EXPOSED)
Outcome
Total
Cure Not cure
A a b n1
B c d n2
Total a+c b+d n
STARTING POINT
STARTING POINT
57
9/2/2022
58. True Cure rate of A = A/N1 =𝑃𝐴 and
True Cure rate of B= C/N2 =𝑃𝐵
TRUE RR=
𝑃𝐴
𝑃𝐵
Estimate RR or 𝑅𝑅=
𝑝𝐴
𝑝𝐵
=
𝑎/𝑛1
𝑐/𝑛2
SE (ln 𝑅𝑅)=
1
𝑎
+
1
𝑐
−
1
𝑛1
−
1
𝑛2
1/2
95% confidence interval of RR
= exp( ln 𝑅𝑅 ± 1.96×SE(ln𝑅𝑅) )
58
9/2/2022
59. TRUE Risk Difference:
RD= Cure rate of A – Cure Rate of B = 𝑃𝐴- 𝑃𝐵
Estimate Risk Difference:
𝑅𝐷 = Sample Cure rate of A – sample Rate of B
= 𝑝𝐴- 𝑝𝐵
SE(𝑅𝐷)= SE(𝑝𝐴- 𝑝𝐵) =
𝑝𝐴(1−𝑝𝐴)
𝑛1
+
𝑝𝐵(1−𝑝𝐵)
𝑛2
59
9/2/2022
60. OR: Odds Ratio
Another measure that is often used to compare the
probabilities of an event in two different groups. Unlike the
relative risk, which compares the probabilities directly,
however, the odds ratio (as its name would suggest) relates
the odds of the event in the two populations.
60
9/2/2022
61. From Design … to analysis
Cross-sectional Study : Strictly practice on DMHTT
Sex
Strictly practice on DMHTT
Total
Yes No
Male A B N1
Female C D N2
Total A+C B+D N
Population
Sample
Sex
Strictly practice on DMHTT
Total
Yes No
Male a b n1
Female c d n2
Total a+c b+d n
61
9/2/2022
62. Rational from Population to sample
• Probability of a Yes outcome in Male = A/N1
• Probability of a No outcome in Male = B/N1
Odd of outcome in Male = Probability of a Yes outcome in Male
Probability of a No outcome in Male
= A/B
Odd of outcome in Female = Probability of a Yes outcome in Female
Probability of a No outcome in Female
= C/D
True OR: OR =
Probability of a Yes outcome in Male
Probability of a No outcome in Male
Probability of a Yes outcome in Female
Probability of a No outcome in Female
=
𝐴𝐷
𝐵𝐶
Estimated OR: 𝑂𝑅 =
𝑎𝑑
𝑏𝑐
62
9/2/2022
64. Probability of outcome and odds ratio
Outcome Total
Yes No
Group A 20 60 80
Group B 10 90 100
Total 30 150 180
• Probability of a Yes outcome in group A = 20/80 =25%
• Probability of a No outcome in group A = 60/80 = 75%
• Odd of outcome in group A = Probability of a Yes outcome in group A
Probability of a No outcome in group A
= 0.25/0.75 =0.33
• Odd of outcome in group B = Probability of a Yes outcome in group B
Probability of a No outcome in group B
= 0.10/0.90 =0.11
64
9/2/2022
65. Probability of outcome and odds ratio (2)
Outcome Total
Yes No
Group A 20 60 80
Group B 10 90 100
Total 30 150 180
• Odds ratio of group A to group B =
odds of outcome in group A
odds of outcome in group B
= 0.33/0.11 =3.00
65
9/2/2022
66. Applicationin Health Research (OR)
Case-Control Study : Strictly practice on DMHTT
Sex(EXPOSED)
Strictly practice on DMHTT
Total
Yes No
Young A B A+B
OLD C D C+D
Total N1 N2 N
Population
Sample
Sex(EXPOSED)
Strictly practice on DMHTT
Total
Yes No
Young a b a+b
Old c d c+d
Total n1 n2 n
STARTING POINT
STARTING POINT
66
9/2/2022
67. • Probability of a Yes outcome in Male = A/N1
• Probability of a No outcome in Male = C/N1
Odd of Exposed among CASE = Probability of EXPOSED among CASE
Probability of Non−Exposed among CASE
= A/C
Odd of Exposed among CONTROL=
Probability of EXPOSED among CONTROL
Probability of Non−Exposed among CONTROL
= B/D
True OR: OR =
Probability of EXPOSED among CASE
Probability of Non−Exposed among CASE
Probability of EXPOSED among CONTROL
Probability of Non−Exposed among CONTROL
=
𝐴𝐷
𝐵𝐶
Estimated OR: 𝑂𝑅 =
𝑎𝑑
𝑏𝑐
67
9/2/2022
69. Relationships among the probabilities of an event in an exposed and unexposed group. Find the
odds ratio, the relative risk and interpret.
p(exp) p(unexp) OR RR
0.01 0.012
0.05 0.059
0.25 0.286
0.50 0.545
0.01 0.014
0.05 0.069
0.25 0.318
0.50 0.583
0.01 0.020
0.05 0.095
0.25 0.400
0.50 0.667