Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
Â
INFERENTIAL STATISTICS
1. KNOWLEDGE FOR THE BENEFIT OF HUMANITYKNOWLEDGE FOR THE BENEFIT OF HUMANITY
BIOSTATISTICS (HFS3283)
INFERENTIAL STATISTICS
Dr.Dr. MohdMohd RazifRazif ShahrilShahril
School of Nutrition & DieteticsSchool of Nutrition & Dietetics
Faculty of Health SciencesFaculty of Health Sciences
UniversitiUniversiti SultanSultan ZainalZainal AbidinAbidin
1
2. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
Topic Learning Outcomes
At the end of this lecture, students should be able to;
⢠define inferential statistics.
⢠explain hypothesis tests, p value and type I and II error
⢠explain how to interpret confidence interval
2
3. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
3
4. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
What is INFERENTIAL STATISTICS?
4
⢠It is the Statistical Technique/ method used to
infer the result of the sample (statistic) to the
population (parameter)
PopulationPopulation SampleSample
5. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
Two types of inferential statistics
5
⢠Hypothesis tests
â e.g.
⢠Comparing 2 means
⢠Comparing 2 proportions
⢠Association between one variable and another variable
⢠Estimation (Confidence interval)
â e.g.
⢠Estimating a mean (numerical)
⢠Estimating a proportion (categorical)
â˘
1
2
6. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
HypothesisHypothesis
6
⢠A statement derived from primary research
question(s)
⢠There may be more than one hypothesis to be
tested in one study, but the fewer the better.
⢠It usually comes out of a hunch, an educated
guess based on published results or preliminary
observations
⢠In writing a hypothesis;
â State hypothesis (hypotheses) clearly and specifically
â Include study and outcome factors
â˘
7. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
Research question??
7
⢠A statement derived from primary research
question(s)
⢠Most studies are concerned with answering the
following questions:
â What is the magnitude of a health problem or health
factor?
â What is the causal relation between one factor (or
factors) and the disease or outcome of interest?
â What is the efficacy of an intervention?
â˘
8. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
Hypothesis testing
8
⢠In Hypothesis testing, we answer to a specific
question related to a population parameter (e.g.
population mean nutrition knowledge score)
using a sample statistic (e.g. sample mean
nutrition knowledge score).
9. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
Hypothesis testing
9
⢠In Hypothesis testing, we answer to a specific
question related to a population parameter (e.g.
population mean BMI) using a sample statistic
(e.g. sample mean BMI).
⢠RQ â Is the BMI of the population different
from 24.5 kg/m2 or not?
⢠Answer â Yes or No
â Null hypothesis (Ho ; Âľ = 24.5)
â Alternate hypothesis (Ha ; Âľ â 24.5)
10. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
Null hypothesis
10
⢠Is a mathematical statement of equality stated
before data collection or data analysis
⢠After statistical analysis, a p value is calculated.
⢠Based on this p value, decision is made to
either accept or reject the Ho.
11. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
Null hypothesis (cont.)
11
12. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
Hypothesis testing (cont.)
12
⢠RQ - Is the BMI of the population different
from 24.5 kg/m2 or not?
â Ho ; Âľ = 24.5 mean = 26.1, SD = 4.3
â Ha ; Âľ â 24.5 n = 130
⢠This test is called âOne sample t testâ.
⢠At the end of the hypothesis testing, we will get a
P value.
â If the P value is < 0.05, we reject the Null Hypothesis
(Ho). And conclude as Ha.
â If the P value is ⼠0.05, we cannot reject the Null
Hypothesis (Ho). And conclude as Ho.
13. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
Hypothesis testing (cont.)
13
⢠RQ - Is the BMI of the population different
from 24.5 kg/m2 or not?
â Ho ; Âľ = 24.5 mean = 26.1, SD = 4.3
â Ha ; Âľ â 24.5 n = 130
⢠In above example, if we get P = 0.01, we reject the
null hypothesis (Ho), then ...
â We conclude as Alternative Hypothesis (Ha) ⌠âthe mean
BMI of the population is different from 24.5 kg/m2â.
â Alternatively, we may report as ⌠âthe mean BMI is
significantly different from 24.5 kg/m2â.
⢠Note:
⢠(1) The second conclusion is more commonly used in the literature.
⢠(2) These are âstatistical conclusionâ, not yet âresearch conclusionâ.
14. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
Hypothesis testing (cont.)
14
⢠RQ - Is the BMI of the population different
from 24.5 kg/m2 or not?
â Ho ; Âľ = 24.5 mean = 26.1, SD = 4.3
â Ha ; Âľ â 24.5 n = 130
⢠In above example, if we get P = 0.08, we CANNOT
reject the null hypothesis, then âŚ
â We conclude as ⌠âthe mean BMI in the population is NOT
different from 24.5 kg/m2â.
â Alternatively we may report as ... âthe mean BMI is NOT
significantly different from 24.5 kg/m2â.
15. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
Hypothesis testing (cont.)
15
⢠RQ - Is the BMI of the population different
from 24.5 kg/m2 or not?
â Ho ; Âľ = 24.5 mean = 26.1, SD = 4.3
â Ha ; Âľ â 24.5 n = 130
⢠In this one sample t test, there are assumptions or
requirements that we need to fulfil/check.
â (1) The sample is selected by using random sampling.
â (2) Observations are independent.
â (3) The data is ânormally distributedâ (called âNormality
Assumptionâ).
16. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
Steps in Hypothesis Testing
16
Step (1):Step (1): Generate null and alternative hypothesis
Step (2):Step (2): Set the significance level (Îą)
Step (3):Step (3): Decide which statistical test to use and check
the assumptions of the test
Step (4):Step (4): Perform test statistic and associated p -value
Step (5):Step (5): Make interpretation (based on p value & CI)
Step (6):Step (6): Draw conclusion
17. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
P value?
17
⢠The p value is the probability (likehood) that the
result or difference was due to chance.
⢠A p value of 0.05 indicates that a 5% probability
that the difference observed between the groups
was due to chance
18. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
P value? (cont.)
18
⢠RQ - Is the BMI of the population different
from 24.5 kg/m2 or not?
â Ho ; Âľ = 24.5 mean = 26.1, SD = 4.3
â Ha ; Âľ â 24.5 n = 130
⢠If the P value < 0.05, we reject the Null Hypothesis.
⢠P value is the probability of error if you reject the Null
Hypothesis and conclude as the Alternative Hypothesis.
⢠Example: P value = 0.01. It means that âŚ
â There is 1% probability of error in our conclusion, if we conclude
as Alternative Hypothesis (âsignificantly differentâ).
â We normally, allow less than 5% error. That is why the cut-off
point for P value is 0.05.
19. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
P value? (cont.)
19
⢠RQ - Is the BMI of the population different
from 24.5 kg/m2 or not?
â Ho ; Âľ = 24.5 mean = 26.1, SD = 4.3
â Ha ; Âľ â 24.5 n = 130
⢠If the P value < 0.05, we reject the Null Hypothesis.
⢠P value is the probability of error if you reject the Null
Hypothesis and conclude as the Alternative Hypothesis.
⢠Example: P value = 0.2. It means that âŚ
â There is 20% probability of error in our conclusion, if we
conclude as Alternative Hypothesis (âsignificantly differentâ).
â Therefore, we canât conclude as it is âsignificantly differentâ.
We have to conclude as âthe difference is not significantâ.
20. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
P value? (cont.)
20
⢠RQ - Is the BMI of the population different
from 24.5 kg/m2 or not?
â Ho ; Âľ = 24.5 mean = 26.1, SD = 4.3
â Ha ; Âľ â 24.5 n = 130
⢠If the P value < 0.05, we reject the Null Hypothesis.
⢠It means that we have set the cut-off point at P less than
0.05 to reject the Ho.
⢠We say this as setting the âAlphaâ at 0.05
⢠Because the type of error that we have been talking
about, is called âType I errorâ or âAlpha errorâ.
21. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
One-sided or Two-sided test?
21
⢠RQa - Is the BMI of the population different
from 24.5 kg/m2 or not?
â Ho ; Âľ = 24.5
â Ha ; Âľ â 24.5
⢠RQb - Is the BMI of the population more than
24.5 kgm2 or not?
â Ho ; Âľ ⤠24.5
â Ha ; Âľ > 24.5
⢠RQc - Is the BMI of the population less than
24.5 kgm2 or not?
â Ho ; Âľ ⼠24.5
â Ha ; Âľ < 24.5
TwoTwo--sided hypothesis testsided hypothesis test
OneOne--sided hypothesis testsided hypothesis test
OneOne--sided hypothesis testsided hypothesis test
22. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
One-sided or Two-sided test? (cont.)
22
23. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
Type I (Îą) and Type II (É) Error
23
Usually we set;
⢠Type I (ι) error
= 5%
⢠Type II (É) error
= 20%
24. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
Type I (Îą) and Type II (É) Error (cont.)
24
Power = 1 - É
Usually, power = 1 â 0.2
= 0.8 or 80%
25. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
Type I (Îą) and Type II (É) Error (cont.)
25
26. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
Estimation (Confidence Interval)Estimation (Confidence Interval)
26
⢠Used to express degree of confidence in an
estimate
⢠Denotes the range of possible values the
estimate may assume, with a certain degree of
assurance
⢠Gives reliability on an estimate.
⢠E.g.
â If mean systolic BP level is 118 mmHg and 95% CI is
(110, 125); it means that you can be 95% confident
that the true systolic BP level mean lies between
110mmHg and 125mmHg.
27. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
Estimation a Mean
27
⢠Formula = mean ¹ (t ι/2 x SE)
⢠Calculation example;
â The systolic BP of 100 students in a class
Mean = 123.4 mmHg
SD = 14.0 mmHg
â The 95% CI = Mean Âą t Îą/2 x (SD/ân)
= 123.4 Âą (1.98 x 14/10)
â The 95% CI ranges from 120.6 mmHg to 126.2 mmHg
â We can be 95% sure that this range includes the true
population mean.
â˘
28. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
Assumption or Requirement
28
Calculation can be made only ...
1) When the sample is selected by using random
sampling
2) When the observations are independent
3) When the data is ânormally distributedâ (called
âNormality Assumptionâ)
29. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
Estimation a Proportion
29
⢠Formula = p ¹ (Z ι/2 x SE)
⢠Calculation example;
â The abnormal systolic BP of 100 students in a class
Prevalence = 37%
â The 95% CI = p Âą (Z Îą/2 x SE)
= 0.37 Âą (1.96 x â((0.37*0.63)/100)
â The 95% CI ranges from 0.27 to 0.47 (or 27% to
47%)
â We can be 95% sure that this range includes the
true population proportion.
â˘
30. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
Assumption or Requirement
30
Calculation can be made only ...
1) When the sample is selected by using random
sampling
2) When the observations are independent
3) BOTH (n*p) and (n *(1-p)) must be more than 5
count.
Note:
⢠Recode into 0 and 1;
â 1 should be âdiseaseâ or condition of interest.
â 0 should be ânon-diseaseâ or the rest.
31. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
P value and Confidence Interval
31
⢠When using statistics to compare 2 groups, 2
approaches can be used;
â Calculation of p value
â Calculation of confidence interval
⢠Both p values and confidence interval are
complimentary and should be used together.
32. S C H O O L O F N U T R I T I O N A N D D I E T E T I C S ⢠U N I V E R S I T I S U L T A N Z A I N A L A B I D I N
Limitation of P value
32
⢠p values are sensitive to sample size
â Large sample size, smaller p value
⢠p values are sensitive to the magnitude of
difference between the two group:
â If the difference is small but samples are large, results
can still be statistically significant
â If the difference is large but samples are small, result
would not be statistically significant