At the end of this lecture, the students should be able to
1.Understand structure of research study appropriate for ANOVA test
2.Understand how to evaluate the assumptions underlying this test
3. interpret SPSS outputs and report the results
a full lecture presentation on ANOVA .
areas covered include;
a. definition and purpose of anova
b. one-way anova
c. factorial anova
d. mutiple anova
e MANOVA
f. POST-HOC TESTS - types
f. easy step by step process of calculating post hoc test.
a full lecture presentation on ANOVA .
areas covered include;
a. definition and purpose of anova
b. one-way anova
c. factorial anova
d. mutiple anova
e MANOVA
f. POST-HOC TESTS - types
f. easy step by step process of calculating post hoc test.
This presentation explains the concept of ANOVA, ANCOVA, MANOVA and MANCOVA. This presentation also deals about the procedure to do the ANOVA, ANCOVA and MANOVA with the use of SPSS.
Comparing the unpaired t test, small sample formula with the one with large sample formula and the Welch's test. The latter is an unpaired t test for samples of unequal sizes with unequal variances.
This presentation explains the concept of ANOVA, ANCOVA, MANOVA and MANCOVA. This presentation also deals about the procedure to do the ANOVA, ANCOVA and MANOVA with the use of SPSS.
Comparing the unpaired t test, small sample formula with the one with large sample formula and the Welch's test. The latter is an unpaired t test for samples of unequal sizes with unequal variances.
At the end of this lecture, the student should be able to:
1. understand structure of research study appropriate for independent-measures t hypothesis test
2. test between two populations or two treatments using independent measures t statistics
3. understand how to evaluate the assumptions underlying this test
Hypothesis is usually considered as the principal instrument in research and quality control. Its main function is to suggest new experiments and observations. In fact, many experiments are carried out with the deliberate object of testing hypothesis. Decision makers often face situations wherein they are interested in testing hypothesis on the basis of available information and then take decisions on the basis of such testing. In Six –Sigma methodology, hypothesis testing is a tool of substance and used in analysis phase of the six sigma project so that improvement can be done in right direction
Multiple Linear Regression II and ANOVA IJames Neill
Explains advanced use of multiple linear regression, including residuals, interactions and analysis of change, then introduces the principles of ANOVA starting with explanation of t-tests.
About CORE:
The Culture of Research and Education (C.O.R.E.) webinar series is spearheaded by Dr. Bernice B. Rumala, CORE Chair & Program Director of the Ph.D. in Health Sciences program in collaboration with leaders and faculty across all academic programs.
This innovative and wide-ranging series is designed to provide continuing education, skills-building techniques, and tools for academic and professional development. These sessions will provide a unique chance to build your professional development toolkit through presentations, discussions, and workshops with Trident’s world-class faculty.
For further information about CORE or to present, you may contact Dr. Bernice B. Rumala at Bernice.rumala@trident.edu
Estimating sample size through simulationsArthur8898
Determining sample size is one critical and important procedure for designing an experiment. The sample size for most statistical models can be easily calculated by using the POWER procedure. However, the PROC POWER cannot be used for a complicated statistical model. This paper reviews a more generalized method to estimate the sample size through a simulation approach by using SAS® software. The simulation approach not only applies to the simple but also to a more complex statistical design.
Data.savQuestion.docxOver the same period, determine wheth.docxtheodorelove43763
Data.sav
Question.docx
Over the same period, determine whether there is any difference in the weight change trajectory for babies who meet their nutritional goals versus babies who do not meet their nutritional goals (all goals).
1.
Now that we've answered the question about whether there were weight changes for all subjects (and then changes during the two periods), we want to know whether there were any differences between the two comparison groups (all nutrition goals met versus not met) for the same periods (overall, birth to 28 days, and 28 days to discharge).
Formulate three null hypotheses to reflect these new questions
2.
Now let's look more closely at the separate periods.
As it turns out, the weight change trajectories were not significantly different between the two groups of babies in all periods.
During which period do we see significant differences in weight change trajectories between the two nutritional groups?
What is the F statistic for the difference in weight change trajectories of the two groups during the period in which the trajectories were significantly different? Answer What is the p value? Answer
What is the change in the means of the two groups during this period:
· for the babies who met all their nutritional goals? Answer grams
· for the babies who did NOT meet all their nutritional goals? Answer grams
3.
Report and interpret the findings with respect to the difference in weight changes for the two groups for the three hypotheses above
Though we've determined that there is a significant difference between nutrition groups in terms of weight change, we notice that the two groups are different in terms of the length of stay. So, we wonder whether our previous findings might be altered if we take NICU length of stay (LOS) into account.
Now, we take the same model we built in the above questions, add length of stay as a covariate.
4. After controlling for length of stay, we see that the difference in the overall weight change trajectories between the two groups has changed.
What is the F statistic for the overall weight change trajectory? Answer
What is the p value? Answer
Which of the following is a reasonable conclusion regarding our hypothesis that there is a difference in weight change trajectories of the two nutritional groups after controlling for length of stay?
We fail to reject the null and conclude that there is no difference in weight change trajectories of babies who did and who did not meet their nutritional goals after controlling for length of stay
We reject the null and conclude that there is no difference in weight change trajectories of babies who did and who did not meet their nutritional goals after controlling for length of stay
We reject the null and conclude that even after controlling for length of stay, babies who met their nutritional goals still had significantly different weight change trajectories than babies who did not meet their nutritional goals
5. We would like so.
univariate and bivariate analysis in spss Subodh Khanal
this slide will help to perform various tests in spss targeting univariate and bivariate analysis along with the way of entering and analyzing multiple responses.
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
Explore our comprehensive data analysis project presentation on predicting product ad campaign performance. Learn how data-driven insights can optimize your marketing strategies and enhance campaign effectiveness. Perfect for professionals and students looking to understand the power of data analysis in advertising. for more details visit: https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/
Show drafts
volume_up
Empowering the Data Analytics Ecosystem: A Laser Focus on Value
The data analytics ecosystem thrives when every component functions at its peak, unlocking the true potential of data. Here's a laser focus on key areas for an empowered ecosystem:
1. Democratize Access, Not Data:
Granular Access Controls: Provide users with self-service tools tailored to their specific needs, preventing data overload and misuse.
Data Catalogs: Implement robust data catalogs for easy discovery and understanding of available data sources.
2. Foster Collaboration with Clear Roles:
Data Mesh Architecture: Break down data silos by creating a distributed data ownership model with clear ownership and responsibilities.
Collaborative Workspaces: Utilize interactive platforms where data scientists, analysts, and domain experts can work seamlessly together.
3. Leverage Advanced Analytics Strategically:
AI-powered Automation: Automate repetitive tasks like data cleaning and feature engineering, freeing up data talent for higher-level analysis.
Right-Tool Selection: Strategically choose the most effective advanced analytics techniques (e.g., AI, ML) based on specific business problems.
4. Prioritize Data Quality with Automation:
Automated Data Validation: Implement automated data quality checks to identify and rectify errors at the source, minimizing downstream issues.
Data Lineage Tracking: Track the flow of data throughout the ecosystem, ensuring transparency and facilitating root cause analysis for errors.
5. Cultivate a Data-Driven Mindset:
Metrics-Driven Performance Management: Align KPIs and performance metrics with data-driven insights to ensure actionable decision making.
Data Storytelling Workshops: Equip stakeholders with the skills to translate complex data findings into compelling narratives that drive action.
Benefits of a Precise Ecosystem:
Sharpened Focus: Precise access and clear roles ensure everyone works with the most relevant data, maximizing efficiency.
Actionable Insights: Strategic analytics and automated quality checks lead to more reliable and actionable data insights.
Continuous Improvement: Data-driven performance management fosters a culture of learning and continuous improvement.
Sustainable Growth: Empowered by data, organizations can make informed decisions to drive sustainable growth and innovation.
By focusing on these precise actions, organizations can create an empowered data analytics ecosystem that delivers real value by driving data-driven decisions and maximizing the return on their data investment.
1. HFS3283
PAIRED T-TEST
&
ONE WAY-ANOVA
DR. SHARIFAH WAJIHAH WAFA BTE SST WAFA
School of Nutrition and Dietetics
Faculty of Health Sciences
sharifahwajihah@unisza.edu.my
KNOWLEDGE FOR THE BENEFIT OF HUMANITY
2. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Topic Learning Outcomes
At the end of this lecture, the student should be
able to:
1
• Understand structure of research study appropriate
for ANOVA test
2
• Understand how to evaluate the assumptions
underlying this test
3 • interpret SPSS outputs and report the results
3. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Overview
t-tests
1. One-sample t-test
2. Independent samples t-test
3. Paired samples t-test
ANOVAs
1. 1-way ANOVA
2. 1-way repeated measures ANOVA
3. Factorial ANOVA
4. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Why a t-test or ANOVA?
•A t-test or ANOVA is used to determine
whether a sample of scores are from the same
population as another sample of scores.
•These are inferential tools for examining
differences between group means.
• Is the difference between two sample means
‘real’ or due to chance?
5. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
t-tests
•One-sample
One group of participants, compared with fixed,
pre-existing value (e.g., population norms)
•Independent
Compares mean scores on the same variable
across different populations (groups)
•Paired
Same participants, with repeated measures
6. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Major assumptions
•Normally distributed variables
•Homogeneity of variance
In general, t-tests and ANOVAs are robust to
violation of assumptions, particularly with
large cell sizes, but don't be complacent.
7. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Use of t in t-tests
•t reflects the ratio of between group variance
to within group variance
•Is the t large enough that it is unlikely that the
two samples have come from the same
population?
•Decision: Is t larger than the critical value for t?
(see t tables – depends on critical and N)
8. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
One-tail vs. two-tail tests
• Two-tailed test rejects null hypothesis if
obtained t-value is extreme is either direction
• One-tailed test rejects null hypothesis if
obtained t-value is extreme is one direction
(you choose – too high or too low)
• One-tailed tests are twice as powerful as two-
tailed, but they are only focused on identifying
differences in one direction.
9. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
One sample t-test
• Compare one group (a sample) with a fixed, pre-
existing value (e.g., population norms)
• Do uni students sleep less than
the recommended amount?
e.g., Given a sample of N = 190 uni
students who sleep M = 7.5 hrs/day (SD
= 1.5), does this differ significantly from 8
hours hrs/day ( = .05)?
10. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
One sample t-test
11. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Independent groups t-test
• Compares mean scores on the same variable
across different populations (groups)
• Do males & females differ in the
amount of sleep they get?
12. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Assumptions
(Indep. samples t-test)
•LOM
– IV is ordinal / categorical
– DV is interval / ratio
•Homogeneity of Variance: If variances unequal (Levene’s
test), adjustment made
•Normality: t-tests robust to modest departures from
normality, otherwise consider use of Mann-Whitney U test
•Independence of observations (one participant’s score is
not dependent on any other participant’s score)
13. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Do males and females differ in in
amount of sleep per night?
14. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Independent samples t-test
•Comparison b/w means of 2 independent
sample variables = t-test
(e.g., what is the difference in Educational Satisfaction
between male and female students?)
•Comparison b/w means of 3+
independent sample variables = 1-way
ANOVA
(e.g., what is the difference in Educational Satisfaction
between students enrolled in four different faculties?)
16. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Paired samples t-test
1-way repeated measures ANOVA
• Same participants, with repeated measures
• Data is sampled within subjects. Measures are
repeated e.g.,:
–Time e.g., pre- vs. post-intervention
–Measures e.g., approval ratings of
brand X and brand Y
• The paired t-test will show whether the differences
observed in the 2 measures will be found reliably in
repeated samples.
17. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Assumptions
(Paired samples t-test)
• LOM:
– IV: Two measures from same participants (w/in
subjects)
• a variable measured on two occasions or
• two different variables measured on the same occasion
– DV: Continuous (Interval or ratio)
• Normal distribution of difference scores (robust to
violation with larger samples)
• Independence of observations (one participant’s score is
not dependent on another’s score)
18. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example
• In this example, we want to compare the weight
changes amongst obese children after 6 weeks went
for weight management program.
• Five obese children are selected at random from the
school A.
• We are interested in the following research question:
Does an intervention have an effect on body weight of
the obese children?
• The average weight, in both pre and post treatment
is recorded in columns 1 and 2 (see next slide) for
each of 5 people (P1-P5):
20. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 4.1 (cont.)
• Unlike the independent samples t-test, on each row
the numbers in columns 2 and 3 come from the
same people.
• Person 2, for example, weigh of 64.5 kg in the pre-
treatment, but lost to an average of 62 kg in the
post-treatment (after all treatment was completed).
• It appears that this treatment may have improved
body weight.
21. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 4.1 (cont.)
• The paired t-test will allow us to see if the
improvement that we see in this sample is reliable.
• If we selected another 5 obese children
at random from the weight management program,
would we still see an improvement?
• Without having to go through the trouble and
expense of repeated sampling (called replication), we
can estimate whether the difference in the 2 means
is so large in magnitude that we would likely find the
same result if we chose another 5 persons.
22. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 4.1 (cont.)
1
2 212121
22
21
n
SSrSS
t
xxxxxx
, df = n-1
23. Example 4.1 (cont.)
• This paired “t” needs a couple more values
that we have not yet computed.
• First, we need to find the Standard Deviation
of X1 and X2, called Sx1 and Sx2.
• These are simply the square-root of the
variances
( and ).6325.04.02
1
xS 3784.19.12
2
xS
24. Example 4.1 (cont.)
• Second, we need to find the correlation
between the pre and post-treatment ( ),
or likewise columns 2 and 3.
• Another section will illustrate how to compute
a correlation.
• This computation is somewhat long, so we’ll
avoid it for now.
• I’ll just tell you the correlation is:
rx1x2=0.9177.
• Any scientific or statistical calculator can get
you this answer.
21xxr
25. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 4.1 (cont.)
15
)3784.1)(6325)(.9177(.29.14.
5254
t 4.78, df = 4
26. Example 4.1 (cont.)
• Finally, this computed “t” statistic must be compared
with the critical value of the t-distribution.
• The critical value of the “t” is the highest magnitude
we should expect to find if there is really no
difference between the population means of X1 and
X2, or in other words, no difference between weight
in the pre and post treatment in the weight
management program.
• Since we expect there should be a weight loss, this is
a 1-tailed test.
SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
27. Example 4.1 (cont.)
• The C.V. t(4), α=.05 = 2.132, therefore we reject the null
hypothesis because the absolute value of our “t” at
4.78 is greater than the critical value.
• This is a 1-tailed t-test, so we must verify this
conclusion by noting that the mean of the post
treatment at 52kg, is lower than than the mean of
the pre-treatment average of 54 kg.
SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
28. Example 4.1 (cont.)
Our research conclusion states the facts in simple
terms:
mean weight was decreased significantly from the
pre-treatment
(M = 54) to the post-treatment (M = 52),
t(4) = 4.78, p < .05 (one-tailed).
SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
29. Example 4.1 Using SPSS
• First, we must setup the variables in SPSS.
• Although not strictly necessary, it is good practice to
give a unique code to each participant (“personid”).
• Unlike the independent samples t-test, the paired t-
test has separate entries for 2 dependent variables,
rather than an independent and dependent:
– DependentVariable1 = pretreat
(for Pre-treatment)
– DependentVariable2 = posttreat
(for Post-treatment)
SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
30. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 4.1 Using SPSS
• In our example, the variables are setup as
follows in the SPSS variable view:
31. Example 4.1 Using SPSS
• To run a Paired Samples t Test in SPSS, click
Analyze > Compare Means > Paired-Samples
T Test.
SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
32. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 4.1 Using SPSS
• You will be presented with the Paired-Samples T Test dialogue
box, as shown below.
• transfer the variables pretreat and posttreat into the Paired
Variables: box.
33. Example 4.1 Using SPSS
• Paired Sample Statistics Table
– The first table, titled Paired Samples Statistics, is where SPSS Statistics
has generated descriptive statistics for the variables. You could use the
results here to describe the characteristics of the pre- and post
treatment.
SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
34. Example 4.1 Using SPSS
• Paired Samples Test Table
– The Paired Samples Test table is where the results of the dependent t-
test are presented.
• You are essentially conducting a one-sample t-test on the differences between
the groups.
SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
35. Example 4.1 Using SPSS
• You should focus your attention first of the mean
values for the pre and the post treatment.
• As before, the means (Pre-treatment=54 and Post-
treatment=52) give us our conclusion.
• Namely, we conclude that weight decreased from
the pre to the post season.
• The statistics tell us that our conclusion is true not
only for this sample of 5 persons, but also for other
samples of 5 persons in the weight management
program.
SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
36. Example 4.1 Conclusion
• Our test is 1-tailed, so we must divide the
2-tailed probability provided by SPSS in half
(p=.009/2 = .0045).
• When expressed to 2 significant digits, this value will
round to “.00” and as a result the lowest value that
can be represented in APA style is “p<.01.”
• In short, we can now write our conclusion as follows:
Weight of obese children decreased
significantly from the pre-treatment
(M = 54) to the post-treatment (M = 52),
t(4) = 4.78, p < .01 (one-tailed).
SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
37. t-tests
• Difference between a set value and a variable →
one-sample t-test
• Difference between two independent groups →
independent samples t-test
= BETWEEN-SUBJECTS
• Difference between two related measures (e.g.,
repeated over time or two related measures at one
time) → paired samples t-test
= WITHIN-SUBJECTS
SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
39. Introduction to ANOVA
(Analysis of Variance)
• Extension of a t-test to assess differences in the
central tendency (M) of several groups or variables.
• DV variance is partitioned into between-group and
within-group variance
• Levels of measurement:
• Single DV: metric,
• 1 or more IVs: categorical
SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
40. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Introduction
• ANOVA is an acronym for ANalysis Of VAriance.
• The adjective oneway means that there is a single
variable that defines group membership (called a
factor).
• Comparisons of means using more than one variable
is possible with other kinds of ANOVA analysis.
41. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
When to use a One-way ANOVA
• One-way ANOVA is a generalization of the
independent samples t-test.
• Recall that the independent samples t-test is
used to compare the mean values of 2
different groups.
• A One-way ANOVA does the same thing, but it
has the advantage of allowing comparisons
between more than 2 groups.
42. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
When to use a Oneway ANOVA
(continued)
• In health, for example, we often want to contrast
several conditions in an experiment; such as a
control, a standard treatment, and a newer
“experimental” treatment.
• Because Oneway ANOVA is simply a
generalization of the independent samples t-test,
we use this procedure (to follow) to recalculate
our previous 2 groups example.
• Later, we will do an example with more than 2
groups.
43. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example
• Let’s return to our example
of the nasi lemak (NL) vs. roti canai (RC) diet BUT
now we add up another one which in nasi dagang
diet (ND)
• Our research question is:
“Is there any weight
gain difference between
a 1-week exclusive diet
of either NL, RC or ND?”
44. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example -con’t
Column 3 Column 4 Column 5
X1 : NL X2 : RC X3 : ND
1 3 3 1 1 0
2 4 2 0 0 1
2 4 3 0 0 0
2 4 3 0 0 0
3 5 4 1 1 1
2 4 3 2 2 2
0.4 0.4 0.4
2
11 )(
2
22 )(
1
n
sx
2
2
)(
2 3
2
33 )(
H0: μ1= μ2= μ3
Ha: At least one pair is different.
45. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Comparing the groups
• Averages within groups:
– NL: 2
– RC: 4
– ND: 3
• Total average:
• Variance around the mean matters for comparison.
• We must compare the variance within the groups to
the variance between the group means.
3
555
)3(5)4(5)2(5
46. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Variance within and between
groups
• Sum of squares within groups:
– SSW =
= (1-2)2+(2-2) 2+(2-2) 2+(2-2) 2+……+(5-4) 2+……
= 6
• Compare it with sum of squares between groups:
– SSB =
– = (2-3) 2 +(2-3) 2+(2-3) 2+……+(4-3) 2+……
= 5 (2-3) 2 + 5 (4-3) 2 + (3-3) 2 = 10
– Comparing these, we also need to take into account the
number of observations and sizes of groups
2)( jj
])(X[ 2
Tj
47. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Adjusting for group sizes
• Divide by the number of degrees of freedom
• F:
• , reject H0 if this is large
MSG
MSW
Both are estimates of population
variance of error under H0
n: number of observations
K: number of groups
1
SSG
MSG
K
MSB
SSB
SSW
MSW
n K
MSW
SSW
MSW
MSB
48. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example– Test statistic thresholds
• If populations are normal, with the same
variance, then we can show that under the
null hypothesis,
• Reject at confidence level if
1,~ K n K
MSG
F
MSW
1, ,K n K
MSG
F
MSW
The F distribution, with
K-1 and n-K degrees of
freedom
Find this value in a table
MSB
MSW
MSB
MSW
49. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example– con’t
94.8
9.48
13 3
SSW
MSW
n K
MSW
6
15-3
0.5
52.43
26.2
1 3 1
SSG
MSG
K
MSB
SSB 10
3-1
5.0
26.2
2.76
9.48
MSG
MSW
MSB
MSW
5.0
0.5
10.0 F3-1,15-3,0.05 = 3.89
Thus we reject the null hypothesis in our case.
50. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example – ANOVA table
Next, we need to fill-in the so-called ANOVA table:
Source of
variation
Sum of
squares
Deg. of
freedom
Mean
squares
F ratio
Between
groups
SSB K-1 MSB MSB/MSW
Within
groups
SSW n-K MSW
Total SST n-1
51. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example – ANOVA table (cont.)
Next, we need to fill-in the so-called ANOVA table:
Source
of
Variance
(SV)
Sum of
Squares
(SS)
Degrees
of
Freedom
(df)
Mean
Squares
(MS)
F-ratio
(F)
Critical
Value
(CV)
Reject
Decision
(Reject?)
Between 10 3-1= 2 10/2= 5.0 5.0/0.5=
10.0
3.89 Is F-ratio
> CV ?
YES
Within 6 15-3= 12 6/12= 0.5
Total 16 15-1= 14
53. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example– F-tests
• In our case, when diet has no effect,
differences between diet are entirely due to
chance. Numerator and denominator will be
similar. F-ratio should have value around 1.00
• When the diet does have an effect then the
between-diet differences (numerator) should
be larger than chance (denominator). F-ratio
should be noticeably larger than 1.00
54. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example Using SPSS
• 1. Quick Data Check
– We first want to get an idea of what our data basically look like. A nice
option for the data at hand is a running a histogram of weight for each
of the three groups separately. The screenshot below walks you
through doing so.
55. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example Using SPSS
The shapes of the frequency distributions are
normally distributed
56. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Normality assumption
57. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
2. Running SPSS One-Way ANOVA
1
2
3
4
5
58. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
2. Running SPSS One-Way ANOVA (con’t.)
Under button. Tick the checkbox as shown
below:
4
59. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
2. Running SPSS One-Way ANOVA (con’t.)
Click the button. Tick the Descriptive checkbox in
the –Statistics– area, as shown below:
5
60. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example Using SPSS (cont.)
3. SPSS One-Way ANOVA Output
– Two sections (boxes) appear in the output: Descriptives
– “N” in the first column refers to the number of cases used for calculating the
descriptive statistics. These numbers being equal to our sample sizes tells us
that there are no missing values on the dependent variable.
– The mean weights are the core of our output. After all, our main research
question is whether these differ for different diets.
61. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
• 3. SPSS One-Way ANOVA Output-con’t
– The second section, ANOVA table
• The significance level is 0.003 (p <0.01), and, therefore, there is a
statistically significant difference in the mean weight gain between the
different diets.
• which of the specific groups differed?
• Find this out in the Multiple Comparisons table which contains the results
of post-hoc tests.
62. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
3. SPSS One-Way ANOVA Output-con’t
• The table below, Multiple Comparisons, shows which groups differed from
each other.
• there is a significant difference in weight gain between NL diet and RC diet
(p = 0.002). However, there were no differences between NL diet and ND
diet (p=0.105), as well as between RC diet and ND diet (p=0.105)
63. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
In the Literature
• First and foremost, report descriptive
statistics.
• Regarding the significance test,report
– the F value;
– df1, the numerator degrees of freedom;
– df2, the denominator degrees of freedom;
– the p value
64. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
In the Literature
• There was a statistically significant difference between groups
as determined by one-way ANOVA (F(2,12) = 10.0, p<0.01). A
Tukey post-hoc test revealed that weight gain was statistically
significantly lower in NL diet (M= 2.00, SD= 0.71) compared to
RC diet (M= 4.00, SD = 0.71, p <0.01). However, ND diet (M=
3.00, SD= 0.71) did not significantly differ from NL and RC diet.
Diet Mean (SD) t statistics (df) p-value
Nasi Lemak 2.00 (0.71) 10.0(2,12) 0.003
Roti Canai 4.00 (0.71)
Nasi Dagang 3.00 (0.71)
Table 1: Type of diet associated with weight gain
65. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Steps in solving One-Way ANOVA
post hoc Test Problems - 1
The following is a guide to the decision process for answering
homework problems about one-way ANOVA post hoc test
problems:
Is the dependent variable
ordinal or interval level and
does independent variable
define groups?
Incorrect
application of
a statistic
No
Compute the skewness, and kurtosis for the
variable to test assumption of normality.
Yes
66. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Steps in solving One-Way ANOVA
post hoc Test Problems - 1
The following is a guide to the decision process for answering
homework problems about one-way ANOVA post hoc test
problems:
Is the dependent variable
ordinal or interval level and
does independent variable
define groups?
Incorrect
application of
a statistic
No
Compute the skewness, and kurtosis for the
variable to test assumption of normality.
Yes
67. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Steps in solving One-Way ANOVA
post hoc Test Problems - 2
Yes
No
Assumption of normality
satisfied? (skew, kurtosis
between -1.0 and + 1.0)
No
Sample size 10+ in
each group to apply
Central Limit Theorem?
Incorrect
application of
a statistic
Yes
Compute the one-way ANOVA with Tukey
HSD post hoc option selected
68. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Steps in solving One-Way ANOVA
post hoc Test Problems - 3
Is the p-value for the Tukey
HSD post hoc test <= alpha?
Examine Tukey HSD post hoc test result
False
Is the p-value for the F
ratio test <= alpha?
No
Yes
69. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
recap
• FIVE assumptions
– dependent variable should be measured at the interval or
ratio level (i.e., they are continuous).
– independent variable should consist of two or more
categorical, independent groups.
– should have independence of observations, which means
that there is no relationship between the observations in
each group or between the groups themselves.
– homoscedasticity: the dependent variable has the same
variance within each population;
– normality: the dependent variable is Gaussianly
distributed within each population;
70. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Any
Questio
ns?
Conce
pts?
Equations?