0
Analysis of Variance                 4                 4                 4        95% CI                 4                ...
OverviewAnalysing differences  Correlations vs. differences  Which difference test?  Parametric vs. non-parametricst-tests...
OverviewANOVAs 1-way ANOVA 1-way repeated measures ANOVA Factorial ANOVAAdvanced ANOVAs Mixed design ANOVA (Split-plot ANO...
Readings – Assumed knowledgeHowell (2010):Ch3 The Normal DistributionCh4 Sampling Distributions and Hypothesis TestingCh7 ...
ReadingsHowell (2010):Ch14 Repeated-Measures DesignsCh16 Analyses of Variance and Covariance as General Linear ModelsSee a...
Analysing differencesCorrelations vs. differencesWhich difference test?Parametric vs. non-parametric
Correlational vs     difference statisticsCorrelation and regressiontechniques reflect thestrength of associationTests of ...
Correlational vs    difference statisticsIn MLR we see the world asmade of covariation.Everywhere we look, we seerelations...
Correlational vs    difference statisticsLR/MLR e.g.,What is the relationshipbetween gender and height inhumans?t-test/ANO...
Which difference test? (2 groups)                              How many groups?                              (i.e. categor...
Parametric vs.    non-parametric statisticsParametric statistics – inferential test that assumes certain characteristics a...
Parametric vs.  non-parametric statisticsThere is generally at least onenon-parametric equivalent test foreach type of par...
Parametric vs.    non-parametric statisticsParametric statistics commonly usedfor normally distributed interval or ratiode...
So, when do I use a    non-parametric test?Consider non-parametric tests when (any of the following):Assumptions, like nor...
Some Commonly Used  Some commonly used parametric & non-parametric tests     Parametric & Nonparametric Parametric Non-par...
t-testst-testsOne-sample t-testsIndependent sample t-testsPaired sample t-tests
Why a t-test or ANOVA?A t-test or ANOVA is used todetermine whether a sample of scoresare from the same population asanoth...
t-testsOne-sampleOne group of participants, compared withfixed, pre-existing value (e.g., populationnorms)IndependentCompa...
Major assumptionsNormally distributed variablesHomogeneity of varianceIn general, t-tests and ANOVAs are  robust to violat...
Use of t in t-testst reflects the ratio of between groupvariance to within group varianceIs the t large enough that it is ...
Ye good ol’ normal distribution             68%                            95%                           99.7%            ...
One-tail vs. two-tail testsTwo-tailed test rejects null hypothesis if obtained t-value is extreme is either directionOne-t...
One sample t-testCompare one group (a sample) with a fixed, pre-existing value (e.g., population norms)Do uni students sle...
One-sample t-test
Independent groups t-testCompares mean scores on the same variable across different populations (groups)Do Americans vs. N...
Assumptions         (Indep. samples t-test)LOM   IV is ordinal / categorical   DV is interval / ratioHomogeneity of Varian...
Do males and females differ inin amount of sleep per night?
Do males and females differ in                   memory recall?                                                      Group...
Adolescents         Same Sex Relations in      Single Sex vs. Co-Ed Schools                                               ...
Adolescents        Opposite Sex Relations in       Single Sex vs. Co-Ed Schools                                           ...
Independent samples t-testComparison b/w means of 2 independentsample variables = t-test(e.g., what is the difference in E...
Paired samples t-test → 1-way repeated measures           ANOVASame participants, with repeated measuresData is sampled wi...
Assumptions          (Paired samples t-test)LOM:  IV: Two measures from same participants (w/in   subjects)    a variable ...
Does an intervention have an effect?There was no significant difference between pretest and posttest scores (t(19) = 1.78,...
Adolescents Opposite Sex              vs. Same Sex Relations                                  Paired Samples Statistics   ...
Paired samples t-test → 1-way repeated measures ANOVAComparison b/w means of 2 within subjectvariables = t-testComparison ...
Summary    (Analysing Differences)Non-parametric and parametric tests can be used for examining differences between the ce...
t-testsDifference between a set value and a variable → one-sample t-testDifference between two independent groups → indepe...
Are the differences in a sample    generalisable to a population?              30              25Percentage    20 Reportin...
Introduction to ANOVA         (Analysis of Variance)Extension of a t-test to assess differences in the central tendency (M...
Example ANOVA research questionAre there differences in the degree of  religious commitment between countries  (UK, USA, a...
Example ANOVA research questionDo university students have different levels  of satisfaction for educational, social, and ...
Example ANOVA research questionsAre there differences in the degree of  religious commitment between countries  (UK, USA, ...
Example ANOVA research questionsDoes couples relationship satisfaction  differ between males and females and  before and a...
Example ANOVA research questionsAre there differences in university student  satisfaction between males and females  (gend...
Introduction to ANOVAInferential: What is the likelihood that  the observed differences could have  been due to chance?Fol...
F testANOVA partitions the sums of squares (variance from the mean) into:  Explained variance (between groups)  Unexplaine...
F is the ratio ofbetween-group : within-group variance                                    48
Follow-up testsANOVA F-tests are a "gateway".If F is significant, then...           interpret (main and           interact...
One-way ANOVA                50
Assumptions – One-way ANOVADependent variable (DV) must be:LOM: Interval or ratioNormality: Normally distributed for all I...
One-way ANOVA:  Are there differences insatisfaction levels betweenstudents who get different          grades?            ...
400                                        Recoding                                         needed to300                  ...
These groups could be combined.                               AVGRADE Average Grade                                       ...
The recoded data has more            similar group sizes and is            appropriate for ANOVA.                         ...
SDs are similar (homogeneity of      variance). Ms suggest that      higher grade groups are more      satisfied.         ...
Levenes test indicates homogeneity of variance.                                           aLevenes Test of Equality of Err...
Tests of Between-Subjects EffectsDependent Variable: EDUCAT                  Type III SumSource             of Squares    ...
One-way ANOVA:  Does locus of control differ  between three age groups?Age               Locus of Control20-25 year-olds  ...
42.00                  40.00                  38.0095% CI control1                  36.00                          60-65 y...
The SDs vary between groups (the third group has almost double the SD of the younger group). Levenes test is significant (...
ANOVAcontrol1                 Sum of                 Squares    df        Mean Square   F       Sig.Between Groups    395....
Which age groups differ in their mean locus of  control scores? (Post hoc tests).Conclude: Gps 0 differs from 2; 1 differs...
Follow-up (pairwise) testsPost hoc: Compares everypossible combinationPlanned: Compares specificcombinations(Do one or the...
Post hocControl for Type I error rateScheffe, Bonferroni, Tukey’sHSD, or Student-Newman-KeulsKeeps experiment-wise error r...
PlannedNeed hypothesis before you startSpecify contrast coefficients toweight the comparisons(e.g., 1st two vs. last one)T...
Assumptions -    Repeated measures ANOVARepeated measures designs have the additional assumption of Sphericity:Variance of...
Assumptions -   Repeated measures ANOVASphericity is commonly violated, however the multivariate test (provided by default...
Example: Repeated measures          ANOVADoes LOC vary over time?Baseline6 months12 months                             69
Mean LOC scores (with 95% C.I.s)across 3 measurement occasions         40         39         38                           ...
Descriptive statistics             Descriptive Statistics            Mean      Std. Deviation   Ncontrol1    37.0167      ...
Mauchlys test of sphericity                                                                          b                    ...
Tests of within-subject effects                                      Tests of Within-Subjects EffectsMeasure: MEASURE_1   ...
1-way repeated measures         ANOVADo satisfaction levels vary   between Education,  Teaching, Social and   Campus aspec...
Descriptive Statistics                  Mean       Std. DeviationEDUCAT              3.74               .54TEACHG         ...
4         4         495% CI         4         4         3         N=     594      594      594      594              EDUCA...
Tests of within-subject effects                               Tests of Within-Subjects EffectsMeasure: MEASURE_1          ...
Factorial ANOVA (2-way):  Are there differences insatisfaction levels between     gender and age?                         ...
Factorial ANOVALevels of measurement   2 or more between-subjects    categorical/ordinal IVs   1 interval/ratio DVe.g., Do...
Factorial ANOVAFactorial designs test Main Effects andInteractions. For a 2-way design:Main effect of IV1Main effect of IV...
300200100                                                                    Std. Dev = 6.36                              ...
AGE                                                    Cumulative             Frequency   Percent    Valid Percent    Perc...
4                               M a le s3 .5                               F e m a le s   3       1 7 to 2 2   O ver 22   ...
Tests of Between-Subjects EffectsDependent Variable: TEACHG                Type III SumSource           of Squares        ...
Descriptive StatisticsDependent Variable: TEACHGAGEX Age        GENDER       Mean      Std. Deviation   N1.00 17 to 22   0...
Factorial ANOVA (2-way):Are there differences in LOC between gender and age?                               86
Example: Factorial ANOVAMain effect 1: - Do LOC scores differ by Age?Main effect 2: - Do LOC scores differ by Gender?Inter...
Example: Factorial ANOVAIn this example, there are:  Two main effects (Age and Gender)  One interaction effect (Age x Gend...
Plot of LOC by Age and Gender                                      Estim ated Marginal Means of control1                  ...
gender                  50.00                                                    female                          Age x gen...
42.00                                     Age main effect                  40.00                  38.0095% CI control1    ...
Descriptives for Age main effect                 Descriptivescontrol1             N           Mean      Std. Deviation.00 ...
Gender main effect                  40.00                          Error-bar graph for95% CI control1                     ...
Descriptives for Gender main effect                  Descriptives control1              N           Mean      Std. Deviati...
Descriptives for LOC by           Age and Gender                      Descriptive StatisticsDependent Variable: control1ag...
Tests of between-subjects effects                        Tests of Between-Subjects EffectsDependent Variable: control1    ...
InteractionsIV1 = Separate lines for morning and eveningexercise.IV2 = Light and heavy exerciseDV = Av. hours of sleep per...
Interactions
Interactions
Mixed design ANOVA (SPANOVA) Independent groups (e.g., males and   females) with repeated measures on   each group (e.g., ...
Mixed design ANOVA (SPANOVA)IV1 is between-subjects (e.g., Gender)IV2 is within-subjects (e.g., SocialSatisfaction and Cam...
Mixed design ANOVA (SPANOVA) An experiment has two IVs: Between-subjects =  Gender (Male or Female) - varies  between subj...
Mixed design ANOVA: DesignIf A is Gender and B is Spacing  the Reading experiment is of the  type A X (B) or 2 x (3)Bracke...
Mixed design ANOVA:         AssumptionsNormalityHomogeneity of varianceSphericityHomogeneity of inter-correlations        ...
Homogeneity of         intercorrelationsThe pattern of inter-correlations among  the various levels of repeated measure  f...
Mixed design ANOVA:        ExampleDo satisfaction levels vary   between gender for education and teaching?                ...
EDUCAT       3.80                                       TEACHG       3.75       3.70Mean       3.65       3.60       3.55 ...
Tests of within-subjects contrasts                         Tests of Within-Subjects ContrastsMeasure: MEASURE_1           ...
Tests of between-subjects effects                     Tests of Between-Subjects EffectsMeasure: MEASURE_1Transformed Varia...
1. genderMeasure: MEASURE_1                                         95% Confidence Intervalgender       Mean       Std. Er...
What is ANCOVA?Analysis of CovarianceExtension of ANOVA,using ‘regression’ principlesAssesses effect ofone variable (IV) o...
ANCOVA      (Analysis of Covariance)A covariate IV is added to an ANOVA(can be dichotomous or metric)Effect of the covaria...
Why use ANCOVA?Reduces variance associated withcovariate (CV) from the DV error(unexplained variance) termIncreases power ...
Why use ANCOVA?Adjusts group means to what theywould have been if all Ps had scoredidentically on the CV.The differences b...
Assumptions of ANCOVAAs per ANOVANormalityHomogeneity of Variance (useLevene’s test)                                      ...
Assumptions of ANCOVAIndependence of observationsIndependence of IV and CVMulticollinearity - if more than oneCV, they sho...
Assumptions of ANCOVACheck for linearity between CV &DV - check via scatterplot andcorrelation.If the CV is not correlated...
Assumptions of ANCOVAHomogeneity of regressionAssumes slopes of regression linesbetween CV & DV are equal for each levelof...
Assumptions of ANCOVA               60               50               40               30               20 achievement    ...
ANCOVA example 1:Does education satisfaction differ between people with  different levels of coping(‘Not coping’, ‘Just co...
200100                                                        Std. Dev = 1.24                                             ...
COPEX Coping                                                                    Cumulative                             Fre...
Descriptive StatisticsDependent Variable: EDUCATCOPEX Coping          Mean       Std. Deviation   N1.00 Not Coping        ...
Tests of Between-Subjects EffectsDependent Variable: EDUCAT                  Type III SumSource             of Squares    ...
ANCOVA Example 2: Does     teaching method affect  academic achievement after   controlling for motivation?IV = teaching m...
ANCOVA example       Teachin          g       Method        (IV)                AcademiMotivat            c  ion          ...
ANCOVA example    Teaching     Method           Academic           Achieveme               nt    Motivation               ...
ANCOVA example 2                                  Tests of Between-Subjects EffectsDependent Variable: achievement        ...
ANCOVA example 2                                   Tests of Between-Subjects Effects Dependent Variable: achievement      ...
ANCOVA & hierarchical MLRANCOVA is similar to hierarchicalregression – assesses impact of IVon DV while controlling for 3r...
Summary of ANCOVAUse ANCOVA in survey researchwhen you can’t randomly allocateparticipants to conditionse.g., quasi-experi...
Summary of ANCOVADecide which variable(s) are IV, DV & CV.Check assumptions:normalityhomogeneity of variance (Levene’s tes...
Effect sizesThree effect sizes are relevant toANOVA:Eta-square (η ) provides an overall                 2test of size of e...
Effect Size: Eta-squared (η 2)Analagous to R2 from regression= SSbetween / SStotal = SSB / SST= prop. of variance in Y exp...
Effect Size: Eta-squared (η 2)Interpret as for r2 or R2Cohens rule of thumb for interpreting η2:   .01 is small   .06 medi...
ANOVA ontrol1                 Sum of                 Squares    df        Mean Square   F       Sig. etween Groups    395....
Effect Size: Eta-squared (η 2)The eta-squared column in SPSS F-tableoutput is actually partial eta-squared (ηp2).Partial e...
Results - Writing up ANOVAEstablish clear hypotheses – one for each main or interaction or covariate effectTest the assump...
Results - Writing up ANOVAReport on test results – Size,  direction and significance (F, p,  partial eta-squared)Conduct p...
SummaryHypothesise each main effect and interaction effect.F is an omnibus “gateway” test; may require follow-up tests.Con...
SummaryChoose from mixed-design ANOVA or ANCOVA for lab reportRepeated measure designs include the assumption of sphericit...
SummaryReport on the size of effects potentially using:  Eta-square (η ) as the omnibus ES                 2  Partial eta-...
Open Office ImpressThis presentation was made usingOpen Office Impress.Free and open source software.http://www.openoffice...
Upcoming SlideShare
Loading in...5
×

ANOVA

13,431

Published on

Explains the principles of ANOVA, starting with explanation of t-tests and includes the design, assumptions, and interpretations for one-way ANOVA, one-way repeated measures ANOVA, factorial ANOVA, mixed-design ANOVA (SPANOVA), and ANCOVA.

Published in: Education, Technology
1 Comment
10 Likes
Statistics
Notes
  • ขอบคุณมากๆค่ะ
       Reply 
    Are you sure you want to  Yes  No
    Your message goes here
No Downloads
Views
Total Views
13,431
On Slideshare
0
From Embeds
0
Number of Embeds
6
Actions
Shares
0
Downloads
851
Comments
1
Likes
10
Embeds 0
No embeds

No notes for slide
  • 7126/6667 Survey Research & Design in Psychology Semester 1, 2012, University of Canberra, ACT, Australia James T. Neill Home page: http://ucspace.canberra.edu.au/display/7126 Lecture page: http://en.wikiversity.org/wiki/Survey_research_and_design_in_psychology/Lectures/ANOVA http://ucspace.canberra.edu.au/pages/viewpage.action?pageId=43549230 http://en.wikiversity.org/wiki/ANOVA Image source: http://commons.wikimedia.org/wiki/File:3D_Bar_Graph_Meeting.jpg ' License: Public domain License: CC-by-A 2.0 Author: lumaxart ( http://www.flickr.com/photos/lumaxart/ ) Description: Explains the use of advanced ANOVAs. This lecture is accompanied by a computer-lab based tutorial, the notes for which are available here: http://ucspace.canberra.edu.au/display/7126/Tutorial+-+Multiple+linear+regression
  • Image source: http://commons.wikimedia.org/wiki/File:Information_icon4.svg License: Public domain Assumed knowledge: t-tests and basic ANOVAs
  • Image source: http://commons.wikimedia.org/wiki/File:Information_icon4.svg License: Public domain Assumed knowledge: t-tests and basic ANOVAs
  • Image source: http://commons.wikimedia.org/wiki/File:Information_icon4.svg License: Public domain
  • Correlation/regression techniques reflect the strength of association between continuous variables Tests of group differences ( t -tests, ANOVA) indicate whether significant differences exist between group means
  • In MLR we see world is made of covariation. In ANOVA we see the world as made of differences. In MLR, everywhere we look, we see patterns and relationships. In ANOVA we view everything as having the same, more or less than other things.
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/ “ A t-test is used to determine whether a set or sets of scores are from the same population.” - Coakes & Steed (1999), p.61
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/ Adapted from: http://www.tufts.edu/~gdallal/npar.htm
  • Image source: http://commons.wikimedia.org/wiki/File:Information_icon4.svg License: Public domain
  • “ A t-test is used to determine whether a set or sets of scores are from the same population.” - Coakes & Steed (1999), p.61
  • Image soruce: Unknown
  • Image source: Unknown
  • Also called: One sample t -test
  • Image source: James Neill, Creative Commons Attribution 3.0 Australia, http://creativecommons.org/licenses/by/3.0/au/deed.en Fabricated data
  • Adapted from slide 23 of Howell Ch12 Powerpoint: IV is ordinal / categorical e.g., gender DV is interval / ratio e.g., self-esteem Homogeneity of Variance If variances unequal (Levene’s test), adjustment made Normality t-tests robust to modest departures from normality (often violated without consequences) look at histograms, skewness, & kurtosis; consider use of Mann-Whitney U test if departure from normality is severe, particularly if sample size is small (e.g., < 50) Independence of observations (one participant’s score is not dependent on any other participant’s score)
  • Image source: James Neill, Creative Commons Attribution 3.0 Australia, http://creativecommons.org/licenses/by/3.0/au/deed.en Fabricated data
  • Image source: S;ode 23 pf Howell Ch12 Powerpoint Independent samples t -test
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Also called related samples t-test or repeated measures t-test
  • Also known as: Related samples t -test and Repeated measures t -test How much do you like the colour red? How much do you like the colour blue? Is there a difference between people’s liking of red and blue?
  • Image source: James Neill, Creative Commons Attribution 3.0 Australia, http://creativecommons.org/licenses/by/3.0/au/deed.en Data based on Howell (2010), pp. 485-587dsaf
  • Image source: James Neill, Creative Commons Attribution 3.0 Australia, http://creativecommons.org/licenses/by/3.0/au/deed.en
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/ Data source: Unknown
  • Image source: http://commons.wikimedia.org/wiki/File:Tor_Dilsburg.jpg License: Public domain
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/ Homogeneity of variances assumption appears to be violated, but a useful rule of thumb is that ANOVA is robust when the largest SD is not more than approximately double the smallest SD. Also, remember, ANOVA is more robust with larger N and similar cell sizes.
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • When results are consistent, not of major concern. When results are discrepant, better to go with MANOVA
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • LOC = Locus of Control Interaction: In other words, does any relationship between age and control vary as a function of gender)
  • LOC = Locus of Control Interaction: In other words, does any relationship between age and control vary as a function of gender)
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/ Age main effect
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/ Age main effect
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: Unknown
  • Image source: Unknown
  • Image source: Unknown
  • Check: Random selection
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/ Note: By chance, the means for each level of IV are the same – but not different SE, CIs.
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Image source: James Neill, Creative Commons Attribution-Share Alike 2.5 Australia, http://creativecommons.org/licenses/by-sa/2.5/au/
  • Transcript of "ANOVA"

    1. 1. Analysis of Variance 4 4 4 95% CI 4 4 3 N= 594 594 594 594 EDUCAT TEACHG CAMPUS SOCIAL Lecture 9Survey Research & Design in Psychology James Neill, 2012
    2. 2. OverviewAnalysing differences Correlations vs. differences Which difference test? Parametric vs. non-parametricst-tests One-sample t-test Independent samples t-test Paired samples t-test 2
    3. 3. OverviewANOVAs 1-way ANOVA 1-way repeated measures ANOVA Factorial ANOVAAdvanced ANOVAs Mixed design ANOVA (Split-plot ANOVA) ANCOVA 3
    4. 4. Readings – Assumed knowledgeHowell (2010):Ch3 The Normal DistributionCh4 Sampling Distributions and Hypothesis TestingCh7 Hypothesis Tests Applied to MeansCh11 Simple Analysis of VarianceCh12 Multiple Comparisons Among Treatment MeansCh13 Factorial Analysis of Variance 4
    5. 5. ReadingsHowell (2010):Ch14 Repeated-Measures DesignsCh16 Analyses of Variance and Covariance as General Linear ModelsSee also: Inferential statistics decision-making tree 5
    6. 6. Analysing differencesCorrelations vs. differencesWhich difference test?Parametric vs. non-parametric
    7. 7. Correlational vs difference statisticsCorrelation and regressiontechniques reflect thestrength of associationTests of differences reflectdifferences in central tendency ofvariables between groups andmeasures. 7
    8. 8. Correlational vs difference statisticsIn MLR we see the world asmade of covariation.Everywhere we look, we seerelationships.In ANOVA we see the world asmade of differences.Everywhere we look we seedifferences. 8
    9. 9. Correlational vs difference statisticsLR/MLR e.g.,What is the relationshipbetween gender and height inhumans?t-test/ANOVA e.g.,What is the difference betweenthe heights of human males andfemales? 9
    10. 10. Which difference test? (2 groups) How many groups? (i.e. categories of IV) 1 group = More than 2 groups = ANOVA mod one-sample t-test 2 groups: Are the groups independent or dependent? Independent groups Dependent groups Non-para DV = Mann-Whitney U Non-para DV = Wilcoxon Para DV = Independent samples t-test Para DV = Paired samples t-test 10
    11. 11. Parametric vs. non-parametric statisticsParametric statistics – inferential test that assumes certain characteristics are true of an underlying population, especially the shape of its distribution.Non-parametric statistics – inferential test that makes few or no assumptions about the population from which observations were drawn (distribution-free tests). 11
    12. 12. Parametric vs. non-parametric statisticsThere is generally at least onenon-parametric equivalent test foreach type of parametric test.Non-parametric tests are generallyused when assumptions about theunderlying population arequestionable (e.g., non-normality). 12
    13. 13. Parametric vs. non-parametric statisticsParametric statistics commonly usedfor normally distributed interval or ratiodependent variables.Non-parametric statistics can beused to analyse DVs that are non-normal or are nominal or ordinal.Non-parametric statistics are lesspowerful that parametric tests. 13
    14. 14. So, when do I use a non-parametric test?Consider non-parametric tests when (any of the following):Assumptions, like normality, have been violated.Small number of observations (N).DVs have nominal or ordinal levels of measurement. 14
    15. 15. Some Commonly Used Some commonly used parametric & non-parametric tests Parametric & Nonparametric Parametric Non-parametric Purpose Testst test (independent) Mann-Whitney U; Wilcoxon rank-sum independent samp Compares two t test (paired) Wilcoxon matched pairs signed-rank two related samples Compares 1-way ANOVA Kruskal-Wallis Compares three or more group Friedman; 2-way ANOVA Compares groups classified by two differ χ2 test of independence
    16. 16. t-testst-testsOne-sample t-testsIndependent sample t-testsPaired sample t-tests
    17. 17. Why a t-test or ANOVA?A t-test or ANOVA is used todetermine whether a sample of scoresare from the same population asanother sample of scores.These are inferential tools forexamining differences between groupmeans.Is the difference between two sample means ‘real’ or due to chance? 17
    18. 18. t-testsOne-sampleOne group of participants, compared withfixed, pre-existing value (e.g., populationnorms)IndependentCompares mean scores on the samevariable across different populations(groups)PairedSame participants, with repeatedmeasures 18
    19. 19. Major assumptionsNormally distributed variablesHomogeneity of varianceIn general, t-tests and ANOVAs are robust to violation of assumptions, particularly with large cell sizes, but dont be complacent. 19
    20. 20. Use of t in t-testst reflects the ratio of between groupvariance to within group varianceIs the t large enough that it is unlikely thatthe two samples have come from thesame population?Decision: Is t larger than the critical valuefor t? (see t tables – depends on critical α and N) 20
    21. 21. Ye good ol’ normal distribution 68% 95% 99.7% 21
    22. 22. One-tail vs. two-tail testsTwo-tailed test rejects null hypothesis if obtained t-value is extreme is either directionOne-tailed test rejects null hypothesis if obtained t-value is extreme is one direction (you choose – too high or too low)One-tailed tests are twice as powerful as two-tailed, but they are only focused on identifying differences in one direction. 22
    23. 23. One sample t-testCompare one group (a sample) with a fixed, pre-existing value (e.g., population norms)Do uni students sleep less than the recommended amount? e.g., Given a sample of N = 190 uni students who sleep M = 7.5 hrs/day (SD = 1.5), does this differ significantly from 8 hours hrs/day (α = .05)? 23
    24. 24. One-sample t-test
    25. 25. Independent groups t-testCompares mean scores on the same variable across different populations (groups)Do Americans vs. Non-Americans differ in their approval of Barack Obama?Do males & females differ in the amount of sleep they get? 25
    26. 26. Assumptions (Indep. samples t-test)LOM IV is ordinal / categorical DV is interval / ratioHomogeneity of Variance: If variances unequal(Levene’s test), adjustment madeNormality: t-tests robust to modest departuresfrom normality, otherwise consider use of Mann-Whitney U testIndependence of observations (oneparticipant’s score is not dependent on any otherparticipant’s score) 26
    27. 27. Do males and females differ inin amount of sleep per night?
    28. 28. Do males and females differ in memory recall? Group Statistics gender_R Gender Std. Error of respondent N Mean Std. Deviation Mean immrec immediate 1 Male 1189 7.34 2.109 .061 recall-number correct_wave 1 2 Female 1330 8.24 2.252 .062 Independent Samples Test Levenes Test for Equality of Variances t-test for Equality of Means 95% Confidence Interval of the Mean Std. Error Difference F Sig. t df Sig. (2-tailed) Difference Difference Lower Upperes 4.784 .029 -10.268 2517 .000 -.896 .087 -1.067 -.725es -10.306 2511.570 .000 -.896 .087 -1.066 -.725 28
    29. 29. Adolescents Same Sex Relations in Single Sex vs. Co-Ed Schools Group Statistics Std. Error Type of School N Mean Std. Deviation Mean SSR Single Sex 323 4.9995 .7565 4.209E-02 Co-Educational 168 4.9455 .7158 5.523E-02 Independent Samples Test Levenes Test for Equality of Variances t-test for Equality of Means 95% Confidence Interval of the Mean Std. Error Difference F Sig. t df Sig. (2-tailed) Difference Difference Lower UpperSSR Equal variances .017 .897 .764 489 .445 5.401E-02 7.067E-02 -8.48E-02 .1929 assumed Equal variances .778 355.220 .437 5.401E-02 6.944E-02 -8.26E-02 .1906 not assumed 29
    30. 30. Adolescents Opposite Sex Relations in Single Sex vs. Co-Ed Schools Group Statistics Std. Error Type of School N Mean Std. Deviation Mean OSR Single Sex 327 4.5327 1.0627 5.877E-02 Co-Educational 172 3.9827 1.1543 8.801E-02 Independent Samples Test Levenes Test for Equality of Variances t-test for Equality of Means 95% Confidence Interval of the Mean Std. Error Difference F Sig. t df Sig. (2-tailed) Difference Difference Lower UpperSSR Equal variances .017 .897 .764 489 .445 5.401E-02 7.067E-02 -8.48E-02 .1929 assumed Equal variances .778 355.220 .437 5.401E-02 6.944E-02 -8.26E-02 .1906 not assumed 30
    31. 31. Independent samples t-testComparison b/w means of 2 independentsample variables = t-test(e.g., what is the difference in Educational Satisfactionbetween male and female students?)Comparison b/w means of 3+ independentsample variables = 1-way ANOVA(e.g., what is the difference in Educational Satisfactionbetween students enrolled in four different faculties?) 31
    32. 32. Paired samples t-test → 1-way repeated measures ANOVASame participants, with repeated measuresData is sampled within subjects. Measures are repeated e.g.,: Time e.g., pre- vs. post-intervention Measures e.g., approval ratings of brand X and brand Y 32
    33. 33. Assumptions (Paired samples t-test)LOM: IV: Two measures from same participants (w/in subjects) a variable measured on two occasions or two different variables measured on the same occasion DV: Continuous (Interval or ratio)Normal distribution of difference scores (robust to violation with larger samples)Independence of observations (one participant’s score is not dependent on another’s score) 33
    34. 34. Does an intervention have an effect?There was no significant difference between pretest and posttest scores (t(19) = 1.78, p = .
    35. 35. Adolescents Opposite Sex vs. Same Sex Relations Paired Samples Statistics Std. Error Mean N Std. Deviation Mean Pair SSR 4.9787 951 .7560 2.451E-02 1 OSR 4.2498 951 1.1086 3.595E-02 Paired Samples Test Paired Differences 95% Confidence Interval of the Std. Error Difference Mean Std. Deviation Mean Lower Upper t df Sig. (2-tailed)Pair 1 SSR - OSR .7289 .9645 3.128E-02 .6675 .7903 23.305 950 .000 35
    36. 36. Paired samples t-test → 1-way repeated measures ANOVAComparison b/w means of 2 within subjectvariables = t-testComparison b/w means of 3+ withinsubject variables = 1-way repeatedmeasures ANOVA(e.g., what is the difference in Campus, Social,and Education Satisfaction?) 36
    37. 37. Summary (Analysing Differences)Non-parametric and parametric tests can be used for examining differences between the central tendency of two of more variablesLearn when to use each of the parametric tests of differences, from one-sample t-test through to ANCOVA (e.g. use a decision chart). 37
    38. 38. t-testsDifference between a set value and a variable → one-sample t-testDifference between two independent groups → independent samples t-test = BETWEEN-SUBJECTSDifference between two related measures (e.g., repeated over time or two related measures at one time) → paired samples t-test = WITHIN-SUBJECTS 38
    39. 39. Are the differences in a sample generalisable to a population? 30 25Percentage 20 Reporting Binge 15Drinking inPast Month 10 5 0 12 to 17 18 to 25 26 to 34 35+ Age of 1997 USA Household Sample
    40. 40. Introduction to ANOVA (Analysis of Variance)Extension of a t-test to assess differences in the central tendency (M) of several groups or variables.DV variance is partitioned into between-group and within-group varianceLevels of measurement: Single DV: metric, 1 or more IVs: categorical 40
    41. 41. Example ANOVA research questionAre there differences in the degree of religious commitment between countries (UK, USA, and Australia)?1-way ANOVA1-way repeated measures ANOVAFactorial ANOVAMixed ANOVAANCOVA 41
    42. 42. Example ANOVA research questionDo university students have different levels of satisfaction for educational, social, and campus-related domains ?1-way ANOVA1-way repeated measures ANOVAFactorial ANOVAMixed ANOVAANCOVA 42
    43. 43. Example ANOVA research questionsAre there differences in the degree of religious commitment between countries (UK, USA, and Australia) and gender (male and female)?1-way ANOVA1-way repeated measures ANOVAFactorial ANOVAMixed ANOVAANCOVA 43
    44. 44. Example ANOVA research questionsDoes couples relationship satisfaction differ between males and females and before and after having children?1-way ANOVA1-way repeated measures ANOVAFactorial ANOVAMixed ANOVAANCOVA 44
    45. 45. Example ANOVA research questionsAre there differences in university student satisfaction between males and females (gender) after controlling for level of academic performance?1-way ANOVA1-way repeated measures ANOVAFactorial ANOVAMixed ANOVAANCOVA 45
    46. 46. Introduction to ANOVAInferential: What is the likelihood that the observed differences could have been due to chance?Follow-up tests: Which of the Ms differ?Effect size: How large are the observed differences? 46
    47. 47. F testANOVA partitions the sums of squares (variance from the mean) into: Explained variance (between groups) Unexplained variance (within groups) – or error varianceF = ratio between explained & unexplained variancep = probability that the observed mean differences between groups could be attributable to chance 47
    48. 48. F is the ratio ofbetween-group : within-group variance 48
    49. 49. Follow-up testsANOVA F-tests are a "gateway".If F is significant, then... interpret (main and interaction) effects and consider whether to conduct follow-up tests planned comparisons post-hoc contrasts. 49
    50. 50. One-way ANOVA 50
    51. 51. Assumptions – One-way ANOVADependent variable (DV) must be:LOM: Interval or ratioNormality: Normally distributed for all IV groups (robust to violations of this assumption if Ns are large and approximately equal e.g., >15 cases per group)Variance: Equal variance across for all IV groups (homogeneity of variance)Independence: Participants data should be independent of others data 51
    52. 52. One-way ANOVA: Are there differences insatisfaction levels betweenstudents who get different grades? 52
    53. 53. 400 Recoding needed to300 achieve min. 15 per200 group.100 Std. Dev = .71 Mean = 3.0 0 N = 531.00 1.0 2.0 3.0 4.0 5.0 Average Grade 53
    54. 54. These groups could be combined. AVGRADE Average Grade Cumulative Frequency Percent Valid Percent PercentValid 1 Fail 1 .2 .2 .2 2 Pass 125 20.5 23.5 23.7 3 2 .3 .4 24.1 3 Credit 299 48.9 56.3 80.4 4 4 .7 .8 81.2 4 Distinction 88 14.4 16.6 97.7 5 High Distinction 12 2.0 2.3 100.0 Total 531 86.9 100.0Missing System 80 13.1Total 611 100.0 54
    55. 55. The recoded data has more similar group sizes and is appropriate for ANOVA. AVGRADX Average Grade (R) Cumulative Frequency Percent Valid Percent PercentValid 2.00 Fail/Pass 128 20.9 24.1 24.1 3.00 Credit 299 48.9 56.3 80.4 4.00 D/HD 104 17.0 19.6 100.0 Total 531 86.9 100.0Missing System 80 13.1Total 611 100.0 55
    56. 56. SDs are similar (homogeneity of variance). Ms suggest that higher grade groups are more satisfied. Descriptive StatisticsDependent Variable: EDUCATAVGRADX Mean Std. Deviation NAverage Grade (R)2.00 Fail/Pass 3.57 .53 1283.00 Credit 3.74 .51 2994.00 D/HD 3.84 .55 104Total 3.72 .53 531 56
    57. 57. Levenes test indicates homogeneity of variance. aLevenes Test of Equality of Error VariancesDependent Variable: EDUCAT F df1 df2 Sig. .748 2 528 .474Tests the null hypothesis that the error variance ofthe dependent variable is equal across groups. a. Design: Intercept+AVGRADX 57
    58. 58. Tests of Between-Subjects EffectsDependent Variable: EDUCAT Type III SumSource of Squares df Mean Square F Sig.Corrected Model 4.306a 2 2.153 7.854 .000Intercept 5981.431 1 5981.431 21820.681 .000AVGRADX 4.306 2 2.153 7.854 .000Error 144.734 528 .274Total 7485.554 531Corrected Total 149.040 530 a. R Squared = .029 (Adjusted R Squared = .025) Follow-up tests should then be conducted because the effect of Grade is statistically significant (p < .05). 58
    59. 59. One-way ANOVA: Does locus of control differ between three age groups?Age Locus of Control20-25 year-olds Lower = internal40-45 year olds Higher = external60-65 year-olds 59
    60. 60. 42.00 40.00 38.0095% CI control1 36.00 60-65 year-olds appear 34.00 to be more internal, but the overlapping confidence intervals 32.00 30.00 indicate that this may not be statistically 28.00 significant. 20-25 40-45 60-65 age 60
    61. 61. The SDs vary between groups (the third group has almost double the SD of the younger group). Levenes test is significant (variances are not homogenous). Descriptives control1 95% Confidence Interval for Mean N Mean Std. Deviation Std. Error Lower Bound Upper Bound Minimum Maximum .00 20-25 20 39.1000 5.25056 1.17406 36.6427 41.5573 32.00 46.00 1.00 40-45 20 38.5500 5.29623 1.18427 36.0713 41.0287 30.00 46.00 2.00 60-65 20 33.4000 9.29289 2.07795 29.0508 37.7492 19.00 45.00 Total 60 37.0167 7.24040 .93473 35.1463 38.8871 19.00 46.00 Test of Homogeneity of Variances control1 Levene Statistic df1 df2 Sig. 13.186 2 57 .000 61
    62. 62. ANOVAcontrol1 Sum of Squares df Mean Square F Sig.Between Groups 395.433 2 197.717 4.178 .020Within Groups 2697.550 57 47.325Total 3092.983 59 There is a significant effect for Age (F (2, 59) = 4.18, p = .02). In other words, the three age groups are unlikely to be drawn from a population with the same central tendency for LOC. 62
    63. 63. Which age groups differ in their mean locus of control scores? (Post hoc tests).Conclude: Gps 0 differs from 2; 1 differs from 2 63
    64. 64. Follow-up (pairwise) testsPost hoc: Compares everypossible combinationPlanned: Compares specificcombinations(Do one or the other; not both) 64
    65. 65. Post hocControl for Type I error rateScheffe, Bonferroni, Tukey’sHSD, or Student-Newman-KeulsKeeps experiment-wise error rateto a fixed limit 65
    66. 66. PlannedNeed hypothesis before you startSpecify contrast coefficients toweight the comparisons(e.g., 1st two vs. last one)Tests each contrast at critical α 66
    67. 67. Assumptions - Repeated measures ANOVARepeated measures designs have the additional assumption of Sphericity:Variance of the population difference scores for any two conditions should be the same as the variance of the population difference scores for any other two conditionsTest using Mauchlys test of sphericity (If Mauchly’s W Statistic is p < .05 then assumption of sphericity is violated.) 67
    68. 68. Assumptions - Repeated measures ANOVASphericity is commonly violated, however the multivariate test (provided by default in PASW output) does not require the assumption of sphericity and may be used as an alternative.The obtained F ratio must then be evaluated against new degrees of freedom calculated from the Greenhouse-Geisser, or Huynh-Feld, Epsilon values. 68
    69. 69. Example: Repeated measures ANOVADoes LOC vary over time?Baseline6 months12 months 69
    70. 70. Mean LOC scores (with 95% C.I.s)across 3 measurement occasions 40 39 38 Not much variation95% CI 37 between means. 36 35 control1 control2 control3 70
    71. 71. Descriptive statistics Descriptive Statistics Mean Std. Deviation Ncontrol1 37.0167 7.24040 60control2 37.5667 6.80071 60control3 36.9333 6.92788 60 Not much variation between means. 71
    72. 72. Mauchlys test of sphericity b Mauchlys Test of SphericityMeasure: MEASURE_1 a Epsilon Approx. GreenhousWithin Subjects Effect Mauchlys W Chi-Square df Sig. e-Geisser Huynh-Feldt Lower-boundfactor1 .938 3.727 2 .155 .941 .971 .500Tests the null hypothesis that the error covariance matrix of the orthonormalized transformed dependent variables isproportional to an identity matrix. a. May be used to adjust the degrees of freedom for the averaged tests of significance. Corrected tests are displayed in the Tests of Within-Subjects Effects table. b. Design: Intercept Within Subjects Design: factor1 Mauchlys test is not significant, therefore sphericity can be assumed. 72
    73. 73. Tests of within-subject effects Tests of Within-Subjects EffectsMeasure: MEASURE_1 Type III SumSource of Squares df Mean Square F Sig.factor1 Sphericity Assumed 14.211 2 7.106 2.791 .065 Greenhouse-Geisser 14.211 1.883 7.548 2.791 .069 Huynh-Feldt 14.211 1.943 7.315 2.791 .067 Lower-bound 14.211 1.000 14.211 2.791 .100Error(factor1) Sphericity Assumed 300.456 118 2.546 Greenhouse-Geisser 300.456 111.087 2.705 Huynh-Feldt 300.456 114.628 2.621 Lower-bound 300.456 59.000 5.092 Conclude: Observed differences in means could have occurred by chance (F (2, 118) = 2.79, p = .06) if critical alpha = .05 73
    74. 74. 1-way repeated measures ANOVADo satisfaction levels vary between Education, Teaching, Social and Campus aspects of university life? 74
    75. 75. Descriptive Statistics Mean Std. DeviationEDUCAT 3.74 .54TEACHG 3.63 .65CAMPUS 3.50 .61SOCIAL 3.67 .65 75
    76. 76. 4 4 495% CI 4 4 3 N= 594 594 594 594 EDUCAT TEACHG CAMPUS SOCIAL 76
    77. 77. Tests of within-subject effects Tests of Within-Subjects EffectsMeasure: MEASURE_1 Type III SumSource of Squares df Mean Square F Sig.SATISF Sphericity Assumed 18.920 3 6.307 28.386 .000 Greenhouse-Geisser 18.920 2.520 7.507 28.386 .000 Huynh-Feldt 18.920 2.532 7.472 28.386 .000 Lower-bound 18.920 1.000 18.920 28.386 .000Error(SATISF) Sphericity Assumed 395.252 1779 .222 Greenhouse-Geisser 395.252 1494.572 .264 Huynh-Feldt 395.252 1501.474 .263 Lower-bound 395.252 593.000 .667 77
    78. 78. Factorial ANOVA (2-way): Are there differences insatisfaction levels between gender and age? 78
    79. 79. Factorial ANOVALevels of measurement 2 or more between-subjects categorical/ordinal IVs 1 interval/ratio DVe.g., Does Educational Satisfaction vary according to Age (2) and Gender (2)? 2 x 2 Factorial ANOVA 79
    80. 80. Factorial ANOVAFactorial designs test Main Effects andInteractions. For a 2-way design:Main effect of IV1Main effect of IV2Interaction between IV1 and IV2If significant effects are found and there are more than 2 levels of an IV are involved then follow-up tests are required. 80
    81. 81. 300200100 Std. Dev = 6.36 Mean = 23.5 0 N = 604.00 17.5 22.5 27.5 32.5 37.5 42.5 47.5 52.5 20.0 25.0 30.0 35.0 40.0 45.0 50.0 55.0 AGE 81
    82. 82. AGE Cumulative Frequency Percent Valid Percent PercentValid 17 3 .5 .5 .5 18 46 7.5 7.6 8.1 19 69 11.3 11.4 19.5 20 114 18.7 18.9 38.4 21 94 15.4 15.6 54.0 22 64 10.5 10.6 64.6 23 29 4.7 4.8 69.4 24 29 4.7 4.8 74.2 25 30 4.9 5.0 79.1 26 15 2.5 2.5 81.6 27 16 2.6 2.6 84.3 28 12 2.0 2.0 86.3 29 7 1.1 1.2 87.4 30 7 1.1 1.2 88.6 31 8 1.3 1.3 89.9 32 7 1.1 1.2 91.1 33 7 1.1 1.2 92.2 34 3 .5 .5 92.7 82
    83. 83. 4 M a le s3 .5 F e m a le s 3 1 7 to 2 2 O ver 22 83
    84. 84. Tests of Between-Subjects EffectsDependent Variable: TEACHG Type III SumSource of Squares df Mean Square F Sig.Corrected Model 2.124a 3 .708 1.686 .169Intercept 7136.890 1 7136.890 16996.047 .000AGEX .287 1 .287 .683 .409GENDER 1.584 1 1.584 3.771 .053AGEX * GENDER 6.416E-02 1 6.416E-02 .153 .696Error 250.269 596 .420Total 8196.937 600Corrected Total 252.393 599 a. R Squared = .008 (Adjusted R Squared = .003) 84
    85. 85. Descriptive StatisticsDependent Variable: TEACHGAGEX Age GENDER Mean Std. Deviation N1.00 17 to 22 0 Male 3.5494 .6722 156 1 Female 3.6795 .5895 233 Total 3.6273 .6264 3892.00 over 22 0 Male 3.6173 .7389 107 1 Female 3.7038 .6367 104 Total 3.6600 .6901 211Total 0 Male 3.5770 .6995 263 1 Female 3.6870 .6036 337 Total 3.6388 .6491 600 85
    86. 86. Factorial ANOVA (2-way):Are there differences in LOC between gender and age? 86
    87. 87. Example: Factorial ANOVAMain effect 1: - Do LOC scores differ by Age?Main effect 2: - Do LOC scores differ by Gender?Interaction: - Is the relationship between Age and LOC moderated by Gender? (Does any relationship between Age and LOC vary as a function of Gender?) 87
    88. 88. Example: Factorial ANOVAIn this example, there are: Two main effects (Age and Gender) One interaction effect (Age x Gender)IVs Age recoded into 2 groups (2) Gender dichotomous (2)DV Locus of Control (LOC) 88
    89. 89. Plot of LOC by Age and Gender Estim ated Marginal Means of control1 45.00 gender female male Estimated Marginal Means 40.00 35.00 30.00 25.00 20-25 40-45 60-65 age 89
    90. 90. gender 50.00 female Age x gender interaction male 45.00 40.0095% CI control1 35.00 30.00 25.00 20.00 20-25 40-45 60-65 age 90
    91. 91. 42.00 Age main effect 40.00 38.0095% CI control1 36.00 34.00 32.00 Error-bar graph for 30.00 Age main effect 28.00 20-25 40-45 60-65 age 91
    92. 92. Descriptives for Age main effect Descriptivescontrol1 N Mean Std. Deviation.00 20-25 20 39.1000 5.250561.00 40-45 20 38.5500 5.296232.00 60-65 20 33.4000 9.29289Total 60 37.0167 7.24040 92
    93. 93. Gender main effect 40.00 Error-bar graph for95% CI control1 Gender main effect 35.00 30.00 female male gender 93
    94. 94. Descriptives for Gender main effect Descriptives control1 N Mean Std. Deviation .00 female 30 42.9333 2.40593 1.00 male 30 31.1000 5.33272 Total 60 37.0167 7.24040 94
    95. 95. Descriptives for LOC by Age and Gender Descriptive StatisticsDependent Variable: control1age gender Mean Std. Deviation N.00 20-25 .00 female 43.9000 1.91195 10 1.00 male 34.3000 1.82878 10 Total 39.1000 5.25056 201.00 40-45 .00 female 43.1000 2.02485 10 1.00 male 34.0000 3.01846 10 Total 38.5500 5.29623 202.00 60-65 .00 female 41.8000 2.89828 10 1.00 male 25.0000 4.13656 10 Total 33.4000 9.29289 20Total .00 female 42.9333 2.40593 30 1.00 male 31.1000 5.33272 30 Total 37.0167 7.24040 60 95
    96. 96. Tests of between-subjects effects Tests of Between-Subjects EffectsDependent Variable: control1 Type III SumSource of Squares df Mean Square F Sig.Corrected Model 2681.483a 5 536.297 70.377 .000Intercept 82214.017 1 82214.017 10788.717 .000age 395.433 2 197.717 25.946 .000gender 2100.417 1 2100.417 275.632 .000age * gender 185.633 2 92.817 12.180 .000Error 411.500 54 7.620Total 85307.000 60Corrected Total 3092.983 59 a. R Squared = .867 (Adjusted R Squared = .855) 96
    97. 97. InteractionsIV1 = Separate lines for morning and eveningexercise.IV2 = Light and heavy exerciseDV = Av. hours of sleep per night
    98. 98. Interactions
    99. 99. Interactions
    100. 100. Mixed design ANOVA (SPANOVA) Independent groups (e.g., males and females) with repeated measures on each group (e.g., word recall under three different character spacing conditions (Narrow, Medium, Wide)). Since such experiments have mixtures of between-subject and within-subject factors they are said to be of mixed design Since output is split into two tables of effects, this is also said to be split-plot ANOVA (SPANOVA) 100
    101. 101. Mixed design ANOVA (SPANOVA)IV1 is between-subjects (e.g., Gender)IV2 is within-subjects (e.g., SocialSatisfaction and Campus Satisfaction)Of interest are:Main effect of IV1Main effect of IV2Interaction b/w IV1 and IV2If significant effects are found and morethan 2 levels of an IV are involved, thenspecific contrasts are required, either:A priori (planned) contrastsPost-hoc contrasts 101
    102. 102. Mixed design ANOVA (SPANOVA) An experiment has two IVs: Between-subjects = Gender (Male or Female) - varies between subjects Within-subjects = Spacing (Narrow, Medium, Wide) Gender - varies within subjects 102
    103. 103. Mixed design ANOVA: DesignIf A is Gender and B is Spacing the Reading experiment is of the type A X (B) or 2 x (3)Brackets signify a mixed design with repeated measures on Factor B 103
    104. 104. Mixed design ANOVA: AssumptionsNormalityHomogeneity of varianceSphericityHomogeneity of inter-correlations 104
    105. 105. Homogeneity of intercorrelationsThe pattern of inter-correlations among the various levels of repeated measure factor(s) should be consistent from level to level of the Between-subject Factor(s)The assumption is tested using Box’s MstatisticHomogeneity is present when the Mstatistic is NOT significant at p > .001. 105
    106. 106. Mixed design ANOVA: ExampleDo satisfaction levels vary between gender for education and teaching? 106
    107. 107. EDUCAT 3.80 TEACHG 3.75 3.70Mean 3.65 3.60 3.55 Male Female gender 107
    108. 108. Tests of within-subjects contrasts Tests of Within-Subjects ContrastsMeasure: MEASURE_1 Type III SumSource SATISF of Squares df Mean Square F Sig.SATISF Linear 3.262 1 3.262 22.019 .000SATISF * GENDER Linear 1.490E-02 1 1.490E-02 .101 .751Error(SATISF) Linear 88.901 600 .148 108
    109. 109. Tests of between-subjects effects Tests of Between-Subjects EffectsMeasure: MEASURE_1Transformed Variable: Average Type III SumSource of Squares df Mean Square F Sig.Intercept 16093.714 1 16093.714 29046.875 .000GENDER 3.288 1 3.288 5.934 .015Error 332.436 600 .554 109
    110. 110. 1. genderMeasure: MEASURE_1 95% Confidence Intervalgender Mean Std. Error Lower Bound Upper Bound0 Male 3.630 .032 3.566 3.6931 Female 3.735 .029 3.679 3.791 2. satisfMeasure: MEASURE_1 95% Confidence Intervalsatisf Mean Std. Error Lower Bound Upper Bound1 3.735 .022 3.692 3.7782 3.630 .027 3.578 3.682 110
    111. 111. What is ANCOVA?Analysis of CovarianceExtension of ANOVA,using ‘regression’ principlesAssesses effect ofone variable (IV) onanother variable (DV)after controlling for a third variable(CV) 111
    112. 112. ANCOVA (Analysis of Covariance)A covariate IV is added to an ANOVA(can be dichotomous or metric)Effect of the covariate on the DV isremoved (or partialled out)(akin to Hierarchical MLR)Of interest are:Main effects of IVs and interaction termsContribution of CV (akin to Step 1 in HMLR)e.g., GPA is used as a CV, when analysing whetherthere is a difference in Educational Satisfactionbetween Males and Females. 112
    113. 113. Why use ANCOVA?Reduces variance associated withcovariate (CV) from the DV error(unexplained variance) termIncreases power of F-testMay not be able to achieveexperimental control over a variable(e.g., randomisation), but canmeasure it and statistically controlfor its effect. 113
    114. 114. Why use ANCOVA?Adjusts group means to what theywould have been if all Ps had scoredidentically on the CV.The differences between Ps on theCV are removed, allowing focus onremaining variation in the DV due tothe IV.Make sure hypothesis (hypotheses) is/areclear. 114
    115. 115. Assumptions of ANCOVAAs per ANOVANormalityHomogeneity of Variance (useLevene’s test) a Levenes Test of Equality of Error Variances Dependent Variable: achievement F df1 df2 Sig. .070 1 78 .792 Tests the null hypothesis that the error variance of the dependent variable is equal across groups. a. Design: Intercept+MOTIV+TEACH 115
    116. 116. Assumptions of ANCOVAIndependence of observationsIndependence of IV and CVMulticollinearity - if more than oneCV, they should not be highlycorrelated - eliminate highlycorrelated CVsReliability of CVs - not measuredwith error - only use reliable CVs 116
    117. 117. Assumptions of ANCOVACheck for linearity between CV &DV - check via scatterplot andcorrelation.If the CV is not correlated withthe DV there is no point in usingit. 60 50 40 30 20 achievement 10 0 -2 0 motivation 2 4 6 8 10 12 117
    118. 118. Assumptions of ANCOVAHomogeneity of regressionAssumes slopes of regression linesbetween CV & DV are equal for each levelof IV, if not, don’t proceed with ANCOVACheck via scatterplot with lines of best fit 118
    119. 119. Assumptions of ANCOVA 60 50 40 30 20 achievement Teaching Method 10 conservative 0 innovative -2 0 2 4 6 8 10 12 motivation 119
    120. 120. ANCOVA example 1:Does education satisfaction differ between people with different levels of coping(‘Not coping’, ‘Just coping’ and ‘Coping well’) with average grade as a covariate? 120
    121. 121. 200100 Std. Dev = 1.24 Mean = 4.6 0 N = 584.00 0.0 1.0 2.0 3.0 4.0 5.0 6.0 7.0 Overall Coping 121
    122. 122. COPEX Coping Cumulative Frequency Percent Valid Percent PercentValid 1.00 Not Coping 94 15.4 16.1 16.1 2.00 Coping 151 24.7 25.9 42.0 3.00 Coping Well 338 55.3 58.0 100.0 Total 583 95.4 100.0Missing System 28 4.6Total 611 100.0 122
    123. 123. Descriptive StatisticsDependent Variable: EDUCATCOPEX Coping Mean Std. Deviation N1.00 Not Coping 3.4586 .6602 832.00 Just Coping 3.6453 .5031 1293.00 Coping Well 3.8142 .4710 300Total 3.7140 .5299 512 123
    124. 124. Tests of Between-Subjects EffectsDependent Variable: EDUCAT Type III SumSource of Squares df Mean Square F Sig.Corrected Model 11.894a 3 3.965 15.305 .000Intercept 302.970 1 302.970 1169.568 .000AVGRADE 2.860 1 2.860 11.042 .001COPEX 7.400 2 3.700 14.283 .000Error 131.595 508 .259Total 7206.026 512Corrected Total 143.489 511 a. R Squared = .083 (Adjusted R Squared = .077) 124
    125. 125. ANCOVA Example 2: Does teaching method affect academic achievement after controlling for motivation?IV = teaching methodDV = academic achievementCV = motivationExperimental design - assume students randomly allocated to different teaching methods. 125
    126. 126. ANCOVA example Teachin g Method (IV) AcademiMotivat c ion Achieve (CV) ment (DV) 126
    127. 127. ANCOVA example Teaching Method Academic Achieveme nt Motivation 127
    128. 128. ANCOVA example 2 Tests of Between-Subjects EffectsDependent Variable: achievement Type III SumSource of Squares df Mean Square F Sig. Eta SquaredCorrected Model 189.113a 1 189.113 1.622 .207 .020Intercept 56021.113 1 56021.113 480.457 .000 .860TEACH 189.113 1 189.113 1.622 .207 .020Error 9094.775 78 116.600Total 65305.000 80Corrected Total 9283.888 79 a. R Squared = .020 (Adjusted R Squared = .008) A one-way ANOVA shows a non-significant effect for teaching method (IV) on academic achievement (DV) 128
    129. 129. ANCOVA example 2 Tests of Between-Subjects Effects Dependent Variable: achievement Type III Sum Source of Squares df Mean Square F Sig. Eta Squared Corrected Model 3050.744a 2 1525.372 18.843 .000 .329 Intercept 2794.773 1 2794.773 34.525 .000 .310 MOTIV 2861.632 1 2861.632 35.351 .000 .315 TEACH 421.769 1 421.769 5.210 .025 .063 Error 6233.143 77 80.950 Total 65305.000 80 Corrected Total 9283.888 79 a. R Squared = .329 (Adjusted R Squared = .311)An ANCOVA is used to adjust for differences inmotivationF has gone from 1 to 5 and is significant because theerror term (unexplained variance) was reduced byincluding motivation as a CV. 129
    130. 130. ANCOVA & hierarchical MLRANCOVA is similar to hierarchicalregression – assesses impact of IVon DV while controlling for 3rdvariable.ANCOVA more commonly used ifIV is categorical. 130
    131. 131. Summary of ANCOVAUse ANCOVA in survey researchwhen you can’t randomly allocateparticipants to conditionse.g., quasi-experiment, or controlfor extraneous variables.ANCOVA allows us to statisticallycontrol for one or more covariates. 131
    132. 132. Summary of ANCOVADecide which variable(s) are IV, DV & CV.Check assumptions:normalityhomogeneity of variance (Levene’s test)Linearity between CV & DV (scatterplot)homogeneity of regression (scatterplot –compares slopes of regression lines)Results – does IV effect DV aftercontrolling for the effect of the CV? 132
    133. 133. Effect sizesThree effect sizes are relevant toANOVA:Eta-square (η ) provides an overall 2test of size of effectPartial eta-square (η p2 ) provides anestimate of the effects for each IV.Cohen’s d: Standardised differencesbetween two means. 133
    134. 134. Effect Size: Eta-squared (η 2)Analagous to R2 from regression= SSbetween / SStotal = SSB / SST= prop. of variance in Y explained by X= Non-linear correlation coefficient= prop. of variance in Y explained by XRanges between 0 and 1 134
    135. 135. Effect Size: Eta-squared (η 2)Interpret as for r2 or R2Cohens rule of thumb for interpreting η2: .01 is small .06 medium .14 large 135
    136. 136. ANOVA ontrol1 Sum of Squares df Mean Square F Sig. etween Groups 395.433 2 197.717 4.178 .020Within Groups 2697.550 57 47.325 otal 3092.983 59a percentage: 12.8% of the total variance in control is ex 136
    137. 137. Effect Size: Eta-squared (η 2)The eta-squared column in SPSS F-tableoutput is actually partial eta-squared (ηp2).Partial eta-squared indicates the size of effectfor each IV (also useful).η2 is not provided by SPSS – calculateseparately: = SSbetween / SStotal = prop. of variance in Y explained by XR2 at the bottom of SPSS F-tables is the linear effect as per MLR – ifan IV has 3 or more non-interval levels, this won’t equate with η2. 137
    138. 138. Results - Writing up ANOVAEstablish clear hypotheses – one for each main or interaction or covariate effectTest the assumptions, esp. LOM, normality and n for each cell, homogeneity of variance, Boxs M, SphericityPresent the descriptive statistics (M, SD, skewness, and kurtosis in a table, with marginal totals)Present a figure to illustrate the data (bar, error-bar, or line graph) 138
    139. 139. Results - Writing up ANOVAReport on test results – Size, direction and significance (F, p, partial eta-squared)Conduct planned or post-hoc testing as appropriate, with pairwise effect sizes (Cohens d)Indicate whether or not results support hypothesis (hypotheses) 139
    140. 140. SummaryHypothesise each main effect and interaction effect.F is an omnibus “gateway” test; may require follow-up tests.Conduct follow-up tests where sig. main effects have three or more levels. 140
    141. 141. SummaryChoose from mixed-design ANOVA or ANCOVA for lab reportRepeated measure designs include the assumption of sphericity 141
    142. 142. SummaryReport on the size of effects potentially using: Eta-square (η ) as the omnibus ES 2 Partial eta-square (ηp ) for each IV 2 Standardised mean differences for the differences between each pair of means (e.g., Cohens d) 142
    143. 143. Open Office ImpressThis presentation was made usingOpen Office Impress.Free and open source software.http://www.openoffice.org/product/impress.html 143
    1. A particular slide catching your eye?

      Clipping is a handy way to collect important slides you want to go back to later.

    ×